-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathDesktop_SearchEngine.tex
More file actions
48 lines (38 loc) · 1.51 KB
/
Desktop_SearchEngine.tex
File metadata and controls
48 lines (38 loc) · 1.51 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
\chapter{Desktop Search Engines}
\label{chap:Desktop_SearchEngine}
\url{http://en.wikipedia.org/wiki/List_of_search_engines\#Desktop_search_engines}
\section{Google Desktop}
\label{sec:Google_Desktop}
Google Desktop is a discontinued product, due to the huge shift from local to
cloud-based storage and computing, as well as the integration of search and
gadget functionality into the modern operating system (O/S).
\begin{enumerate}
\item Email messages
\item Computer files
\item Music
\item Photos
\item Chats
\item Web pages viewed
\item Google gadget on user's desktop sidebar of Windows
\end{enumerate}
To display on Linux's desktop sidebar, use
\url{https://code.google.com/p/google-gadgets-for-linux/}
\section{DocFetcher}
\label{sec:DocFetcher}
DocFetcher is open-source multi-platform, written in Java with standard Widget
Toolkit for GUI. The searching and indexing capabilities of DocFetcher is based
on Apache Lucene (Sect.\ref{sec:Lucene}).
It can parse text from documents in
\begin{enumerate}
\item formats: PDF, HTML, EPUB, MS Office, OpenOffice
\item zip file: zip, 7z, rar, tar.*
\item Outlook emails: PST files
\item customized to search in any kind of source-code files
\end{enumerate}
It also
\begin{itemize}
\item automatically updates the indexes (when files are modified)
\item feature to exclude files from being indexed using regular expressions
\end{itemize}
\url{http://en.wikipedia.org/wiki/DocFetcher}
\section{Find and Run Robot}