Getting Started with Tovek Tools
FIND IT. UNDERSTAND IT. USE IT.

Tovek Tools Introduction

How to install Tovek Tools?
How to connect information sources where I want to search?
How to search in information sources?
May I view documents in different contexts?
May I identify the key topics in searched documents?
May I create a more complex query that describes a specific topic?
May I connect Tovek Tools and Analyst’s Notebook?
What options are offered by the Tovek Engine Query Language?

www.tovek.com

This brochure is a simple guide for quick orientation in Tovek Tools, intended for users who have no experience with the product. More detailed information can be found in the product documentation or help. The brochure focuses on product installation, on connecting and indexing information sources, and on information retrieval. Advanced analytical functions are covered only in part. Thorough knowledge of the product requires completing product training or studying the documentation in depth.

How to install Tovek Tools?

1. Run the installation file (ttxxx.exe).
2. Select the language of the application.
[Screenshot: Select Language.]
3. Select the folder in which you want to install the application. Keeping the default settings is recommended.
4. In the next window, select Specify a New One and load the license from a file.
[Screenshot: Add a license.]
5. Select the Typical installation. To choose individual applications for installation, or to install the CTK demo data, select Custom setup.
6. Run the installation.

How to connect information sources where I want to search?

The Index Manager application prepares the information sources needed for searching. It indexes various types of documents, database records, or e-mails, and it can also connect existing full-text sources.

1. At the end of the installation, select Run Index Manager and index documents, or go to Start > Programs > Tovek Tools > Index Manager.
2. Select the documents that should be indexed. After selecting the documents, you can specify the name of the index, its description, its location, and the language of the indexed documents.
3. If you chose to install the demo data, you are automatically connected to the information sources Demo.CS and Demo.EN, which contain a selection of Czech and English articles from CTK (Czech News Agency) from June 2009.

[Screenshot: Selecting types of documents for indexing. The user indexes files on his computer in the directories Firma and Moje. The left side shows an overview of all connected full-text sources.]

© 2011 TOVEK

How to search in information sources?

The Tovek Agent application searches the information sources.

1. Run Tovek Agent.
2. For easier work, we recommend the Topic Agents View (Tools -> Options -> Layout Window).
[Screenshot: Topic Agents View.]
3. Sources selection. In the left panel, select the information sources in which you want to search. The three buttons at the bottom select all sources, select none, or update the sources. In the screenshot, the user chose to search the sources ABCnews and BBCnews.
4. Adding a query. Start with a simple query, e.g. one or two words separated by commas. Enter complex queries only after you are familiar with all the possibilities of the query language.
5. Search results. After you run the query, the list of results appears in the bottom window.
Clicking a document displays it with the keywords highlighted. The bottom right corner shows the number of documents found and the total number of documents searched.

In the screenshot, the user entered the query: Afghanistan, Bin Laden
The result list shows the documents sorted by their relevance to the entered query. After you click a document, its content is shown at the bottom with the keywords highlighted.
Note: 400 documents were displayed and 408 were found. In total, 291 351 documents were searched.
[Screenshot labels: Query; Overview of search conditions; Inserting, editing, and deleting conditions; Selection of sources; Result list sorted by relevance to the query; Viewing documents with highlighted keywords.]

6. Query in time. For better orientation in the results, you can use the Query in Time function (Tools -> Query in time, or Ctrl + Q). This function displays the number of documents found on a timeline.
[Screenshot: Query in time shows the number of documents found over time.]
7. Export of results. Tovek Agent offers several output formats for the documents found (txt, html, xml). To export documents, select them with the left mouse button (hold Shift or Ctrl to select several). Then, from the Tools menu, select Html/Xml Export…
[Screenshot: Selecting documents for export to the chosen format.]
[Screenshot: Export of the selected documents in HTML format.]

May I view documents in different contexts?

The InfoRating application displays documents in different contexts. It shows the links between the content of the documents found and the defined topics.

1. Importing documents. In Tovek Agent, export the selected documents to the InfoRating application (in a similar way as the export to HTML).
2. Topics for contextual analysis.
In the left section of the application, right-click and select the option for entering more queries (the queries may also be more complex). In the screenshot, the user imported the chosen documents into the InfoRating application, then chose Entering more queries and defined the following topics:
coal
gas
gold
uranium
russia
ukraine
usa
[Screenshot: Topics which I’m interested in.]
3. Contextual analysis view. After you enter the queries, the results are displayed as a contextual analysis, that is, according to the relationship between the content of the documents found and the defined topics/questions. The results can be viewed as a matrix, a chart, or a graph. The icons on the right side let you modify the graphical output.
[Screenshot: View of results in contextual analysis. The results are shown as a matrix. The user found that USA and Russia are mentioned together in 21 documents. Labels: Icons for editing graphical output; Defined topics; The intersection of queries illustrates the relations between the content of documents and the defined topics; List of documents and their relation to defined topics.]
4. Export of results. Like Tovek Agent, InfoRating can export the results of the analysis into various formats.
[Screenshot: The results of contextual analysis can be displayed in a diagram.]
[Screenshot: The results of contextual analysis can be displayed on a timeline.]

May I identify the key topics in searched documents?

The Harvester application automatically identifies the key topics in the documents.

1. Import of documents. In Tovek Agent, export the selected documents to the Harvester application (in a similar way as the export to HTML).
2. Content analysis. Harvester automatically identifies important topics and the links between them. The list of keywords shows trends, scores, the number of pairs, and the number of documents in which each keyword occurs.
3.
Export of results. Like Tovek Agent, Harvester can export the results in various formats.
[Screenshot: The user ran a content analysis, which automatically identified the keywords in the respective documents. Labels: Topics and links between them are automatically identified; Trends found in the vicinity of defined topics; Words found in the vicinity of defined topics; List of documents relevant to selected topics.]

May I create a more complex query that describes a specific topic?

The Query Editor application is used to create more complex queries. It is intended mainly for advanced users who know the Tovek Engine Query Language. More complex queries (topics) let you specify all the words, phrases, and other search features in a structure that describes the topic. Queries created in Query Editor can be used in the other Tovek Tools applications.
[Screenshot: Complex query (topic) created by an expert with knowledge of the Tovek Engine Query Language.]

May I connect Tovek Tools and Analyst’s Notebook?

Only for users of the Analyst’s Notebook product: the connection between Tovek Tools and Analyst’s Notebook is provided by the Fulltext Plug-in for Analyst’s Notebook. Through this link, users can view and analyze textual information in Analyst’s Notebook.

What options are offered by the Tovek Engine Query Language?

The following example illustrates the basic operators of the Tovek Engine Query Language.

Example
Find all documents that contain the word RUSSIA with one of the following words in its vicinity: URANIUM, GAS, COAL (LIGNITE). The documents should not contain the word GOLD.

Query
.NEAR/10 (russia, (uranium, gas, (coal .OR lignite))) .AND .NOT gold

Operators used by the Query Language
russia: finds all word forms of russia, e.g. russian, russians
(putting the word in quotes, "russia", searches only for the base form)
coal .OR lignite: matches either of the two words.
uranium, gas: the comma between the words means .BEST, which corresponds to the logical .OR and, in addition, increases the weight of documents in which the words appear more than once.
.NEAR/10 (russia, uranium): the distance between these words must not exceed 10 words (in general, n words for .NEAR/n).
.AND .NOT gold: excludes from the list all documents containing the word gold.
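
To make the logic of the example query easier to follow, here is a minimal Python sketch that imitates it on plain text. It is not the Tovek engine and does not use any Tovek API. The function names (tokens, positions, near, matches) are hypothetical, word forms are approximated by a crude prefix match, the distance is taken as the difference between word positions, and the relevance weighting of .BEST is not modelled at all.

```python
import re


def tokens(text):
    """Split text into lowercase words (a crude stand-in for real linguistic analysis)."""
    return re.findall(r"[a-z]+", text.lower())


def positions(words, stem):
    """Return the indexes of words that start with `stem`.

    Assumption: prefix matching roughly imitates "all word forms"
    (russia -> russian, russians). A real engine is far more precise.
    """
    return [i for i, word in enumerate(words) if word.startswith(stem)]


def near(words, stem_a, stems_b, distance=10):
    """True if any stem in stems_b occurs within `distance` words of stem_a."""
    pos_a = positions(words, stem_a)
    pos_b = [p for stem in stems_b for p in positions(words, stem)]
    return any(abs(a - b) <= distance for a in pos_a for b in pos_b)


def matches(text):
    """Imitate: .NEAR/10 (russia, (uranium, gas, (coal .OR lignite))) .AND .NOT gold"""
    words = tokens(text)
    close = near(words, "russia", ["uranium", "gas", "coal", "lignite"], 10)
    has_gold = bool(positions(words, "gold"))
    return close and not has_gold


if __name__ == "__main__":
    samples = [
        "Russia increased gas exports to Europe",
        "Russian gas and gold exports",
        "Ukraine imports coal",
    ]
    for sample in samples:
        print(sample, "->", matches(sample))
```

The sketch only answers yes or no for each text. The real query language also ranks the documents found by relevance, which is what the comma (.BEST) operator influences.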