Web Engineering Web Engineering
Transcription
Web Engineering Web Engineering
News to discuss? Internet May Be Just A Fad, Says European Body http://www.newsbytes.com/news/00/158904.html Web Engineering Prof. Dr. Dr. h.c. mult. Gerhard Krüger, Albrecht Schmidt An interim report for the Economic and Social Research Council (ESRC), says that the development of the Internet over the last decade may not be the epoch -making event that many have said.... ... The report stems from the results of a two-year investigation into the use of the Internet, undertaken by the University of Warwick, which claims that there is already some evidence that the Net may be a passing fad, as usage among British teenagers is already starting to fall from its previously high levels. Universität Karlsruhe Fakultät für Informatik Institut für Telematik The report speculates that as many as 2 million users of the Internet in the UK have stopped using it regularly because of the vast amounts of data online and the need to spend time on the Web to research that information.... Wintersemester 2000/2001 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 (04-Dez-00) page 1 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 3 Table of Contents 1. Introduction and History 2. Media and Information Web Engineering 3. Organizing Information 4. Access to Information and Navigation Chapter 3: The Web – An Information System § Methods for Information Access § Hypertext Navigation § Access Strategies for the Web 5. Describing and coding Information Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 2 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 4 Methods for Information Access ¨ addressing pages directly ¨ search/browse in catalogs ¨ search for keywords ¨ associative ¨ navigation Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Quality of Search Results ¨ recall = the ratio of documents retrieved versus the number of available documents relevant to the query, i.e., the fraction returned out of all desirable documents ¨ precision = the ratio of the number of relevant documents retrieved versus the total number of documents retrieved, or the useful fraction of what was actually retrieved ¨ see information retrieval page 5 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Search / Browse in Catalogs Addressing Pages Directly ¨ ¨ ¨ requires unique addresses ¨ § URL: Uniform Resource Locator ¨ ¨ ¨ how to get the address? § external channel, e.g. TV, radio, press, print, email, etc. § guessing the DNS-name, e.g. www.uni-karlsruhe.de, www.oracle.com, www.linux.org conventions ¨ address of the catalog must be known examples § § § § § § § E.g. index.html, default.htm , robots.txt Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 browse predefined classes search if classes/subclasses are unknown less directed, open search strategy if you can not specify what you are looking for ... How to get listed in a catalog? § automatically (submit in a certain category) § pages are manually selected, reviews, and added ¨ ¨ page 7 page 6 yahoo.com web.de lycos.de netcenter.com dmoz.org open directory project and many others ... Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 8 Search for Keywords ¨ Incremental Search § refine search results § search within results specific WWW server § parts of a search engine ¨ ¨ ¨ ¨ ¨ ¨ source: www.altavista.com www.northernlight.com § concept map § categorizing results robot/crawler indexer analyzer module database retrieval component recommendation system § other user rank content § number of links to a page § how often is the page opened when displayed § etc ... Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 9 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Advanced Search Concepts I Search using a Query Language ¨ ¨ mechanisms to search for keywords ¨ ¨ ¨ ¨ Boolean operators “fuzzy operators” e.g. NEAR search for values of certain attributes examples: www.altavista.com +domain:com specify search by § § § § § § § § § keyword matching § query language +Netscape page 11 search term term exclusion date top level domain domain hostname language country § matching (exact, fuzzy) § search term in: ¨ ¨ ¨ ¨ title body domain link + url:Java +applet:* +host:www.ibm.com +image:sharon * +confidential +title:stone link:www.teco .uni- karlsruhe.de (image:bill NEAR clinton ) OR (image:clinton NEAR bill) OR title:"Bill Clinton" Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 source: www.google.com page 10 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 12 Advanced Search Concepts II ¨ Multimedia Search searching for similar pages ¨ ¨ ¨ § images § video § audio searching for pages that link to a page searching within a certain domain à specialized search engines ¨ ¨ ¨ cached pages – no 404 source: www.google.com Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 content based on text/keywords next to content e.g. § § § § page 13 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Natural Language Search ¨ www.altavista.com www.ditto.com music.lycos.com multimedia.lycos.com page 15 Associative query a search engine in „plain“ English ¨ semantic links in hypertext § links in text (related content) § icons and images § maps ¨ different types of links ¨ different semantics source: www.askjeeves.com Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 14 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 16 Search Functionality ¨ ¨ ¨ Hypertext – Navigational Patterns providing a search function to visitors of a specific site (company web site, university site) when a users is on the site you want to help him to find what he/she is looking for two approaches § search engine or directory for the specific site ¨ ¨ customized interface detailed inclusion and exclusion rules ¨ ¨ ¨ navigational structure within the content e.g. www.useit.com e.g.: www.sun.com e.g.: www.amazon.de z.B.: www.iflyswa .com, www.hagebau.de http://www8.org/w8-papers/5b- hypertext-media/improving/improving.html page 17 Hypertext - Navigation links beyond content and semantic ¨ § landmark „Provide direct access to critical sub -systems in the Web Information System“ § set-based navigation „Organize the information in Sets of related information items. Provide intra-set navigation capabilities.“ § often combined landmark & set-based navigation § metaphorical navigation also useful instead of a missing page ¨ § text based ¨ embedded in the site Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Navigational Design Patterns [Rossi et al. 98] ¨ § using a „public“ search engine with attributes to restrict the search to this site ¨ ¨ Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 19 Text Based Navigation § up, down, next, ... § often based on icon § author builds/defines this structure ¨ tools for navigation § history, back/forward, home § bookmarks § in the user interface of the web client ¨ navigational patterns Source: http://www.useit.com Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 18 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 20 Landmark & Set-based Navigation I Source: http://www.amazon.de Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 21 Landmark & Set-based Navigation II Source: http://www.wapforum.org Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 22 Metaphorical Navigation I Source, 1998: http://www.iflyswa.com Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 23 Metaphorical Navigation II Source: http://www.hagebau.de Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 24 Table of Contents Markup ¨ roots: computerized typesetting 1. Introduction and History § typesetting macros embedded in ASCII 2. Media and Information § presentational markup ¨ ¨ 3. Organizing Information ¨ layout directives, e.g. TEX commands to define the layout semantic markup 4. Access to Information and Navigation § authors put annotations into their texts to help the publisher to understand what type of text this is (e.g. „this is a heading“) § annotations are agreed between author and publisher 5. Describing and coding Information § publisher decides on the layout § descriptive markup § § § § Markup SGML XML HTML ¨ describing content not the layout § markup to support search in documents ¨ ¨ Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Coding Documents \twocolumn[ \begin{center}\Large { A Modular Neural Network Architecture with Additional Generalization Abilities for High Dimensional Input Vectors} tex \end{center} page 25 GIF89a` P ÷ •àc ˆ„ +0 ÿ1))scÿ”{„Üo%4$1Üoa*~1 ÖÎÿÞçççï÷ï÷÷ÿ~8¾•þ`ˆ •íŸ£ÚÏ#³ó•àc ˆ„ ¯ )zˆR QÄV„Ä&˜!,ÖY ¹BCÛ ßr³j fÜQ¢ùÌ4Ífƒ?sÔCÞ¹Ã/jò†þ GIF „Ä&˜!,ÖY ¹BCÛ ß„Ä&˜!, ÖY ¹BCÛ ßÜQ¢ùÌ4ÜQ¢ùÌ4 %%EndSetup PS %%Page: 1 1 1 0 bop 1007 872 a Fl(Location\255Based) 32 b(Personal)i(Agents) 820 y Fk(\003)781 1112 y Fj(Seng)5 b(W)-8 b(ai)25 b(Lok)o(e)1392 1076 y Fi(y)1427 1112 y Fj(,)g(Andry)f(Rak)o(otonirain)o(y)2325 1076 y Fi(yy)2391 1112 y Fj(,)g(Karsten)h(Schulz)3046 {\listname ;}\listid3} RTF {\list\listtemplateid0\ {\leveltext\'01-;} {\levelnumbers;}\f0 \chbrdr\brdrnone\brdrcf1 \chshdng0\chcfpat1\chcbpa \fbias0 \fi-360\li360 <td><font face="Arial, Helvetica"> <P> In 1995/1996 I did an MSc inHTML computing at the <A HREF="http://www.mmu.ac.uk">Manchester Metropolitan University</A> in the <A HREF="http://www.doc.mmu.ac.uk"> Department of Computing</A>. Studying in <A REF="http://www.manchester.com/"> Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 words in headings are more important than in footnotes markup for machines vs. markup for humans page 26 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 27 Markup - ISO-Definitions ¨ markup § text that is added to the data of a document in order to convey information about it ¨ descriptive markup § markup that describes the structure and other attributes of a document in a non-system-specific way, independently of any processing that may be performed on it ¨ processing instruction (PI) § markup consisting of system -specific data that controls how a document is to be processed Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 28 Descriptive Markup versus Processing Instruction ¨ Editorial Structure II ¨ descriptive markup § editorial structure § content structure ¨ § editorial structure § content structure ¨ processing instructions § layout ¨ descriptive markup processing instructions § layout ¨ Example Announcement In the course Web Engineering Albrecht Schmidt will talk about ... Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Example <main-title>Announcement</main-title> <paragraph>In the course Web Engineering Albrecht Schmidt will talk about ... </paragraph> page 29 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Editorial Structure ¨ Content Structure I ¨ descriptive markup ¨ processing instructions ¨ Example <main-title>Announcement</main-title> In the course Web Engineering Albrecht Schmidt will talk about ... Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 processing instructions § layout § layout ¨ descriptive markup § editorial structure § content structure § editorial structure § content structure ¨ page 31 page 30 Example <main-title>Announcement</main-title> <paragraph>In the course <course>Web Engineering </course> Albrecht Schmidt will talk about ... </paragraph> Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 32 Processing Instruction Layout II Content Structure II ¨ ¨ descriptive markup § editorial structure § content structure § editorial structure § content structure ¨ ¨ processing instructions ¨ Example <main-title>Announcement</main-title> <paragraph>In the course <course>Web Engineering </course> <name>Albrecht Schmidt</name> will talk about ... </paragraph> Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 processinginstructions § layout § layout ¨ descriptive markup Example <head columns=1> <main-title><bold>Announcement</bold></main-title> </head> <content columns=2> <paragraph>In the course <course>Web Engineering </course> <name><italic>Albrecht Schmidt </italic> </name> will talk about ... </paragraph> </content> page 33 Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 Processing Instruction Layout I ¨ Generic vs. Specific Markup ¨ descriptive markup processinginstructions ¨ § layout ¨ Example <head columns=1> <main-title>Announcement</main-title> </head> <content columns=2> <paragraph>In the course <course>Web Engineering </course> <name>Albrecht Schmidt</name> will talk about ... </paragraph> </content> Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 layout structure § specific <italic>Web Engineering</ italic > § generic <emphasis>Web Engineering</emphasis > § editorial structure § content structure ¨ page 35 page 34 content structure § specific § generic ¨ <course-title>Web Engineering</course-title > <keyword>Web Engineering</keyword> publication structure § specific <verse> ... </verse> § generic <paragraph> ... </paragraph> Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 36 SGML ¨ Standard Generalized Markup Language § ISO-Standard [ISO/IS 8879, 1986] § developed from GML, IBM 1969 (Goldfarb, Mosher, Lorie) § distinction between content and presentation ¨ dilemma of specific markup language: § What is an appropriate set of tags? ¨ generalized markup § documents are desribed in three parts ¨ ¨ ¨ SGML declaration: mapping of the abstract SGML Syntax onto concrete characters definition of STAGO e.g. '<' or TAGC e.g. '>' or charset Document Type Definition, DTD: definition of tags and their meaning the document (content) in the markup define in the DTD § semantic of markup is context dependent Prof. Dr. Dr. h.c. mult . Gerhard Krüger, Albrecht Schmidt: Web Engineering, WS00/01 page 37