W4F wrapper architecture
Diagram labels: World Wide Web, HTML page, Parser, DOM tree, Retrieval Rules, Extraction Rules, Extraction Engine, Extraction Wizard, NSL, Mapping Rules, Mapping Wizard, XML document

Mapping to Java objects
title: String
genre: String[]
cast: Actor[]
The Java objects can now be used by any Java application.

Mapping to XML
<MOVIE>
  <TITLE>Casablanca</TITLE>
  <GENRE>Drama, War, Romance</GENRE>
  <CAST>
    <ACTOR>Humphrey Bogart</ACTOR>
    <ACTOR>Ingrid Bergman</ACTOR>
    ...
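
The two mapping targets on this slide (Java objects and XML) can be illustrated with a small sketch. It is not W4F's API or generated code: the classes `Movie`, `Actor`, and `MappingSketch` and the methods `fromExtracted` and `toXml` are hypothetical names chosen for illustration, and plain Java lists stand in for the extracted values. It only shows the shape of the data: a `String` title, a `String[]` of genres, and an `Actor[]` cast, rendered with the element names used in the XML fragment.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical target classes (assumption): the slide names only the field
// types String, String[] and Actor[], not the classes themselves.
final class Actor {
    final String name;

    Actor(String name) {
        this.name = name;
    }
}

final class Movie {
    final String title;   // String
    final String[] genre; // String[]
    final Actor[] cast;   // Actor[]

    Movie(String title, String[] genre, Actor[] cast) {
        this.title = title;
        this.genre = genre;
        this.cast = cast;
    }

    // Builds a Movie from already-extracted values. Plain lists are used here
    // as a stand-in for whatever the extraction step actually returns.
    static Movie fromExtracted(String title, List<String> genres, List<String> actorNames) {
        Actor[] cast = new Actor[actorNames.size()];
        for (int i = 0; i < cast.length; i++) {
            cast[i] = new Actor(actorNames.get(i));
        }
        return new Movie(title, genres.toArray(new String[0]), cast);
    }

    // Renders the same data as XML, using the element names from the slide.
    String toXml() {
        StringBuilder sb = new StringBuilder();
        sb.append("<MOVIE>\n");
        sb.append("  <TITLE>").append(escape(title)).append("</TITLE>\n");
        sb.append("  <GENRE>").append(escape(String.join(", ", genre))).append("</GENRE>\n");
        sb.append("  <CAST>\n");
        for (Actor actor : cast) {
            sb.append("    <ACTOR>").append(escape(actor.name)).append("</ACTOR>\n");
        }
        sb.append("  </CAST>\n");
        sb.append("</MOVIE>\n");
        return sb.toString();
    }

    // Escapes the characters that are significant in XML text content.
    private static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }
}

final class MappingSketch {
    public static void main(String[] args) {
        Movie movie = Movie.fromExtracted(
                "Casablanca",
                Arrays.asList("Drama", "War", "Romance"),
                Arrays.asList("Humphrey Bogart", "Ingrid Bergman"));
        System.out.print(movie.toXml());
    }
}
```

Once the values live in ordinary objects like these, any Java application can consume them directly, while `toXml` covers the case where an XML document is the desired output.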