𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Wrapper induction: Efficiency and expressiveness

✍ Scribed by Nicholas Kushmerick


Publisher
Elsevier Science
Year
2000
Tongue
English
Weight
519 KB
Volume
118
Category
Article
ISSN
0004-3702

No coin nor oath required. For personal study only.

✦ Synopsis


The Internet presents numerous sources of useful information-telephone directories, product catalogs, stock quotes, event listings, etc. Recently, many systems have been built that automatically gather and manipulate such information on a user's behalf. However, these resources are usually formatted for use by people (e.g., the relevant content is embedded in HTML pages), so extracting their content is difficult. Most systems use customized wrapper procedures to perform this extraction task. Unfortunately, writing wrappers is tedious and error-prone. As an alternative, we advocate wrapper induction, a technique for automatically constructing wrappers. In this article, we describe six wrapper classes, and use a combination of empirical and analytical techniques to evaluate the computational tradeoffs among them. We first consider expressiveness: how well the classes can handle actual Internet resources, and the extent to which wrappers in one class can mimic those in another. We then turn to efficiency: we measure the number of examples and time required to learn wrappers in each class, and we compare these results to PAC models of our task and asymptotic complexity analyses of our algorithms. Summarizing our results, we find that most of our wrapper classes are reasonably useful (70% of surveyed sites can be handled in total), yet can rapidly learned (learning usually requires just a handful of examples and a fraction of a CPU second per example).


πŸ“œ SIMILAR VOLUMES


FAS/FAS ligand expression and induction
✍ Sanshiro Hashimoto; Morteza Setareh; Robert L. Ochs; Martin Lotz πŸ“‚ Article πŸ“… 1997 πŸ› John Wiley and Sons 🌐 English βš– 795 KB

Objective. To examine the expression of FasIFas ligand and the role of this ligand/receptor interaction in the regulation of apoptosis in normal human articular chondrocytes and in osteoarthritis (OA) cartilage. Methods. Normal and OA human knee cartilage and cells isolated from these tissues were

Induction of macrophage procoagulant exp
✍ Helen R. Wheeler; Carolyn L. Geczy πŸ“‚ Article πŸ“… 1990 πŸ› John Wiley and Sons 🌐 French βš– 724 KB

Cisplatin, doxorubicin and daunorubicin (drugs which intercalate with DNA) influenced the membrane-bound procoagulant potential of murine thioglycollate-induced peritoneal exudate (TG-PEC) macrophages and the monocytoid cell line WEHl 265, whereas the antimetabolites 5-fluorouracil and methotrexate

Simple and efficient inductive telemetry
✍ Jordi SacristΓ‘n-Riquelme; Fredy Segura; M. Teresa OsΓ©s πŸ“‚ Article πŸ“… 2008 πŸ› Elsevier Science 🌐 English βš– 451 KB

This paper presents an inductive telemetry system useful for isolated applications that have to be powered and/or controlled from outside. The transmitter is based on a Class-E amplifier with ASK modulation generated with a resonant modulation to obtain the different levels to transmit. In the recei

Power factor and efficiency characterist
✍ Yoshiyuki Shibata; Nuio Tsuchida; Koji Imai πŸ“‚ Article πŸ“… 1999 πŸ› John Wiley and Sons 🌐 English βš– 203 KB

We have developed a unique induction motor that has free rotating magnets inside its rotor. In experimental results for a prototype motor, we have obtained superior performance over existing motors in terms of the power factor, efficiency, and output torque. The experimental details are as follows.