9f5e594fb2a37a67064f8200db59bdbfff861db0
[bsc-thesis1415.git] / thesis2 / abstract.tex
1 Within the leisure activity field, information is often bundled badly and
2 contains empty or wrong data. Hyperleap tries to solve this problem by bundling
3 the information from various sources including RSS feeds. Currently the
4 feedback loop for fixing site-specific crawlers requires multiple steps which
5 demand someone with a computer science background to perform. We introduce a
6 new adaptable crawler generation system using subword matching via an adapted
7 form of directed acyclic word graphs. The application allows users with no
8 particular computer science background to create, edit and test crawlers for
9 RSS feeds. In this way the feedback loop for broken crawlers is shortened, new
10 sources can be incorporated in the database quicker and, most importantly, the
11 information about the latest movie show, theater production or conference will
12 reach the people looking for it as fast as possible.