Abstract:
Internet is a source of live data that is constantly updating with data of almost any field
we can imagine. Having tools that can automatically detect these updates and can select
that information that we are interested in are becoming of utmost importance nowadays.
That is the reason why I focus on some economic websites, studying their structures
and identifying a common type of website in this field: Dynamic Websites. Even when
there are many tools that allow to extract information from the Internet, not many tackle
these kind of websites. For this reason I study and implement some tools that allow the
developers to address these pages from a different perspective.Web scraping refers to a
software program that mimics human web surfing behavior by pointing to a website and
collecting large amounts of data that would otherwise be difficult for a human to extract.
A typical program will extract both unstructured and semi-structured data, as well as
images, and convert the data into a structured format.