Creating Semantically Enhanced Documents with the Help of Text Mining

Héder Mihály
MTA SZTAKI ITAK

During this decade, the amount of textual content stored on the Web has become enormous, yet the basic structure of a Web document has remained unchanged: a mixture of text and markup. When creating a document, the user rarely has the opportunity to embed semantic information in the content, because editor applications either lack such a feature or make it too difficult to use. The author argues that the creation of semantically rich documents is best facilitated by a clean, simple content editor with text mining technology running in the background. The presentation puts some of these technologies in the spotlight.