Found in Translation

Abstract

We present a complete working system that gathers multilingual news items from the Web, translates them into English, categorises them by topic and geographic location and presents them to the final user in a uniform way. Currently, the system crawls 560 news outlets, in 22 different languages, from the 27 European Union countries. Data gathering is based on RSS crawlers, machine translation on Moses and the text categorisation on SVMs. The system also presents on a European map statistical information about the amount of attention devoted to the various topics in each of the 27 EU countries. The integration of Support Vector Machines, Statistical Machine Translation, Web Technologies and Computer Graphics delivers a complete system where modern Statistical Machine Learning is used at multiple levels and is a crucial enabling part of the resulting functionality.

Cite

Text

Turchi et al. "Found in Translation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009. doi:10.1007/978-3-642-04174-7_55

Markdown

[Turchi et al. "Found in Translation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2009.](https://mlanthology.org/ecmlpkdd/2009/turchi2009ecmlpkdd-found/) doi:10.1007/978-3-642-04174-7_55

BibTeX

@inproceedings{turchi2009ecmlpkdd-found,
  title     = {{Found in Translation}},
  author    = {Turchi, Marco and Flaounas, Ilias N. and Ali, Omar and De Bie, Tijl and Snowsill, Tristan and Cristianini, Nello},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2009},
  pages     = {746-749},
  doi       = {10.1007/978-3-642-04174-7_55},
  url       = {https://mlanthology.org/ecmlpkdd/2009/turchi2009ecmlpkdd-found/}
}