OPUS - an open source parallel corpus
Time: 2020-10-22 22:06:29
Web Name: OPUS - an open source parallel corpus
WebSite: http://opus.lingfil.uu.se
ID:91056
Keywords: open, an, OPUS
Description:
2020-06-30: New: OPUS-100 corpus
2020-05-22: New: ELRC public
2019-10-16: New: MultiParaCrawl
2019-10-14: New: Infopankki v1
2019-09-28: Update: ParaCrawl v5
2019-08-28: JW300 corpus added
2019-08-14: Various new and updated corpora
2018-10-06: New corpus: memat (Xhosa/English)
2018-02-15: New corpora: ParaCrawl, XhosaNavy
2017-11-06: New version: OpenSubtitles2018
2017-11-01: New URL: http://opus.nlpl.eu
2016-01-08: New version: OpenSubtitles2016

OPUS is a growing collection of translated texts from the web. In the OPUS project we try to convert and align free online data, to add linguistic annotation, and to provide the community with a publicly available parallel corpus. OPUS is based on open source products and the corpus is also delivered as an open content package. We used several tools to compile the current collection. All pre-processing is done automatically. No manual corrections have been carried out.

The OPUS collection is growing! Check this page from time to time to see new data arriving ... Contributions are very welcome! Please contact jorg.tiedemann@helsinki.fi

Parallel Data, Tools and Interfaces in OPUS. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'2012)
Jörg Tiedemann, 2016a, OPUS - Parallel Corpora for Everyone. In Baltic Journal of Modern Computing (BJMC), Vol 4, No. 2, Special Issue: Proceedings of the 19th Annual Conference of the European Association of Machine Translation (EAMT), 2016
Jörg Tiedemann, 2016b, Finding Alternative Translations in a Large Corpus of Movie Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016), 2016.
Pierre Lison and Jörg Tiedemann, 2016, OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016), 2016.
Raivis Skadiņš, Jörg Tiedemann, Roberts Rozis and Daiga Deksne, 2014, Billions of Parallel Words for Free. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'2014), Reykjavik, Iceland
Jörg Tiedemann, 2009, News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces. In N. Nicolov and K. Bontcheva and G. Angelova and R. Mitkov (eds.) Recent Advances in Natural Language Processing (vol V), pages 237-248, John Benjamins, Amsterdam/Philadelphia
Jörg Tiedemann, 2011, Bitext Alignment, Synthesis Lecture on HLT, Morgan & Claypool Publishers
Jörg Tiedemann, 2008, Synchronizing Translated Movie Subtitles. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC'2008)
Jörg Tiedemann, 2007, Building a Multilingual Parallel Subtitle Corpus. In Proceedings of CLIN 17, Leuven, Belgium, 2007.
Jörg Tiedemann, 2007, Improved Sentence Alignment for Movie Subtitles. In Proceedings of RANLP '07, Borovets, Bulgaria, 2007.
Jörg Tiedemann, unpublished, OPUS - an open source parallel corpus. In Proceedings of the 13th Nordic Conference on Computational Linguistics, University of Iceland, Reykjavik, 2003.
Jörg Tiedemann, Lars Nygaard, 2004, The OPUS corpus - parallel & free. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC'04). Lisbon, Portugal, May 26-28.