OPUS - an open source parallel corpus

Web Name: OPUS - an open source parallel corpus

WebSite: http://opus.lingfil.uu.se

ID:91056

Keywords:

open,an,OPUS,

Description:

2020-06-30: New: OPUS-100 corpus 2020-05-22: New: ELRC public 2019-10-16: New: MultiParaCrawl 2019-10-14: New: Infopankki v1 2019-09-28: Update: ParaCrawl v5 2019-08-28: JW300 corpus added 2019-08-14: Various new and updated corpora 2018-10-06: New corpus: memat (Xhosa/English) 2018-02-15: New corpora: ParaCrawl, XhosaNavy 2017-11-06: New version: OpenSubtitles2018 2017-11-01: New URL: http://opus.nlpl.eu 2016-01-08: New version: OpenSubtitles2016 OPUS is a growing collection of translated texts from the web. In the OPUS project we try to convert and align free online data, to add linguistic annotation, and to provide the community with a publicly available parallel corpus. OPUS is based on open source products and the corpus is also delivered as an open content package. We used several tools to compile the current collection. All pre-processing is done automatically. No manual corrections have been carried out. The OPUS collection is growing! Check this page from time to time to see new data arriving ... Contributions are very welcome! Please contact jorg.tiedemann helsinki.fi Search download resources: Parallel Data, Tools and Interfaces in OPUS. [pdf] In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'2012) J rg Tiedemann, 2016a OPUS - Parallel Corpora for Everyone. In Baltic Journal of Modern Computing (BJMC), Vol 4, No. 2, Special Issue: Proceedings of the 19th Annual Conference of the European Association of Machine Translation (EAMT), 2016 J rg Tiedemann, 2016b Finding Alternative Translations in a Large Corpus of Movie Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016), 2016. Pierre Lison and J rg Tiedemann, 2016 OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC-2016), 2016. Raivis Skadi , J rg Tiedemann, Roberts Rozis and Daiga Deksne, 2014 Billions of Parallel Words for Free [bib] [pdf] In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'2014), Reykjavik, Iceland J rg Tiedemann, 2009, News from OPUS - A Collection of Multilingual Parallel Corpora with Tools and Interfaces [pdf] In N. Nicolov and K. Bontcheva and G. Angelova and R. Mitkov (eds.) Recent Advances in Natural Language Processing (vol V), pages 237-248, John Benjamins, Amsterdam/Philadelphia J rg Tiedemann, 2011, Bitext Alignment, Synthesis Lecture on HLT, Morgan Claypool Publishers (at Amazon) J rg Tiedemann, 2008, Synchronizing Translated Movie Subtitles. [pdf] In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC'2008) J rg Tiedemann, 2007, Building a Multilingual Parallel Subtitle Corpus. [pdf] In Proceedings of CLIN 17, Leuven, Belgium, 2007. J rg Tiedemann, 2007, Improved Sentence Alignment for Movie Subtitles. [pdf] In Proceedings of RANLP '07, Borovets, Bulgaria, 2007. J rg Tiedemann, unpublished OPUS - an open source parallel corpus. [pdf] In Proceedings of the 13th Nordic Conference on Computational Linguistics, University of Iceland, Reykjavik, 2003. J rg Tiedemann, Lars Nygaard, 2004 The OPUS corpus - parallel free. [pdf] In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC'04). Lisbon, Portugal, May 26-28.

TAGS:open an OPUS 

<<< Thank you for your visit >>>

Websites to related :
Peterson Brothers Funeral Home |

  Join our obituary notification email listAll ObituariesWelcome to Our Funeral HomeThank you for visiting our website.The loss of a loved one can leave

Civil Engineering, Surveying, Ge

  Civil Engineering, Surveying, Geotechnical and Geophysical Engineering, Utilities Engineering, Construction Administration, Utility Locating and Mappi

Home - Record Research 2020

  ROCK TRACKS1981-2020ROCK TRACKS1981-2020ROCK TRACKS1981-2020ROCK TRACKS1981-2020ROCK TRACKS1981-2020Save $15Save $15Save $15Save $15Save $15Pre-Public

Fear The Sword, a Cleveland Cava

  Mailbag: What should we make of Dylan Windler? The Cavs remain high on the 2019 first-round pick.

SheThePeople.TV : India’s First

  SheThePeople.TV is India's biggest digital storytelling for women, dedicated to passionately championing and promoting their journeys. We Empower, Eng

Home | Orgasmic Guy

  Stuck?You Have More Sexual Choice Than You Thought When Your Sex Life Is Missing Out... Learn More Here.Free ReportLearn MoreNo one wants to be asking

Study & Learn Italian in Italy -

  Learn Italian Online with Linguaviva Start your journey to Florence and Milan Italian language courses in Milan Italy’s capital of fashion, design

Jewish Celebrations: Guide to Je

  MazorGuide network of websites is proud to add JewishCelebrations.com to its growing list of online guides for Living Jewish. JewishCelebrations.com

Self Hypnosis | Download Hypnosi

  JavaScript seem to be disabled in your browser. You must have JavaScript enabled in your browser to utilize the functionality of this website. I great

Drama Queen Makeup

  About us Our mission is to offer high-quality makeup to women andyoung ladies around the world to help empower them to look and feel beautiful. SIGN U

ads

Hot Websites