Matches in DBpedia 2015-10 for { <http://dbpedia.org/resource/Europarl_corpus> ?p ?o }
Showing triples 1 to 41 of 41 with 100 triples per page.
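The pattern and paging shown above could be expressed as a SPARQL query along the following lines. This is only a minimal sketch: the helper names `triple_pattern_query` and `page_count` are hypothetical, the page size of 100 is taken from the header above, and the sketch assumes the data is also reachable through a SPARQL endpoint, which this listing does not state.

```python
def triple_pattern_query(subject_iri: str, page: int = 1, page_size: int = 100) -> str:
    """Build a SPARQL SELECT for the pattern { <subject> ?p ?o } with LIMIT/OFFSET paging.

    Hypothetical helper: it only builds the query text and sends nothing over the network.
    """
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be positive")
    offset = (page - 1) * page_size  # page 1 starts at offset 0
    return (
        "SELECT ?p ?o WHERE { "
        f"<{subject_iri}> ?p ?o "
        "} "
        f"LIMIT {page_size} OFFSET {offset}"
    )


def page_count(total_triples: int, page_size: int = 100) -> int:
    """Number of pages needed to show total_triples (ceiling division)."""
    return -(-total_triples // page_size)


query = triple_pattern_query("http://dbpedia.org/resource/Europarl_corpus")
pages = page_count(41)  # 41 matching triples, 100 per page
```

With 41 matches and 100 triples per page, everything fits on a single page. A real query that pages through results would usually also add an `ORDER BY` clause, since result order is otherwise not guaranteed to be stable between requests.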
- Europarl_corpus abstract "The Europarl Corpus is a corpus (set of documents) that consists of the proceedings of the European Parliament from 1996 to the present. In its first release in 2001, it covered eleven official languages of the European Union (Danish, Dutch, English, Finnish, French, German, Greek, Italian, Portuguese, Spanish, and Swedish). With the political expansion of the EU the official languages of the ten new member states have been added to the corpus data. The latest release (2012) comprised up to 50 million words per language with the newly added languages being slightly underrepresented as data for them is only available from 2007 onwards. The data that makes up the corpus was extracted from the website of the European Parliament and then prepared for linguistic research. After sentence splitting and tokenization the sentences were aligned across languages with the help of an algorithm developed by Gale & Church (1993). The corpus has been compiled and expanded by a group of researchers led by Philipp Koehn at Edinburgh University. Initially it was designed for research purposes in statistical machine translation (SMT). However, since its first release it has been used for multiple other research purposes, including for example word sense disambiguation.".
- Europarl_corpus wikiPageExternalLink Europarl.php.
- Europarl_corpus wikiPageExternalLink europarl.
- Europarl_corpus wikiPageID "36200511".
- Europarl_corpus wikiPageLength "5560".
- Europarl_corpus wikiPageOutDegree "23".
- Europarl_corpus wikiPageRevisionID "672246682".
- Europarl_corpus wikiPageWikiLink BLEU.
- Europarl_corpus wikiPageWikiLink Back_translation.
- Europarl_corpus wikiPageWikiLink Category:Corpora.
- Europarl_corpus wikiPageWikiLink Category:European_Parliament.
- Europarl_corpus wikiPageWikiLink Enlargement_of_the_European_Union.
- Europarl_corpus wikiPageWikiLink European_Parliament.
- Europarl_corpus wikiPageWikiLink European_Union.
- Europarl_corpus wikiPageWikiLink Gale-Church_alignment_algorithm.
- Europarl_corpus wikiPageWikiLink Gale–Church_alignment_algorithm.
- Europarl_corpus wikiPageWikiLink Linguistic.
- Europarl_corpus wikiPageWikiLink Linguistics.
- Europarl_corpus wikiPageWikiLink Philipp_Koehn.
- Europarl_corpus wikiPageWikiLink Statistical_machine_translation.
- Europarl_corpus wikiPageWikiLink Target_language.
- Europarl_corpus wikiPageWikiLink Text_corpus.
- Europarl_corpus wikiPageWikiLink Tokenization_(lexical_analysis).
- Europarl_corpus wikiPageWikiLink Translation.
- Europarl_corpus wikiPageWikiLink Word-sense_disambiguation.
- Europarl_corpus wikiPageWikiLinkText "EUROPARL".
- Europarl_corpus wikiPageWikiLinkText "Europarl Corpus".
- Europarl_corpus wikiPageWikiLinkText "Europarl".
- Europarl_corpus hasPhotoCollection Europarl_corpus.
- Europarl_corpus wikiPageUsesTemplate Template:Reflist.
- Europarl_corpus subject Category:Corpora.
- Europarl_corpus subject Category:European_Parliament.
- Europarl_corpus hypernym Corpus.
- Europarl_corpus type Work.
- Europarl_corpus comment "The Europarl Corpus is a corpus (set of documents) that consists of the proceedings of the European Parliament from 1996 to the present. In its first release in 2001, it covered eleven official languages of the European Union (Danish, Dutch, English, Finnish, French, German, Greek, Italian, Portuguese, Spanish, and Swedish). With the political expansion of the EU the official languages of the ten new member states have been added to the corpus data.".
- Europarl_corpus label "Europarl corpus".
- Europarl_corpus sameAs m.0k0wc81.
- Europarl_corpus sameAs Q5412081.
- Europarl_corpus sameAs Q5412081.
- Europarl_corpus wasDerivedFrom Europarl_corpus?oldid=672246682.
- Europarl_corpus isPrimaryTopicOf Europarl_corpus.