Listing English-Czech Resources

Name Type Description Supported Languages
InterTran Web Site Translation Server MT System
InterTran Web Site Translation Server
Company: Translation Experts Ltd.
Category: MT system (for Web... (more info)
InterTran Web Site Translation Server
Company: Translation Experts Ltd.
Category: MT system (for Websites)
Languages: Albanian, Bosnian, Bulgarian, Chinese, Croatian, Czech, Danish, Dutch, English,
Finnish, French, German, Greek, Hungarian, Icelandic, Italian, Japanese, Korean, Latin,
Norwegian, Polish, Portuguese (inc. Brazilian Portuguese), Romanian, Russian, Serbian, Slovenian,
Spanish, Swedish, Tagalog, Turkish, Ukrainian, Vietnamese, Welsh
Requirements: Windows2000/NT/XP, Unix, Linux; Web browser
Features: regular (daily, weekly, monthly) translation of webpages
Access: http://www.tranexp.com/win/itserver.htm
Price: $995 per language pair (annual licence fee) (less info)
Bulgarian to Bosnian, Bulgarian to Czech, Bulgarian to Danish, Bulgarian to English, Bulgarian to Cr... (more info)
Bulgarian to Bosnian, Bulgarian to Czech, Bulgarian to Danish, Bulgarian to English, Bulgarian to Croatian, Bulgarian to Dutch, Bulgarian to Albanian, Bulgarian to Chinese, Bosnian to Bulgarian, Bosnian to Czech, Bosnian to Danish, Bosnian to English, Bosnian to Croatian, Bosnian to Dutch, Bosnian to Albanian, Bosnian to Chinese, Czech to Bulgarian, Czech to Bosnian, Czech to Danish, Czech to English, Czech to Croatian, Czech to Dutch, Czech to Albanian, Czech to Chinese, Danish to Bulgarian, Danish to Bosnian, Danish to Czech, Danish to English, Danish to Croatian, Danish to Dutch, Danish to Albanian, Danish to Chinese, English to Bulgarian, English to Bosnian, English to Czech, English to Danish, English to Croatian, English to Dutch, English to Albanian, English to Chinese, Croatian to Bulgarian, Croatian to Bosnian, Croatian to Czech, Croatian to Danish, Croatian to English, Croatian to Dutch, Croatian to Albanian, Croatian to Chinese, Dutch to Bulgarian, Dutch to Bosnian, Dutch to Czech, Dutch to Danish, Dutch to English, Dutch to Croatian, Dutch to Albanian, Dutch to Chinese, Albanian to Bulgarian, Albanian to Bosnian, Albanian to Czech, Albanian to Danish, Albanian to English, Albanian to Croatian, Albanian to Dutch, Albanian to Chinese, Chinese to Bulgarian, Chinese to Bosnian, Chinese to Czech, Chinese to Danish, Chinese to English, Chinese to Croatian, Chinese to Dutch, Chinese to Albanian (less info)
PC Translator 2007 MT System
PC Translator 2007
Company: LangSoft
Category: MT system (home use)
Languages: Czech?English, Czech?... (more info)
PC Translator 2007
Company: LangSoft
Category: MT system (home use)
Languages: Czech?English, Czech?French, Czech?German, Czech?Italian, Czech?Russian,
Czech?Spanish, English?German, English?Slovak, German?Slovak
Requirements: Windows 95/98/2000/ME/NT/XP; 50MB HD space; CD-Rom drive; Internet
Explorer 5
Dictionaries: up to 2.7 million (each language pair); 17 technical dictionaries (automobile, biology,
chemistry, transport, physics, technical, computer, electronics, traffic, medicine, geology, law,
business, agriculture, military, zoology); user dictionaries
Input: Word 6.0 and higher, text, email, webpages
Features: spoken output of dictionary entries; also available a pack for English, French, German,
Italian, Russian, Spanish
Price: from K? 3750, (less info)
Czech to German, Czech to English, Czech to French, Czech to Italian, Czech to Russian, German to Cz... (more info)
Czech to German, Czech to English, Czech to French, Czech to Italian, Czech to Russian, German to Czech, English to Czech, French to Czech, Italian to Czech, Russian to Czech (less info)
PetaMem MT System
PetaMem
Company: PetaMem s.r.o
Category: MT system (enterprise), custom-built
Languages: Czech, Dutc... (more info)
PetaMem
Company: PetaMem s.r.o
Category: MT system (enterprise), custom-built
Languages: Czech, Dutch, English, French, German, Hungarian, Portuguese, Swedish; plus reduced
functionality for other languages with dictionaries (below)
Requirements (server): UltraSparc64, pa-risc, p-series, x86 1GHz; Unix-OS (AIX, HP-UX, Solaris,
Linux); 1GB RAM, 36GB HD space; also beowulf-cluster
Requirements (client): browser, Mail-Client
Dictionaries: Croatian (28054 words), Czech (451180 words), Danish (312722 words), Dutch
(178418 words), English (155544 words), Finnish (287368 words), French (137214 words),
German (702707 words), Hungarian (17655 words), Italian (62342 words), Latin (76990 words),
Norwegian (61831 words), Polish (109852 words), Portugese (6654 words), Spanish (85846
words), Swedish (12118 words), Turkish (25854 words)
Input: HTML, ASCII (ISO,WIN,UTF-8), RTF, PS, and others
Note: also available online on 'NLP portal' (http://nlp.petamem.com) electronic dictionaries and
language recognition.
Price: contact company
Source: http://www.petamem.com (less info)
Czech to German, Czech to English, Czech to French, Czech to Hungarian, Czech to Dutch, Czech to Por... (more info)
Czech to German, Czech to English, Czech to French, Czech to Hungarian, Czech to Dutch, Czech to Portuguese, German to Czech, German to English, German to French, German to Hungarian, German to Dutch, German to Portuguese, English to Czech, English to German, English to French, English to Hungarian, English to Dutch, English to Portuguese, French to Czech, French to German, French to English, French to Hungarian, French to Dutch, French to Portuguese, Hungarian to Czech, Hungarian to German, Hungarian to English, Hungarian to French, Hungarian to Dutch, Hungarian to Portuguese, Dutch to Czech, Dutch to German, Dutch to English, Dutch to French, Dutch to Hungarian, Dutch to Portuguese, Portuguese to Czech, Portuguese to German, Portuguese to English, Portuguese to French, Portuguese to Hungarian, Portuguese to Dutch (less info)
Version MT System
Version: 4.3
Language Weaver SMTS
Company: Language Weaver... (more info)
Version: 4.3
Language Weaver SMTS
Company: Language Weaver Inc.
Category: MT system (client-server)
Languages: Arabic?English, Chinese?English, Czech?English, Dutch?English,
English?French, English?German, English?Hindi, English?Italian, English?Korean,
English?Persian, English?Polish, English?Romanian, English?Russian, English?Spanish,
English?Swedish, Somali?English
Requirements (standalone): Pentium 4 2.4GHz; WindowsXP Professional, 2GB RAM; Language
Weaver CGI and web browser (or customer generated via Language Weaver API)
Requirements (server): Windows2003 Server, Windows 2000 Advance server or Linux 2.8GHz
Intel Xeon, 4GB RAM; Linux; Language Weaver CGI and web browser (or customer generated via (less info)
Arabic to English, Czech to English, English to Arabic, English to Czech, English to Dutch, English ... (more info)
Arabic to English, Czech to English, English to Arabic, English to Czech, English to Dutch, English to Chinese, Dutch to English, Chinese to English (less info)
Acquis Communautaire Parallel Corpus
The Acquis Communautaire (AC) is the total body of European Union (EU) law applicable in the the EU ... (more info)
The Acquis Communautaire (AC) is the total body of European Union (EU) law applicable in the the EU Member States. This collection of legislative text changes continuously and currently comprises selected texts written between the 1950s and now. At the beginning of the year 2007, the EU has 27 Member States and 23 official languages (see the Wikipedia entry). The Acquis Communautaire texts exist in these languages, although Irish translations are not currently available. The Acquis Communautaire thus is a collection of parallel texts in the following 22 languages: Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovene and Swedish. (less info)
Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Ita... (more info)
Bulgarian, Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Romanian, Slovak, Slovenian, Swedish (less info)
EuroParl release 3.0 Parallel Corpus
The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includ... (more info)
The Europarl parallel corpus is extracted from the proceedings of the European Parliament. It includes versions in 11 European languages: Romanic (French, Italian, Spanish, Portuguese), Germanic (English, Dutch, German, Danish, Swedish), Greek and Finnish.

For a detailed description of this corpus, please read:
Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005 (less info)
French, Italian, Spanish, Portuguese, English, Dutch, German, Danish, Swedish, Greek, Finnish
OPUS: EUconst Parallel Corpus
21 languages, 210 bitexts
total number of files: 987
total number of tokens: 3099290
total number... (more info)
21 languages, 210 bitexts
total number of files: 987
total number of tokens: 3099290
total number of sentence fragments: 224919 (less info)
Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Irish, Hungarian, Italian... (more info)
Czech, Danish, German, Greek, English, Spanish, Estonian, Finnish, French, Irish, Hungarian, Italian, Lithuanian, Latvian, Maltese, Dutch, Polish, Portuguese, Slovak, Slovenian, Swedish (less info)
OPUS: KDE Parallel Corpus
source: http://i18n.kde.org/
61 languages, 1830 bitexts
total number of files: 24586
total number... (more info)
source: http://i18n.kde.org/
61 languages, 1830 bitexts
total number of files: 24586
total number of tokens: 20414063
total number of sentence fragments: 4777530 (less info)
Afrikaans, Arabic, Azerbaijani, Belarusian, Bulgarian, Breton, Bosnian, Catalan, Czech, Welsh, Danis... (more info)
Afrikaans, Arabic, Azerbaijani, Belarusian, Bulgarian, Breton, Bosnian, Catalan, Czech, Welsh, Danish, German, Greek, English, Esperanto, Spanish, Estonian, Basque, Finnish, French, Irish, Galician, Hebrew, Croatian, Hungarian, Indonesian, Icelandic, Italian, Japanese, Korean, Kurdish, Lithuanian, Latvian, Maori, Macedonian, Maltese, Norwegian Bokmål, Dutch, Norwegian Nynorsk, Occitan, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Serbian, Swedish, Tamil, Thai, Turkish, Ukrainian, ven, Vietnamese, Walloon, Xhosa, Zulu (less info)
OPUS: KDEdoc Parallel Corpus
source: http://i18n.kde.org/
24 languages, 226 bitexts
total number of files: 3736
total number ... (more info)
source: http://i18n.kde.org/
24 languages, 226 bitexts
total number of files: 3736
total number of tokens: 3783411
total number of sentence fragments: 302634 (less info)
Danish, German, English, Spanish, Estonian, French, Hungarian, Italian, Japanese, Dutch, Norwegian N... (more info)
Danish, German, English, Spanish, Estonian, French, Hungarian, Italian, Japanese, Dutch, Norwegian Nynorsk, Portuguese, Portuguese, Romanian, Russian, Slovak, Slovenian, Serbian, Swedish, Turkish, Ukrainian, Walloon, Xhosa, Chinese (less info)
OPUS: OpenOffice Parallel Corpus
6 languages, 15 bitexts
total number of files: 10983
total number of tokens: 2612156
total number... (more info)
6 languages, 15 bitexts
total number of files: 10983
total number of tokens: 2612156
total number of sentence fragments: 246760

The original documentation of the office package OpenOffice.org (http://www.openoffice.org/) contains 2014 English documents which have been partly translated into 5 languages: French, Spanish, Swedish, German, and Japanese. The original documentation in English comprises about 500,000 words and translations contain between 400,000 and 500,000 words per language. All documents have been tokenized and, except of the Spanish part, tagged with parts of speech. The English part of the corpus has been marked with syntactic chunks as well. (less info)
German, English, Spanish, French, Japanese, Swedish
OPUS: PHP Parallel Corpus
22 languages, 231 bitexts
total number of files: 71518
total number of tokens: 3303964
total numb... (more info)
22 languages, 231 bitexts
total number of files: 71518
total number of tokens: 3303964
total number of sentence fragments: 1381582

PHP manuals and translations have been downloaded from (http://www.php.net/download-docs.php). The original documents are written in English and have been partly translated into 21 languages. The original manuals contain about 500,000 words. The amount of actually translated texts varies for different languages between 50,000 and 380,000 words. The corpus is rather noisy and may include parts from the English original in some of the translations. The corpus is tokenized and each language pair has been sentence aligned. (less info)
Czech, German, English, Spanish, Finnish, French, Hebrew, Hungarian, Italian, Japanese, Korean, Dutc... (more info)
Czech, German, English, Spanish, Finnish, French, Hebrew, Hungarian, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Swedish, Turkish, Chinese (less info)