Talk:Translation memory/Archive 1

This page is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.

Translation memories generated by MT programs

Translation memories generated by MT programs such as Trados, TM2, Transit and the like, as I understand, work with identifying a number of (generally 7)(semi)identical words in a segment to be translated (L1) and a segment that was to be (L2m) and actually translated earlier (L2n). On the basis of that match they will offer the sentence in L2 for checking whether it corresponds to the original and earlier input in L1n and therefore its reuse is justified. But, the syntagmatical and paradigmatical features of English (Indo-Euuropean languages) and other languages such as Hungarian are so much apart that this mechanical match does not work reliably and efficiently, especially backward, from L2 to L1. As long as the intellegence built in MT is focused around AI, including matching and fuzzy matching you cannot expect more than fuzy hits, just as in the results of machine/mechanical indexing used in most of the Internet search operations, engines, etc. Apogr 12:21, 15 Feb 2004 (UTC)

Does anyone new free translation memory ? -- Youssefsan

This tool is all right as long as you work with two indo-european languages. But it is a nonsense, if you are required to use Trados for instance to translate from English into Hungarian and vice versa. A better approach to the problem is when you do more pre-procesing and less post-processing after translation and hire highly skilled translators as opposed to employing novices who are fed the machine translated garbage for "editing". See the practice of the translation center(s) of the European Union

The idea of translation memory comes from the false assumption of making translation work more efficient by re-using former translations in a particular subject where the vocabulary is almost controlled or recurrent. Translators, as a result have become more involved in the issues of editing, searching and formatting texts, aligning their segments, phrases, etc. that drastically decreases their productivity and lowers the demand on their skills.

Since most of the translation done from English into other languages (including Hungarian) consists of inventing new words, explaining terms that are kept in their original forms, etc. because of the very nature of the content of the works to be translated, translation memories just as other software developped in the English speaking world are a nuisance to non-native-English speaking translators who try to comply with the rules of their native tongues in translating non-repetitive texts.

In Hungarian for the past twenty years not only the vocabulary, the signs, company names, etc., but e.g. the ways nouns and adjectives are put together, verbs are inflected, addresses, dates and names are written, etc. have changed for the less-complying and new varieties of pidgin and lingo are to emerge. See also translation, translation problems Apogr 09:15, 9 Jan 2004 (UTC) more to come, if solicited Apogr 09:15, 9 Jan 2004 (UTC)

This information is disputable: "As stated previously, translation memory managers may not be suitable for text that lacks internal repetition or which does not contain unchanged portions between revisions. Technical text is generally best suited for translation memory, while marketing or creative texts will be less suitable." Translation memory managers (Cat tools would be a better term) are suitable for any translation work. They prevent skipping text, usually have a glossary manager built-in and other features that help translators to work faster.

I added the "Second generation translation memories" paragraph in the "History" section. I own the copyright of this text extracted from the Similis web site. If someone needs a proof of this, thanks to give me a procedure. EMLM 11:39, 18 August 2006 (UTC)

On the whole, both of the points of view are correct. What regards the first one. Using CAT tools in translation is aimed at speed and general productivity increase and applies to most languages supported by information systems. Due to its core principle, translation and localization (if that is the case) is done by the linguist (translator) and is independent on the tool used. Though high overall leverage and fine match rates are harder to obtain when working with texts of non-roman language origin, such as cyrilic, Asian etc. Danz0 13:55, 29 September 2006 (UTC)

Free versus commercial

I dislike the distinction "free" versus "commercial". Free tools are not necessarily non-commercial tools. There has got to be some other words one can use. Or, here's a thought... why make the distinction at all? Why not add "(freeware)" or "(free version available)" or "(GPL)" or "(time-unlimited shareware)" or similar next to the ones in the list? leuce 10:13, 18 December 2006 (UTC)

TMX tools added

I've added some TMX tools to the Translation_Memory_eXchange page... please if you know of more such tools, please add them there.--leuce 15:53, 30 April 2007 (UTC)

Not an advertisement page

Someone (from IdiomInc.com) has added the following text to the Desktop translation memory software/Freeware section:

Idiom WorldServer Desktop Workbench, developed to provide the tightest integration between desktop translation and WorldServer, our desktop workbench supports all the best translation memory (TM) features such as ICE matching, scoping, contextual previewing and online TM searching. View the release notes, including system prerequisites, before downloading.

Idiom Desktop can be hardly considered "freeware". Yes, maybe you can download it, but the last time I have worked with it as a translator, it only processed proprietary databases, prepared with the "higher" editions of this software. Unless this has changed (I won't buy that), I do not think it has any place here. Oh, and you surely noticed the "our desktop workbench" part - it is a straight copy of the advertising on their page. So I removed it. 217.11.246.4 (talk) 13:31, 16 December 2007 (UTC)