Forum: CAT Tools Technical Help
Topic: How to convert TMX to tab-delimited?
Poster: Jean Dimitriadis
Post title: Why not use CafeTran Espresso?
Could you simply use CafeTran Espresso for that conversion?
1. Create or open a project with the required language pair.
2. Open or Import the TMX file (or an SDLTB/TBX, which will be automatically converted to TMX), possibly not as read-only and with fragments enabled
3. Select the tab of the glossary you wish to import into (an empty Project Terms page will do, or you create a new glossary and select its tab)
4. Memory menu > Export > Export segments to glossary. A dialog will ask you to select which memory to import segments from. And if the currently selected/opened tab is not a glossary, it will first ask you to select one.
That's it.
CafeTran also includes some TM Filter options, including one called "Clean and replace foreign codes": Some TMX files from third-party tools have unusual codes in the segments such as codes inside the curly brackets or emdash, endash, tab code. CafeTran clears or replaces them with equivalent unicode characters.
[url removed] #tm-filter-options
If needed, prior TMX editing (including search and replace, with or without regular expressions) can also be done from within CafeTran.
[Edited at 2022-10-27 05:50 GMT]
Topic: How to convert TMX to tab-delimited?
Poster: Jean Dimitriadis
Post title: Why not use CafeTran Espresso?
Could you simply use CafeTran Espresso for that conversion?
1. Create or open a project with the required language pair.
2. Open or Import the TMX file (or an SDLTB/TBX, which will be automatically converted to TMX), possibly not as read-only and with fragments enabled
3. Select the tab of the glossary you wish to import into (an empty Project Terms page will do, or you create a new glossary and select its tab)
4. Memory menu > Export > Export segments to glossary. A dialog will ask you to select which memory to import segments from. And if the currently selected/opened tab is not a glossary, it will first ask you to select one.
That's it.
CafeTran also includes some TM Filter options, including one called "Clean and replace foreign codes": Some TMX files from third-party tools have unusual codes in the segments such as codes inside the curly brackets or emdash, endash, tab code. CafeTran clears or replaces them with equivalent unicode characters.
[url removed] #tm-filter-options
If needed, prior TMX editing (including search and replace, with or without regular expressions) can also be done from within CafeTran.
[Edited at 2022-10-27 05:50 GMT]