Forum: CAT Tools Technical Help
Topic: Is TermBase eXchange (TBX) really an industry standard for terminology exchange?
Poster: anchelee
Post title: Thanks
>Jean Dimitriadis
Wow, so many options!
Many that did not come out of my research.
I will check them out.
>Dan Lucas
Yes, from what I understand, TBX become an ISO standard in 2008 and because Localization Industry Standards Association (LISA) shut down in 2011, TBX standard is completely open now available to public ( [url removed] ) but it seemed to me that no one is interested in building a simple and easy to use converter between TBX and CSV or Excel.
[quote]FarkasAndras wrote:
In my experience TBX is very rarely used by translators and translation agencies.
This area is a real mess so there is no simple answer. Pick your format depending on what the data is like and who you want to share it with. If the glossary is simple (no synonyms, just one column per language) and you don't know what software the addressee has then xls is probably the best. Everyone can open an xls file and every glossary tool should be able to import it. [/quote]
Aha, so that is what I thought.
The only publicly available and useful TBX I found was that of Microsoft Terminology Collection ( [url removed] ).
But when modifying and maintaining a list of terms, Excel seems to be a good old way to manage and share.
[quote]CafeTran Training wrote:
I’ve never received a TBX file but many times an Excel file. Tools like SDL’s glossary converter can produce them for exchanging a MuliTerm base.
Rearranging and deleting columns in Excel is very easy. So is swapping of source and target language.
Tab-delimited is basically the same, with less extra data for the file format and also readable in text editors. You can paste a tab-delimited glossary in Excel for manipulation. That’s why I prefer them.
CSV are alike but can require field-surrounding delimiters like quotes, which adds extra characters and makes them more fragile to edit. You can import import them in Excel for manipulation.
Personally, I don’t like TMX for storing terms, since they have a lot of extra data and rearranging and manipulation is more complicated—though not impossible, with a dedicated TMX editor (or e.g. CafeTran). It has fans, but like I said: too complicated for my purposes. Editing a TMX in a text editor can easily lead to corrupted files that cannot be imported (opened) in other tools again. You have to know what you are doing (which is the case with the other solutions too ;) ). Note that TMX comes in many flavours. It’s also (therefore?) the file format with the most problems regarding exchange. Wordfast, for instance, is notorious for not wanting to open perfectly valid TMX files from other apps (you have to use work arounds like WF Anywhere). [/quote]
Yes, I've found the combination of SDL MultiTerm and Glossary Converter is a good way to convert from Excel to TBX or vice versa.
But then I wasn't so sure if it was worth the effort to try to keep all my glossary in TBX format if no was using it in the first place.
I would definitely prefer Excel over CSV or tab-delimited for the reasons you mentioned.
Topic: Is TermBase eXchange (TBX) really an industry standard for terminology exchange?
Poster: anchelee
Post title: Thanks
>Jean Dimitriadis
Wow, so many options!
Many that did not come out of my research.
I will check them out.
>Dan Lucas
Yes, from what I understand, TBX become an ISO standard in 2008 and because Localization Industry Standards Association (LISA) shut down in 2011, TBX standard is completely open now available to public ( [url removed] ) but it seemed to me that no one is interested in building a simple and easy to use converter between TBX and CSV or Excel.
[quote]FarkasAndras wrote:
In my experience TBX is very rarely used by translators and translation agencies.
This area is a real mess so there is no simple answer. Pick your format depending on what the data is like and who you want to share it with. If the glossary is simple (no synonyms, just one column per language) and you don't know what software the addressee has then xls is probably the best. Everyone can open an xls file and every glossary tool should be able to import it. [/quote]
Aha, so that is what I thought.
The only publicly available and useful TBX I found was that of Microsoft Terminology Collection ( [url removed] ).
But when modifying and maintaining a list of terms, Excel seems to be a good old way to manage and share.
[quote]CafeTran Training wrote:
I’ve never received a TBX file but many times an Excel file. Tools like SDL’s glossary converter can produce them for exchanging a MuliTerm base.
Rearranging and deleting columns in Excel is very easy. So is swapping of source and target language.
Tab-delimited is basically the same, with less extra data for the file format and also readable in text editors. You can paste a tab-delimited glossary in Excel for manipulation. That’s why I prefer them.
CSV are alike but can require field-surrounding delimiters like quotes, which adds extra characters and makes them more fragile to edit. You can import import them in Excel for manipulation.
Personally, I don’t like TMX for storing terms, since they have a lot of extra data and rearranging and manipulation is more complicated—though not impossible, with a dedicated TMX editor (or e.g. CafeTran). It has fans, but like I said: too complicated for my purposes. Editing a TMX in a text editor can easily lead to corrupted files that cannot be imported (opened) in other tools again. You have to know what you are doing (which is the case with the other solutions too ;) ). Note that TMX comes in many flavours. It’s also (therefore?) the file format with the most problems regarding exchange. Wordfast, for instance, is notorious for not wanting to open perfectly valid TMX files from other apps (you have to use work arounds like WF Anywhere). [/quote]
Yes, I've found the combination of SDL MultiTerm and Glossary Converter is a good way to convert from Excel to TBX or vice versa.
But then I wasn't so sure if it was worth the effort to try to keep all my glossary in TBX format if no was using it in the first place.
I would definitely prefer Excel over CSV or tab-delimited for the reasons you mentioned.