Quantcast
Channel: ProZ.com Translation Forums
Viewing all articles
Browse latest Browse all 3915

How to extract content from SGML to create TMX file | I ♥ EmEditor

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract content from SGML to create TMX file
Poster: Michael Beijer
Post title: I ♥ EmEditor

[quote]gianghl1983 wrote:

[quote]Michael Beijer wrote:

Extract every line from your file that contains "s id='vn" and save it as a new txt file. This is your Vietnamese half.

Extract every line from your file that contains "s id='en" and save it as a new txt file. This is your English half.

This can easily be done in EmEditor (using the Filter Toolbar).

Convert the two files into a tab-delimited txt file. There are many ways to do this. I have an EmEditor macro for this, but there is also a little tool in LF Aligner's "grab bag" (available on sourceforge) that can do this. Or just copy paste them both into a new file.

Then use e.g. the open source Heartsome TMX editor to convert this into a TMX (Tools > Convert to TMX).

Or, let me do it for you (for £30/hour) :-)

[Edited at 2015-09-16 09:59 GMT] [/quote]

Thank you all, I followed Michael Beijer method with EmEditor and I can easily extract text content separately into Vietnamese and English.

Many thanks! [/quote]

Yes, that Filter Toolbar in EmEditor is priceless. Note that it also allows you to filter negatively. So much quicker and easier than messing around with Macros. The whole thing would take maybe 5 minutes in EmEditor.

Viewing all articles
Browse latest Browse all 3915

Trending Articles