Channel: ProZ.com Translation Forums

How to extract text from this bilingual text to create TMX | The following works in Notepad++

Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: Mikhail Zavidin
Post title: The following works in Notepad++

You could try deleting the English text using the Find and Replace dialog. Just enter:

To find: (?:\[sentence id=\"\d+\"\].*\[\/sentence\]\r\n){20}((?:\[sentence id=\"\d+\"\].*\[\/sentence\]\r\n){20})

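Replace with: \1

Place the cursor at the beginning of the first line and press the Replace All button.

The pattern matches 40 consecutive lines at a time and keeps only the second 20, so this removes every block of 20 English lines from your file and leaves the Vietnamese text.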
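For anyone who prefers to script this step, here is a minimal Python sketch of the same replacement. It assumes the layout described above (20 English lines, then 20 Vietnamese lines, repeating). The function name keep_second_block is made up for illustration, and the pattern accepts \n as well as \r\n line endings, which the Notepad++ expression does not.

```python
import re

# One "[sentence id="N"]...[/sentence]" line, ending in \r\n or \n.
# Accepting a bare \n is an assumption added here for convenience.
LINE = r'\[sentence id="\d+"\].*\[/sentence\]\r?\n'

# 20 lines to drop, followed by 20 lines to keep (capture group 1).
PATTERN = re.compile(r'(?:%s){20}((?:%s){20})' % (LINE, LINE))


def keep_second_block(text: str) -> str:
    """Drop the first 20 lines of every 40-line run and keep the second 20."""
    return PATTERN.sub(r'\1', text)
```

Read the file with newline="" (for example open(path, encoding="utf-8", newline="")) if you want the original \r\n endings preserved in the output.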

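Then, to get the English text file, go back to the original file, delete its first 20 lines, place the cursor at the beginning of the first line and run Replace All again.
After that, delete the last 20 segments of Vietnamese text, which remain at the end of the file.
Note that the 20 lines deleted at the start are themselves English, so save them separately and put them back at the top of the English file.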
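If you are scripting it anyway, both languages can be separated in one pass without any regex. This is only a sketch under the same assumption (blocks of 20 lines alternating between the two languages, English first). The function name split_alternating_blocks is hypothetical.

```python
def split_alternating_blocks(text: str, block: int = 20):
    """Split lines into two lists by alternating blocks of `block` lines.

    Returns (first, second): `first` collects blocks 1, 3, 5, ...
    and `second` collects blocks 2, 4, 6, ...
    """
    lines = text.splitlines()  # handles both \r\n and \n
    first, second = [], []
    for start in range(0, len(lines), 2 * block):
        first.extend(lines[start:start + block])
        second.extend(lines[start + block:start + 2 * block])
    return first, second
```

With English first, the first list holds the English segments and the second the Vietnamese ones. Unlike the manual procedure, nothing has to be deleted by hand at the start or the end of the file.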

Don't forget to set the search mode to "Regular expression", and leave ". matches newline" unchecked so that .* stays within a single line.

Hope this helps.

[Edited at 2015-12-06 14:26 GMT]
