Quantcast
Channel: ProZ.com Translation Forums
Viewing all articles
Browse latest Browse all 3905

How to extract text from this bilingual text to create TMX | Excel sorting by condition

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: Soonthon LUPKITARO(Ph.D.)
Post title: Excel sorting by condition

[quote]gianghl1983 wrote:

Dear Prozers,

I have a huge bilingual text (English-Vietnamese) in this below format. 20 Vietnamese sentences followed after every 20 English sentences (Total ~ 40.000 sentences).

I am using EmEdit and Notepad++ to process text, my strategy is to select every 20 others lines, then paste into English file. Then, delete all these lines to save into Vietnamese file to create TMX from these 02 files. However, I could not find a suitable filter/regex to do my job.

Do anybody know how to solve my problem?
.... [/quote]

Seeing your text file content, I suggest as follows:
1. Copy all texts into Excel cells.
2. In Excel, sort rows by conditions e.g. extract by multipliers of each 10 rows or others into a new file (based on your contents) e.g. on row 1-10, 21-30, 41-50, ....... Do the same for target texts e.g. extract on row 11-20, 31-40, 51-60, .....
3. You get 2 new Excel files.
4. If you have CAT tools, use them to align 2 files to create a translation memory based TMX file.
[If your master MS Word, you can align text by using macros as well.]

Soonthon L.

Viewing all articles
Browse latest Browse all 3905

Trending Articles