
New free & open source aligner (for Windows, OS X and linux) | There really should be an easier way to do this

Forum: CAT Tools Technical Help
Topic: New free & open source aligner (for Windows, OS X and linux)
Poster: Michael Beijer
Post title: There really should be an easier way to do this

[quote]FarkasAndras wrote:

Merging files before aligning is an option, but it's not a very good one. If one file pair is badly mismatched (one file several dozen or several hundred segments longer than the other) then it can throw off the alignment throughout the rest of the project. It's best to keep files isolated to isolate problems.

Re: GUI batch mode, LF Aligner is on the back burner... I did write a simple GUI program that generates a .bat file for batch alignment, but it's primitive and ugly. I mostly wrote it for my own use. Maybe I will polish it up and publish it at some point, but the earliest time that could possibly happen is next week.
You can of course generate the .bat yourself, which is what I did up to a month or two ago. Copying file names to the clipboard from Total Commander, pasting them in Excel and then using either Excel or search and replace in a text editor to add the rest of the command makes it relatively painless... relatively being the keyword.

[Edited at 2015-09-14 08:35 GMT] [/quote]

Thanks FarkasAndras,

But I solved it for now. I realised that in AlignFactory you can set the program to spit out separate TMXs, as well as one big one. So when you run a huge batch job, and the program chokes, the last TMX it spits out will be the one with the problem. The name of this TMX will correspond to the txt file (pair) with the problem. Just skipping this txt file usually allows the project to complete if rerun. I then just convert the single txt file (pair) with the problem into a separate TMX using Heartsome's TMX editor (Tools > Convert to TMX), and then merge it with the AlignFactory TMX.
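For what it's worth, spotting the culprit can be scripted. This is only a rough Python sketch, and it rests on assumptions that are mine rather than anything AlignFactory documents: that the per-file TMXs all land in one folder, that the big merged TMX is kept somewhere else, and that file modification time reflects the order in which they were written. The function name `last_written_tmx` is made up for the example.

```python
from pathlib import Path


def last_written_tmx(output_dir):
    """Return the most recently modified .tmx file in output_dir, or None.

    Assumption: the newest TMX is the one the batch job was working on
    when it stopped, so its name points at the problem txt pair.
    """
    tmx_files = list(Path(output_dir).glob("*.tmx"))
    if not tmx_files:
        return None
    # Pick the file with the latest modification timestamp.
    return max(tmx_files, key=lambda p: p.stat().st_mtime)
```

Calling `last_written_tmx` on the output folder after a failed run would then give the TMX whose name matches the txt pair to skip on the rerun.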

Indeed: it's never a good idea to merge 100 txt files into a single big one for stuff like this. Way too much chance of something going wrong, not to mention merely merging 100 text files of this type is in itself quite a chore, and likely to choke most programs, even EmEditor.

No time for generating .bat files, etc., right now but I do look forward to your future GUI batch mode thingee, as I would love to test it against AlignFactory.

There really should be an easier way to do this though, seeing as how all of these files are effectively already aligned. All I need is for a program to take the first line of text file de1.txt, match it to the first line of text file en1.txt, and turn the pair into a TU. Then it needs to take the second line of de1.txt, match it with the second line of en1.txt, and turn that into a TU. And so on, down to the last line of both files, and then the same again for the next file pair.
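Something along these lines would probably do it. This is a minimal Python sketch, not a tested tool: it assumes both files are UTF-8, that line N of one really does correspond to line N of the other, and that a bare-bones TMX 1.4 header is good enough for whatever CAT tool imports the result. The function name `lines_to_tmx`, the language codes and the header values are placeholders I made up for illustration.

```python
import xml.etree.ElementTree as ET
from pathlib import Path

# ElementTree writes this attribute out as xml:lang.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def lines_to_tmx(src_path, tgt_path, out_path, src_lang="de", tgt_lang="en"):
    """Pair line N of src_path with line N of tgt_path and write a TMX.

    Returns the number of TUs written. Raises ValueError if the two
    files do not have the same number of lines.
    """
    src_lines = Path(src_path).read_text(encoding="utf-8").splitlines()
    tgt_lines = Path(tgt_path).read_text(encoding="utf-8").splitlines()
    if len(src_lines) != len(tgt_lines):
        raise ValueError(
            f"line count mismatch: {len(src_lines)} vs {len(tgt_lines)}"
        )

    tmx = ET.Element("tmx", version="1.4")
    # Header values below are placeholders, not taken from any real tool.
    ET.SubElement(tmx, "header", {
        "creationtool": "line-pair-sketch",
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plain text",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")

    count = 0
    for src, tgt in zip(src_lines, tgt_lines):
        if not src.strip() and not tgt.strip():
            continue  # skip pairs where both lines are blank
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, src), (tgt_lang, tgt)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text.strip()
        count += 1

    ET.ElementTree(tmx).write(str(out_path), encoding="utf-8",
                              xml_declaration=True)
    return count
```

Calling `lines_to_tmx("de1.txt", "en1.txt", "de1_en1.tmx")` once per file pair keeps every pair in its own TMX, so a pair with mismatched line counts fails on its own instead of throwing off everything after it.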

PS: this is what I'm currently working on: [url removed]
