Channel: ProZ.com Translation Forums

Trim internal fuzzies (AutoIt script)

Forum: CAT Tools Technical Help
Topic: Trim internal fuzzies (AutoIt script)
Poster: Samuel Murray

Hello everyone

I have written a very simple, work-in-progress, proof-of-concept set of scripts that search a list of sentences for internal fuzzy matches and then group those matches together. This could allow one to split a job among multiple translators without separating internal fuzzy matches from each other (a translator gets no benefit from such a match if the sentence it resembles has been given to someone else).

These are AutoIt scripts, so you need AutoIt installed on your computer to use them. The input file format is plain text, UTF-8 with BOM, one sentence per line. The output file contains the same sentences in the same order, except that any sentence that is an internal fuzzy match of an earlier sentence is moved up and grouped with the earliest sentence it matches.
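
For readers who want to see the grouping idea in code, here is a minimal Python sketch. It is not a port of the AutoIt scripts: the function names, the 0.75 threshold and the use of Python's `difflib.SequenceMatcher` as the fuzzy score are all assumptions made for illustration (the scripts themselves use Levenshtein and Sift2).

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Stand-in fuzzy score between 0 and 1 (assumption: not Levenshtein or Sift2)."""
    return SequenceMatcher(None, a, b).ratio()


def group_internal_fuzzies(sentences, threshold=0.75, score=similarity):
    """Return the same sentences, with later fuzzy matches moved up
    so that they follow the earliest sentence they match."""
    used = [False] * len(sentences)
    result = []
    for i, anchor in enumerate(sentences):
        if used[i]:
            continue  # already moved up into an earlier group
        used[i] = True
        result.append(anchor)
        # Pull up every later, still unplaced sentence that resembles the anchor.
        for j in range(i + 1, len(sentences)):
            if not used[j] and score(anchor, sentences[j]) >= threshold:
                used[j] = True
                result.append(sentences[j])
    return result


def process_file(in_path, out_path, threshold=0.75):
    """Read and write plain text, UTF-8 with BOM, one sentence per line."""
    # The "utf-8-sig" codec strips the BOM on reading and writes one on output.
    with open(in_path, encoding="utf-8-sig") as f:
        sentences = [line.rstrip("\n") for line in f]
    grouped = group_internal_fuzzies(sentences, threshold)
    with open(out_path, "w", encoding="utf-8-sig") as f:
        for sentence in grouped:
            f.write(sentence + "\n")
```

Note that this sketch compares each later sentence only with the first sentence of a group, so a sentence that resembles a group member but not the group's first sentence will start a group of its own. It also compares every pair of sentences, which will be slow on large files.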

[url removed]

There are four scripts: Levenshtein and Sift2 fuzzy matching, each with two different methods of searching. The inner workings of both Levenshtein and Sift2 are Greek to me, so please don't ask me technical questions about them. (-:
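
As general background on the Levenshtein measure (this is the textbook algorithm in Python, not the code used in the AutoIt scripts, and the percentage-style normalisation is an assumption), the distance is the smallest number of single-character insertions, deletions and substitutions needed to turn one string into the other:

```python
def levenshtein(a, b):
    """Edit distance between two strings, computed row by row."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute, or keep if equal
            ))
        previous = current
    return previous[-1]


def levenshtein_similarity(a, b):
    """Scale the distance to a score between 0 (no match) and 1 (identical)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are identical
    return 1.0 - levenshtein(a, b) / longest
```

For example, `levenshtein("kitten", "sitting")` is 3 (two substitutions and one insertion). A CAT tool's fuzzy match percentage is usually more elaborate than this simple ratio, so the numbers will not correspond exactly.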

Samuel

PS. If you know of any CAT tool that can do this sort of thing (i.e. either remove/export or group internal fuzzies in a file), please let me know. It should be a standard feature, but it isn't.
