Quantcast
Channel: ProZ.com Translation Forums
Viewing all 3915 articles
Browse latest View live

When a 50% match isn't a 50% match? | I don't think I never trusted even a 80% match....

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: megane_wang
Post title: I don't think I never trusted even a 80% match....

.... can you imagine a "50%"? I don't care about them at all.

CAT tools are a help, not THE ultimate tool (fortunately for us translators) :D

Ruth

When a 50% match isn't a 50% match? | Which CAT tool?

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Jean Dimitriadis
Post title: Which CAT tool?

Out of curiosity, which CAT tool did you use?

Mine gives a 33% match, also counting the word "til".

When a 50% match isn't a 50% match? | A program that clearly can't count

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Chris S
Post title: A program that clearly can't count

It is large agency's own system.

I got paid 100% for this, so that's not the issue.

What bothers me is how a computer can possibly add up wrong...

And how much else it might get wrong...

When a 50% match isn't a 50% match? | almost funny

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Nadia Castro
Post title: almost funny

[quote]Roman Karabaev wrote:

A screenshot from MemoQ. Zero words out of two make... a 68% match. [/quote]

(I)nstruction -> (co)nstruction....it's weird how your MemoQ "thinks"!

When a 50% match isn't a 50% match? | Configuration

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Nadia Castro
Post title: Configuration

[quote]Jean Dimitriadis wrote:

Out of curiosity, which CAT tool did you use?

Mine gives a 33% match, also counting the word "til". [/quote]

Most CAT tools allow you to set your preferred match rate, I personally set it to a minimum of 75% percent -- less than that it feels like (at least in most cases) just easier to translate from scratch.

When a 50% match isn't a 50% match? | It's a bit of science and a bit of magic

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Samuel Murray
Post title: It's a bit of science and a bit of magic

[quote]Chris S wrote:
I did a rare CAT job today and noticed this:
...
This came up as a 50% match. [/quote]

By character, 65% of the segment in the TM matches 40% of the segment in the source text.

I believe CAT tools that can't do proper morphological stemming/tokenization may try to strike a balance between word matching and character matching. My own CAT tool, WFC, favours character matching when the segment is short, and word matching when the segment is long. This leads to things similar to Roman's construction/instruction.

[quote]Since when do four words out of 14 make a 50% match? [/quote]

Is the proposed translation 50% useful to you? If yes, then it is a true 50% match. If not, then they didn't get the magic quite right, but magic isn't precise anyway.

[quote]Jean Dimitriadis wrote:
Out of curiosity, which CAT tool did you use?
Mine gives a 33% match, also counting the word "til". [/quote]

Without any morphological analysis (i.e. default tokenizer, "language unknown"), OmegaT says it's below the match threshold (i.e. below 30%). With an English tokenizer, OmegaT says it's a 38% match. But with the Danish tokenizer, OmegaT says it's a 50% match.

[Edited at 2018-11-16 18:34 GMT]

When a 50% match isn't a 50% match? | Matches only start getting useful at 70-80%

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Endre Both
Post title: Matches only start getting useful at 70-80%

As Thomas and Nadia have mentioned, it’s usually only somewhere above 70% that matches actually have a chance to be useful in the sense of saving any time at all compared to a fresh translation. Lower matches can help with terminology consistency (although there are better tools for that), but they don't make you quicker.

It would be interesting to compare the development of match ratings in CAT tools over time. Unfortunately the calculation of match rates is a race to the bottom. For obvious reasons, agencies are interested in pushing match rates upwards until they are just this side of indefensible - higher matches are a windfall to them.

Even translators who have the occasion and the inclination to compare CAT tools tend to assume that higher match rates equate to better TM leveraging, when in fact there is scant correlation between the two, the differences mostly boiling down to the audacity of the CAT tool’s marketing team. As displayed by the examples in this thread, they are getting pretty audacious.

When a 50% match isn't a 50% match? | culture-dependents: half-full is half-empty

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: DZiW
Post title: culture-dependents: half-full is half-empty

Not dwelling too much on such "secret vendors' know-hows" as hashes, checksums, shingles, clusters, vectors, Levenshtein distances, encoders, SounEx, and other weird stuff, it's just an attempt to obfuscate the fact that very idea of "similar sentences"--let alone in different language--is but an expensive miscalculation.

Little by little modern trends steadily come to per-language structural [subj-pred-obj] parts aggregation, considering synonyms and weighting antonyms while sacrificing functional parts. A couple years ago I was pleasantly surprised to watch a demonstration where some app analyzed simple, complex, and compound sentences and could tell about similarity of the context--noting the antecedents (the meaning).

However, even in a new/small TM I never used a 50% fuzzy match, because I also doubt that many 'false positives' are any useful for speeding the process up

When a 50% match isn't a 50% match? | @Endre

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Samuel Murray
Post title: @Endre

[quote]Endre Both wrote:
It’s usually only somewhere above 70% that matches actually have a chance to be useful in the sense of saving any time at all compared to a fresh translation. Lower matches can help with terminology consistency (although there are better tools for that), but they don't make you quicker. [/quote]

I've had the opposite experience. Especially with regard to lengthier segments, a low match would save me time if it concerns a repeated phrase. I can recall several instances when my CAT tool yielded no match but the first result in a concordance search was something that I would very much would have liked to see suggested as a fuzzy match. This is particularly true for matches consisting of consecutive words.

Here's a hypothetical example of such a no match that would have saved time and sanity:

Segment 1: Thinking of your experience with Company X over the past 7 days, please rate the following on a scale of 1 to 10:
Segment 2: Thinking of your experience with Company X over the past 7 days, and considering how the company's Y compares with that of other companies mentioned in question Z, please tell in your own words how satisfied you were with the following:

When a 50% match isn't a 50% match? | The answer

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Chris S
Post title: The answer

The helpdesk tells me the reason for this showing as a 50% match is because otherwise this 30% match wouldn't show up as a match at all. In other words, they're trying to help me, not rip me off. This seems reasonable.

The construction/instruction thing made me laugh/cry though :-)

When a 50% match isn't a 50% match? | Fragments

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Lincoln Hui
Post title: Fragments

[quote]Samuel Murray wrote:

[quote]Endre Both wrote:
It’s usually only somewhere above 70% that matches actually have a chance to be useful in the sense of saving any time at all compared to a fresh translation. Lower matches can help with terminology consistency (although there are better tools for that), but they don't make you quicker. [/quote]

I've had the opposite experience. Especially with regard to lengthier segments, a low match would save me time if it concerns a repeated phrase. I can recall several instances when my CAT tool yielded no match but the first result in a concordance search was something that I would very much would have liked to see suggested as a fuzzy match. This is particularly true for matches consisting of consecutive words.

Here's a hypothetical example of such a no match that would have saved time and sanity:

Segment 1: Thinking of your experience with Company X over the past 7 days, please rate the following on a scale of 1 to 10:
Segment 2: Thinking of your experience with Company X over the past 7 days, and considering how the company's Y compares with that of other companies mentioned in question Z, please tell in your own words how satisfied you were with the following:
[/quote]
I think we call them fragments rather than matches. Most CAT tools definitely use them, and I often wish that they are more robust in detecting them.

When a 50% match isn't a 50% match? | Depends what it's used for

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: B D Finch
Post title: Depends what it's used for

I don't give a discount for anything below an 85% match. However, when translating, I find it useful to set Wordfast to display matches down to 65%. Even if those lower matches aren't directly used, they can be useful for ideas on how to put something and for keeping the translation consistent.

When a 50% match isn't a 50% match? | NP + VP consistency

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: DZiW
Post title: NP + VP consistency

An abstract 50% "match" (1) is but irrelevant, (2) slows down [often badly] selection and (3) edition, rendering it rather useless. Perhaps, in some language pairs such an approach--considering the nucleus of the sentence--might help a little, but still...Either speed, or quality

Free QA alternative to XBench? | Free QA Tool available ????

$
0
0
Forum: CAT Tools Technical Help
Topic: Free QA alternative to XBench?
Poster: MarlonofAnkara
Post title: Free QA Tool available ????

I been to { [url removed] } but it is only a search engine and does not lead to anywhere. could you be a bit help please?

Free QA alternative to XBench? | Correct link for CheckMate

$
0
0
Forum: CAT Tools Technical Help
Topic: Free QA alternative to XBench?
Poster: Jorge Payan
Post title: Correct link for CheckMate

[quote]Sergei Leshchinsky wrote:

also Check Mate from Okapi ( [url removed] )

[/quote]

[url removed]

Free QA alternative to XBench? | /

$
0
0
Forum: CAT Tools Technical Help
Topic: Free QA alternative to XBench?
Poster: Sergei Leshchinsky
Post title: /

[quote]Jorge Payan wrote:
[quote]Sergei Leshchinsky wrote:
also Check Mate from Okapi [url removed]
[/quote]
[url removed] [/quote]
My link works too :)

Portable version of Olifant TMX editor - edit TMs on the go | Sorry but this file is marked by my antivirus (McAfee) as potentially unwanted software

$
0
0
Forum: CAT Tools Technical Help
Topic: Portable version of Olifant TMX editor - edit TMs on the go
Poster: DamienR
Post title: Sorry but this file is marked by my antivirus (McAfee) as potentially unwanted software

Again I'm sorry, it sounds like I'm accusing you of something but I know this could be a false flag (or it could come from that Enigma wrapper you used?).
So anyway I figured I might as well warn you (and people who could be interested in Olifant in a portable version like me) that when I scan the unzipped file you shared, it gets reported as "PUP-XCI-JJ", a "Potentially Unwanted Program" which I understand to mean it's not actually toxic (ie not a virus/trojan/keylogger/...).
I couldn't find anything precise about PUP-XCI-JJ so again I *guess* it's not too dangerous, but you might not want to trust it with your precious TMX.
Maybe give Heartsome's TMX editor a go?
[url removed]
It can be used as portable software too AFAIK.

looking for opensource web-based translation software

$
0
0
Forum: CAT Tools Technical Help
Topic: looking for opensource web-based translation software
Poster:Руслан И.

Is there a translation tool with such features, liked below in priority order:
1) web-based
2) open source
3) deployable of my own server
4) database based (for increasing search speed and managing gigabytes od data)
5) easy to import translation memories, dictionaries, glossaries
6) full text search feature (as well as fuzzy seach)
7) Option to plug customlemmatization tool

For now I've found only [url removed]

When a 50% match isn't a 50% match? | Nice 80% match from Crowdin

$
0
0
Forum: CAT Tools Technical Help
Topic: When a 50% match isn't a 50% match?
Poster: Endre Both
Post title: Nice 80% match from Crowdin

A nice 80% match from Crowdin - one or two tweaks here and there, and you should be good to go, right?

Translation Workspace XLIFF Editor – bulk replace of decimal points by decimal commas?

$
0
0
Forum: CAT Tools Technical Help
Topic: Translation Workspace XLIFF Editor – bulk replace of decimal points by decimal commas?
Poster: Jiří Toman

Hi all, I have been wondering if someone has already tried to bulk replace decimal points by decimal commas. I have large files that contain around 1000 words in decimal numbers and I have to open every segment and change each decimal point to a decimal comma. I have tried using the bulk find and replace feature with regular expressions, I have found find all of the occurrences, but I don't know how to replace them... it won't let me... any idea? Have someone else tried the same? Is there a way? Thank you in advance for any hint.
Viewing all 3915 articles
Browse latest View live