Forum: CAT Tools Technical Help
Topic: Segment length analysis?
Poster: SDL Community
Post title: All agreed...
[quote]Samuel Murray wrote:
[quote]SDL Community wrote:
So if there were 2500 segments and 2500 words you'd know. [/quote]
What if there were 1000 segments and 10 000 words? That's 10 words per segment, on average. But the time saving on very long segments does not cancel out the time wastage on very short segments. A 30-word segment does not really take more time per word than a 20-word segment, but a 3-word segment takes up much more time per word than a 10-word segment.
I mean, suppose 100 of those segments have only 1 word, and 100 have only 2 words, and 100 have only 3 words, then the average length of the remaining 700 segments (the remaining 9400 words) is about 13 words per segment. The 300 short segments will take up far more time per word than the average.
It takes me (generally) just as long to translate a 1-word segment as a 3-word segment or even a 5-word segment. So for me, if I had wanted the weighted word count to be an accurate indication of the amount of time it will take to do the job, all segments of 5 words or less should be counted as 5 words.
So let's recalculate the 10 000-word example:
100 x 1-word segments: 100 words actual, 500 words weighted
100 x 2-word segments: 200 words actual, 500 words weighted
100 x 3-word segments: 300 words actual, 500 words weighted
Other segments: 9400 words actual
The adjusted word count, then, is 10 900 words (i.e. it would take two to three hours longer to complete the job than a strictly average 10 000 words).
[Edited at 2019-04-27 06:25 GMT] [/quote]
That's why I asked what you'd like to see. In terms of helping with project estimation, this seems like an interesting way forward. Perhaps this is something we could do as a small plugin so you have an additional analysis. Any developer could add this using the API... but assuming nobody here can develop, perhaps I'll add it to our list of things to do.
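For anyone who wants to try the idea before any plugin exists, here is a minimal sketch of the calculation in plain Python. It does not touch the API at all; it only assumes you already have a list of per-segment word counts from somewhere. The function name weighted_word_count and the 13/14-word split of the 700 longer segments are made up for illustration, and the 5-word floor is simply the figure from the quoted post, so adjust it to your own experience.

```python
def weighted_word_count(segment_lengths, min_words=5):
    """Sum the word counts, counting every segment as at least min_words words."""
    return sum(max(n, min_words) for n in segment_lengths)


# The 10 000-word example from the quoted post.
short = [1] * 100 + [2] * 100 + [3] * 100   # 300 short segments, 600 words
# Hypothetical split of the other 700 segments so that they total 9400 words.
others = [13] * 400 + [14] * 300            # 5200 + 4200 = 9400 words

segments = short + others
actual = sum(segments)
weighted = weighted_word_count(segments)

print(f"segments: {len(segments)}")
print(f"actual words: {actual}")
print(f"weighted words: {weighted}")
print(f"extra words from short segments: {weighted - actual}")
```

With a floor of 5 this reproduces the 10 900 figure above. Passing a different min_words shows how sensitive the estimate is to where you draw the line for a "short" segment.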
Regards
Paul