Quantcast
Channel: ProZ.com Translation Forums
Viewing all 3915 articles
Browse latest View live

List of standard subject categories for a translation memory (TM) wanted | Numerical codes?

$
0
0
Forum: CAT Tools Technical Help
Topic: List of standard subject categories for a translation memory (TM) wanted
Poster: Piotr Bienkowski
Post title: Numerical codes?

One of my regular clients actually uses numerical codes for domains and subjects of their jobs. For example the current one I am working on has domain 500 and subject 599. When I asked them some time ago about these numbers the response was that it is their in-house thing. I actually think that they did not develop the coding themselves, but copied it from somewhere. So, where do you think it comes from?

[Edited at 2015-12-04 21:17 GMT]

List of standard subject categories for a translation memory (TM) wanted | DVX?

$
0
0
Forum: CAT Tools Technical Help
Topic: List of standard subject categories for a translation memory (TM) wanted
Poster: Michael Beijer
Post title: DVX?

[quote]Piotr Bienkowski wrote:

One of my regular clients actually uses numerical codes for domains and subjects of their jobs. For example the current one I am working on has domain 500 and subject 599. When I asked them some time ago about these numbers the response was that it is their in-house thing. I actually think that they did not develop the coding themselves, but copied it from somewhere. So, where do you think it comes from?

[Edited at 2015-12-04 21:17 GMT] [/quote]

Doesn't DVX use number codes (along with names)?

Michael

List of standard subject categories for a translation memory (TM) wanted | That UNESCO list

$
0
0
Forum: CAT Tools Technical Help
Topic: List of standard subject categories for a translation memory (TM) wanted
Poster: Oliver Walter
Post title: That UNESCO list

Today (about 3 years late!) I've had a look at that UNESCO list.
It is a PDF file, about 1.2 MB in size, containing 19 pages, with the actual list of names of areas of science occupying 17 and a half pages (pp 2-19).
The page contents are graphical (i.e. images), with no extractable text, and I estimate it has a total of about 1900 science areas - probably much too detailed for practical use as TM subjects. The quality of the graphics is probably too poor for OCR to be of much use.

The top 2 levels of subject-area headings may be of interest for this purpose. e.g. at the top level:
11 Logic, 12 Mathematics, 21 Astronomy and Astrophysics - - - 63 Sociology, 71 Ethics, 72 Philosophy.
At the 2nd level:
1103 Deductive logic, 1203 Computer sciences, 1201 Algebra, 1204 Geometry, 2406 Biophysics.

The actual detailed subject areas are the 3rd level (e.g. Boolean algebra, Hybrid computing, Solar physics, Fused salts, Human anatomy. Pharmacodynamics).
If you want to estimate how many titles there are at each level, download your copy, count the relevant titles on one or two of the pages 2 to 18 and multiply by 17.5 (if you counted 1 page) or 8.7 (if you counted 2 pages to get a slightly more accurate estimate. I haven't done that!)

I would guess these details at the third level are of interest to librarians like Christine (of some years ago) but translators are more likely to be interested in the first or second level.
Michael's list, from TC might be more useful.
I've just discovered: there is (unsurprisingly) some information about this nomenclature in Wikipedia, e.g.:
[url removed]

List of standard subject categories for a translation memory (TM) wanted | I've got that PDF too, but in mine the text is editable/selectable.

$
0
0
Forum: CAT Tools Technical Help
Topic: List of standard subject categories for a translation memory (TM) wanted
Poster: Michael Beijer
Post title: I've got that PDF too, but in mine the text is editable/selectable.

[quote]Oliver Walter wrote:

Today (about 3 years late!) I've had a look at that UNESCO list.
It is a PDF file, about 1.2 MB in size, containing 19 pages, with the actual list of names of areas of science occupying 17 and a half pages (pp 2-19).
The page contents are graphical (i.e. images), with no extractable text, and I estimate it has a total of about 1900 science areas - probably much too detailed for practical use as TM subjects. The quality of the graphics is probably too poor for OCR to be of much use.

The top 2 levels of subject-area headings may be of interest for this purpose. e.g. at the top level:
11 Logic, 12 Mathematics, 21 Astronomy and Astrophysics - - - 63 Sociology, 71 Ethics, 72 Philosophy.
At the 2nd level:
1103 Deductive logic, 1203 Computer sciences, 1201 Algebra, 1204 Geometry, 2406 Biophysics.

The actual detailed subject areas are the 3rd level (e.g. Boolean algebra, Hybrid computing, Solar physics, Fused salts, Human anatomy. Pharmacodynamics).
If you want to estimate how many titles there are at each level, download your copy, count the relevant titles on one or two of the pages 2 to 18 and multiply by 17.5 (if you counted 1 page) or 8.7 (if you counted 2 pages to get a slightly more accurate estimate. I haven't done that!)

I would guess these details at the third level are of interest to librarians like Christine (of some years ago) but translators are more likely to be interested in the first or second level.
Michael's list, from TC might be more useful.
I've just discovered: there is (unsurprisingly) some information about this nomenclature in Wikipedia, e.g.:
[url removed]

[/quote]

I uploaded it to my server (just in case mine is different):

[url removed]

List of standard subject categories for a translation memory (TM) wanted | I gave up

$
0
0
Forum: CAT Tools Technical Help
Topic: List of standard subject categories for a translation memory (TM) wanted
Poster: Meta Arkadia
Post title: I gave up

When I started using a CAT tool in 1997, I obediently followed the manual (some CAT tools come with a manual), and assigned client and subject to my resources. In those days, you could only connect to 1 TM, 1 termbase, and 1 lexicon/glossary in DejaVu, and I'm pretty sure you couldn't use more resources in other tools. The subject drop-down list looked like this:

It's both too comprehensive and too limited since most jobs I do require more than one subject. Like: General, automotive, marketing. Or: General, automative, workplace. Worse: General, automotive, workplace, ACME, series FooBar. I gave up. All subjects became "General." And I wasn't the only one. Like most DV users, I switched to the Big Mama (general TM), Big Papa (general termbase), and lexicon (project specific glossary) approach. The lexicons could grow from project specific to subject specific, and by that, rather useless as project specific.

More than 10 years later, it became possible to connect to more resources in your CAT tool. I developed a rather sophisticated approach with almost unlimited resources and settings. It works. If you know what resources to use, and I'm pretty sure using a classification system like the ones proposed here, won't work.

So earlier this year, I decided to import all my old lexicons, client termbases, and downloaded stuff (like Michael's glossaries) in an indexed database, arguing that I would be able to easily find the resources I needed, and I even intended write a script to search for them automatically by creating a wordlist from the source document. I gave up. There turned out to be more columns needed than I provided for, or wanted to create.

So I went back to good old desktop search, one for less frequently used, extremely big, or "unidentified" resources.

I can access it from within my CAT tool, and if I have to search it regularly, and if it turns out I often get the hits from the same resource, I can add that resource in the CAT tool for automatic search and auto-assembling.

I don't think "mere translators" can create and maintain a useful (standard) classification system, unless you are highly specialised. And in that case, you hardly need a classification system.

Cheers.

Hans

[Edited at 2015-12-05 00:56 GMT]

Across, memoQ are the hell for workers

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: sabrino

All those who order jobs are not always kings as customers.
When economic rationality for all parties involved is lacking and only customer demand is unilaterally imposed, this is an economic tyranny and unfair deal.
A good example:
Especially Across. This online processing tool has internal program errors. An example: when the source texts are copied into the target column and confirmed, then every time number count errors pop up, which must be clicked either by Edit, Add, or Skip and this takes additional waste of time.
Replace-All function is unstable in that copied source text strings do not let copy into the Find and Replace popup easily. There is some internal link error with clipboard to Across UI.

Vendors prefer Across and memoQ for these generate less word count than by trados. This practice is still acceptable as internal fuzzy matches can be included in word count, which translator can put into target text strings by the replace-all function.

With client-based Trados, there no such errors, this is also not dependent on network conditions and rational. Word count can be chosen between. with/without internal fuzzy matches, the package structure is very stable and file size for return package can be often smaller than work files.

The blind belief on online live processing over the internet may have been the impetus for these business people who are manufacturing and selling these online tools.
The actual end result on the practical level is counterproductive, or only productive for those who prefer these tools and just like the "justice" imposed by mass media on any attacks by the power holding nations, those who prefer these tools want to believe that this is a "fact" that these would be productive for its online based technology. This is like cult religion and irrational.

Do not use Across, especially any more unless the program errors are not fixed.

Across, memoQ are the hell for workers | memoQ???

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: ahartje
Post title: memoQ???

I used Trados years ago with all its pros(???) and cons(!!!), including difficult installation, worse or even no client´s support etc. until I found memoQ.
What a relief! It is stable in all its versions, works fine with almost each and every file format, online and offline, and the support given by Kilgray is outstanding and lightning fast.
With memoQ my dreams and demands became true...and Trados lost a forced client!

Across, memoQ are the hell for workers | @ahartje: I feel the same way

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Stepan Konev
Post title: @ahartje: I feel the same way

After years of using Trados (from 2005), I found memoQ 4 years ago. Full stop here. No more words needed.

What regards word counting, does it matter for you at all?
Every translator has his/her own method to calculate workload and profitabilty. Just take the amount proposed by your client and divide it by your comfortable unit (e.g. 250 words for my language pair). If you like the result, you accept the job. And it does not matter how they count: from right to left, from down to top, diagonally backwards, or upside down... No muss, no fuss.

By the way, same file, same TM. Total count is exactly the same. New words/No match count 188 (T2015) v. 189 (mQ) is almost the same.

[IMG] [url removed] [/IMG]

[Edited at 2015-12-05 11:17 GMT]

Across, memoQ are the hell for workers | momoQ passed

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: sabrino
Post title: momoQ passed

From all you cited, I agree to your points plus for memoQ.

But, what kind of support did you need for trados studio? I like this since I never need any help....

Across, memoQ are the hell for workers | RE: what kind of support did you need for trados studio?

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Stepan Konev
Post title: RE: what kind of support did you need for trados studio?

I don't.
Ask them:

[URL= [url removed] ][IMG] [url removed] [/IMG][/URL]
------------------
BTW,
Trados... is also not dependent on network conditions - false. T2015 IS totally dependent. As soon as I disconnect Internet, T2015 says my 'License Expired'. You cannot work without Internet at all.

Across, memoQ are the hell for workers | But I needed support for all versions, till Studio 2009,

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: ahartje
Post title: But I needed support for all versions, till Studio 2009,

for installation. The licence key did not work for 10 days, going forth and back, treating me like a dummy. Finally they called it one of these rare cases needing another approach.
But 2 times I needed urgent help solving a problem within an ongoing project. In both cases Trados replied to see the FAQs (as if I had not done so in first place!) and when I insisted to talk to the personal adviser (or how they called it), I received a message like: "Thank you for contacting me. I´m on vacation so please get in touch in 10 days from now."

At that moment my decision was made up to look for an alternative: memoQ! Happy ending (or beginning)!

Across, memoQ are the hell for workers | not much point saying it, but I'll say it anyway ...

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Michael Beijer
Post title: not much point saying it, but I'll say it anyway ...

There are other alternatives out there, outside of the memoQ vs Trados bubble so many people seem to be trapped in. Just sayin'.

These days, m€moQ is just as buggy as that pile of crumpets, $DL $tudio.

Michael

Across, memoQ are the hell for workers | sdl license

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: sabrino
Post title: sdl license

I know many pirated trados 2007 working on .ttx are still used mainly in China, except for some good vendors.
But I didn't know trados 2015 needs to be online for the license purpose. Will not use it. 2014 is good.

All in all, stop using Across, call to all all vendors.
I ended this hell of my work time by myself.

Good luck everybody and enjoy working and thank you for your replies.

Across, memoQ are the hell for workers | RE: These days, m€moQ is just as buggy as...

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Stepan Konev
Post title: RE: These days, m€moQ is just as buggy as...

Maybe you mean 2015 version. I have 2013 and I cannot detect any problem with it... What I do wrong?
Most problems with memoQ that I have seen here on this forum stem from lack of knowledge. That's it.

Across, memoQ are the hell for workers | the old versions are indeed better

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Michael Beijer
Post title: the old versions are indeed better

[quote]Stepan Konev wrote:

Maybe you mean 2015 version. I have 2013 and I cannot detect any problem with it... What I do wrong?
Most problems with memoQ that I have seen here on this forum stem from lack of knowledge. That's it. [/quote]

Indeed, I too still have 2013 (although I only ever use it for projects where the client insists that memoQ be used), and it is relatively bug free. However, this cannot be said of the newer versions, and since it is now 2015 already (almost 2016!), not 2013, or even 2014 – 2015 is the version Kilgray is pushing.

From 2016 to 2013 is quite a jump to have to make to escape bugs.

[Edited at 2015-12-05 19:22 GMT]

How to extract text from this bilingual text to create TMX

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: gianghl1983

Dear Prozers,

I have a huge bilingual text (English-Vietnamese) in this below format. 20 Vietnamese sentences followed after every 20 English sentences (Total ~ 40.000 sentences).

I am using EmEdit and Notepad++ to process text, my strategy is to select every 20 others lines, then paste into English file. Then, delete all these lines to save into Vietnamese file to create TMX from these 02 files. However, I could not find a suitable filter/regex to do my job.

Do anybody know how to solve my problem?

Thank you so much in advanced!!!

[sentence id="1"] Another method is to have a central control unit allocate time based on the priority requirements of each transmission , as in the 100 VGAnyLAN demand priority access method .[/sentence]
[sentence id="2"] I 'm using the same interface function here , but I want it to do something different for my new type . .[/sentence]
...
[sentence id="1"] Một phương pháp khác nhờ một đơn vị điều khiển trung tâm cấp phát thời gian dựa trên độ ưu tiên về đường truyền , như trong phương pháp 100 VGAnyLAN .[/sentence]
[sentence id="2"] Tôi đang dùng hàm giao diện đó ở đây nhưng muốn làm một điều gì đó khác hẳn cho kiểu mới của tôi .[/sentence]
...
[sentence id="21"] A back - end MTA ( message transfer agent ) that transfers messages from the UA and delivers them to mail servers .[/sentence]
[sentence id="22"] A backbone can link multiple networks in the campus environment or connect networks over wide area network links .[/sentence]
...
[sentence id="21"] Trình ứng dụng back - end MTA ( đại lý truyền thư tín ) truyền tải các thông điệp từ UA và phân phối chúng đến các mail server .[/sentence]
[sentence id="22"] Một đường trục có thể nối nhiều mạng trong một môi trường campus hay nối kết các mạng trong diện rộng .[/sentence]
....

How to extract text from this bilingual text to create TMX | Excel sorting by condition

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: Soonthon LUPKITARO(Ph.D.)
Post title: Excel sorting by condition

[quote]gianghl1983 wrote:

Dear Prozers,

I have a huge bilingual text (English-Vietnamese) in this below format. 20 Vietnamese sentences followed after every 20 English sentences (Total ~ 40.000 sentences).

I am using EmEdit and Notepad++ to process text, my strategy is to select every 20 others lines, then paste into English file. Then, delete all these lines to save into Vietnamese file to create TMX from these 02 files. However, I could not find a suitable filter/regex to do my job.

Do anybody know how to solve my problem?
.... [/quote]

Seeing your text file content, I suggest as follows:
1. Copy all texts into Excel cells.
2. In Excel, sort rows by conditions e.g. extract by multipliers of each 10 rows or others into a new file (based on your contents) e.g. on row 1-10, 21-30, 41-50, ....... Do the same for target texts e.g. extract on row 11-20, 31-40, 51-60, .....
3. You get 2 new Excel files.
4. If you have CAT tools, use them to align 2 files to create a translation memory based TMX file.
[If your master MS Word, you can align text by using macros as well.]

Soonthon L.

How to extract text from this bilingual text to create TMX | Post file

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: FarkasAndras
Post title: Post file

If you post the file I'll have a look. It sounds like it has a good structure so it should take 10 minutes to process. Try to avoid aligning the files with a CAT as all it will do is mess things up. This should be converted straight into a table and then into a TMX.

Across, memoQ are the hell for workers | I don't understand a word of the original posting

$
0
0
Forum: CAT Tools Technical Help
Topic: Across, memoQ are the hell for workers
Poster: Mikhail Kropotov
Post title: I don't understand a word of the original posting

I have a lot to say on the general topic of what it's like to use MemoQ and Trados. However, since I don't understand a word of sabrino's original posting, I cannot really comment.

How to extract text from this bilingual text to create TMX | I found solution!

$
0
0
Forum: CAT Tools Technical Help
Topic: How to extract text from this bilingual text to create TMX
Poster: gianghl1983
Post title: I found solution!

Thank you all for your suggestion, I found a solution for my problem by:

- Using EmEditor to rip off syntax, so I will have 20 Vietnamese sentences followed by 20 English sentences...
- Copy-Paste into Excel
- Using Kutools Excel (30 days trial) to use SELECT INTERVAL ROWS & COLUMN to select 20 rows with interval = 20 > Only English/Vietnamese Content.

Have a nice day!
Viewing all 3915 articles
Browse latest View live