Forum: CAT Tools Technical Help
Topic: How to convert TMX to tab-delimited?
Poster: Hans Lenting
Post title: Smart approach
[quote]Stepan Konev wrote:
Well, the regex (\n|.) means either a line break (\n) or (|) any character (.), so it covers both single-line and multiline segments. However, manual work may indeed be needed to fix the multiline ones (by removing the paragraph mark) before converting them into a 2-column table. A second option is to use a regex that covers only single-line segments, sacrificing the multiline ones. Not sure which is the lesser evil, though. [/quote]
I’ll try to come up with a regex that replaces all line breaks with ÿ, except when they are followed by <seg>.
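A rough sketch of that idea, written in Python only for illustration (the thread does not say which editor or tool will run the regex, and it has to support negative lookahead). The function name `join_wrapped_lines` and the sample text are made up for this example, and allowing indentation before <seg> is my own assumption:

```python
import re

def join_wrapped_lines(text: str, placeholder: str = "ÿ") -> str:
    """Replace each line break with the placeholder, unless the next
    line starts with <seg> (optionally indented)."""
    # \r?\n            matches a Unix or Windows line break
    # (?![ \t]*<seg>)  negative lookahead: skip breaks followed by <seg>
    return re.sub(r"\r?\n(?![ \t]*<seg>)", placeholder, text)

# Hypothetical sample: one multiline segment followed by a single-line one.
sample = "<seg>first line\nsecond line</seg>\n<seg>Erste Zeile</seg>"
print(join_wrapped_lines(sample))
```

The break inside the first segment becomes ÿ, while the break in front of the second <seg> is left alone. Note that this only implements the rule exactly as stated: in a real TMX file the breaks in front of other tags (<tu>, <tuv> and so on) would be replaced as well, so the pattern would probably need further exceptions.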
[Edited at 2022-10-14 19:59 GMT]