How to instruct Déjà vu X3 not to break apart sentences containing "p. [number]"
Thread poster: Pavel Tsvetkov
Pavel Tsvetkov
Pavel Tsvetkov  Identity Verified
Bulgaria
Local time: 06:18
Member (2008)
English to Bulgarian
+ ...

Moderator of this forum
May 26, 2018

Hi All,

I need an expression to instruct Dejavu X3 not to break sentences in the middle, when it encounters a full stop after certain letters and before a number. For example, not to break these apart:

p. 37
стр. 37

Here is a real screen capture of the problem:

2018-05-26_192457

Any ideas?

Best regards,
PTs

[Edited at 2018-05-26 16:52 GMT]


 
Hans Lenting
Hans Lenting
Netherlands
Member (2006)
German to Dutch
Workaround May 26, 2018

When you're in a hurry (and until someone gives you a real solution): replace 'p.' with 'xyz' in the source text (and maybe also in the TM) and import.

When the text is imported, replace 'xyz' with 'p.' again.

Worked for me, with Déjà Vu 2.


 
mikhailo
mikhailo
Local time: 07:18
English to Russian
+ ...
solution May 26, 2018

Pavel Tsvetkov wrote:

Hi All,

I need an expression to instruct Dejavu X3 not to break sentences in the middle, when it encounters a full stop after certain letters and before a number. For example, not to break these apart:

p. 37
стр. 37

Here is a real screen capture of the problem:

2018-05-26_192457

Any ideas?

Best regards,
PTs

[Edited at 2018-05-26 16:52 GMT]


add them to exclusions

before split p.^w (p. followed by space)
after split ^# (any number)

before split стр.^w
after split ^#

try other variants
before split ^wp.^w (space p. space)
after split ^#

The most interesting thing - at segmentation DVX3 don't distinguish simple space and unbrealable space (CTRL+SPACE n word).


 
Selcuk Akyuz
Selcuk Akyuz  Identity Verified
Türkiye
Local time: 07:18
English to Turkish
+ ...
Segmentation options May 27, 2018

Hi Pavel,

FILE > Options > Segmentation

There are two panels, Rules and Exceptions

In Exceptions write the following:

Before Split: p.^w
(p. followed by white space)

After Split: ^#
(A digit)


 
Pavel Tsvetkov
Pavel Tsvetkov  Identity Verified
Bulgaria
Local time: 06:18
Member (2008)
English to Bulgarian
+ ...

Moderator of this forum
TOPIC STARTER
Thank you! May 27, 2018

Thank you all who responded!

 
Pavel Tsvetkov
Pavel Tsvetkov  Identity Verified
Bulgaria
Local time: 06:18
Member (2008)
English to Bulgarian
+ ...

Moderator of this forum
TOPIC STARTER
One thing more... May 27, 2018

When these changes are made, which file are they recorded into? Settings.dvset ?

 
Selcuk Akyuz
Selcuk Akyuz  Identity Verified
Türkiye
Local time: 07:18
English to Turkish
+ ...
settings file May 27, 2018

Correct, they are stored in the Languages table of the Settings file.

Some DVX users think it is better to store them in project templates but updating the templates is a manual task, therefore I prefer the current settings.


 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Pavel Tsvetkov[Call to this topic]

You can also contact site staff by submitting a support request »

How to instruct Déjà vu X3 not to break apart sentences containing "p. [number]"






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »