Re: [scribus] Paragraph Separator and Newline Both Showing as \r in Python

Gregory Pittman Fri, 31 Dec 2021 15:00:53 -0800

On 12/31/21 14:19, Matt Miller wrote:

On Fri, Dec 31, 2021, at 07:11, Gregory Pittman wrote:

On 12/30/21 20:49, Matt Miller wrote:

I'm loading a text frame from a utf-8 encoded text file, and within my Scribus 
Python code I want to search for the standard newline character, ascii value 
10. When I see an ascii 10 as the line separator I want to apply a special 
paragraph style to the following paragraph. Most paragraphs end with the 
Unicode paragraph separator character, \u2029, and in those cases the default 
paragraph style is fine.


My problem is that both these types of characters are matching '\r' when I use 
re.search in python. also, if I select either line separator character, then do 
getText(), I get a '\r' no matter what. I've confirmed that my file encoding is 
utf-8. What am I missing? How can I search for a simple '\n' character?


Hi Matt,

You don't say what OS you're using.


Linux

Maybe running dos2unix on the text would help.


Well, my ideal workflow is one where the contents of the text file instruct 
Scribus what paragraph and character styles to use throughout the document. So, 
I'm careful to put exactly the characters I want into the text file. I've been 
largely successful, but now I've run into a case where it seems Scribus (or 
Python) is losing information when I load the file. From inside Scribus I can't 
distinguish between a Unicode paragraph separator, \u2029, and a simple line 
feed, \u000A.

I'm able to open my text file from the Python console and dump it out to see that newline is displayed as 
"\n" and a paragraph separator is displayed as "\u2029." So, I'm suspecting the problem 
is with Scribus, or how I'm using it. I'm loading the file using insertHtmlText(), but I get the same bad 
behavior from the GUI when I do "Content | Get Text..." and load the file manually.

I've attached a text file that shows the problem. If you run the following in the 
scripter console from a document with a text frame named "Text1" you should see 
the problem:


When I save this file then open it with kwrite, then when I do a Replace operation 
trying to switch \n to <p> (an arbitrary choice), I see the replacement happen 
at the end of the 2nd, 3rd, 4th, and 6th sentences.

Greg


___
Scribus Mailing List: [email protected]
Edit your options or unsubscribe:
http://lists.scribus.net/mailman/listinfo/scribus
See also:
http://wiki.scribus.net
http://forums.scribus.net

Re: [scribus] Paragraph Separator and Newline Both Showing as \r in Python

Reply via email to