V.3.8.1 adds extra blank lines when clipping Web pages

Clairvaux
Posts: 46
Joined: Sun Dec 25, 2016 12:43 pm
Contact:

V.3.8.1 adds extra blank lines when clipping Web pages

Postby Clairvaux » Mon Jul 10, 2017 10:55 pm

On some Web sites, v.3.8.1 adds 2 extra blank lines between paragraphs when clipping from Web pages. This happened to me on several sites.

Previously, CN respected Web pages text layout : 1 blank line on the Web = 1 blank line in a Cinta note.

Does not happen on all sites, though.

Some pages generating those extra blank lines :

http://www.lemonde.fr/planete/article/2 ... _3244.html

http://www.eurozine.com/germans-must-re ... r-own-sake

https://en.wikipedia.org/wiki/Cat

http://tass.com/politics/955619

Some pages producing normal results :

http://www.lefigaro.fr/flash-actu/2017/ ... iviles.php

https://www.nytimes.com/2017/07/10/opin ... manov.html
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Tue Jul 11, 2017 10:27 am

Thanks for the report!

However, I can't reliably reproduce. On the pages that you list as "producing extra blank lines", for me only EUROZINE produces an unnecessary blank line, that is explainable by a non-breaking space that has been put into the middle line.

Could you please add some screenshots? Thanks!
Alex
Clairvaux
Posts: 46
Joined: Sun Dec 25, 2016 12:43 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby Clairvaux » Tue Jul 11, 2017 2:58 pm

Hi Alex,

Here are screenshots of the clipped notes with v.3.8.1 and v.3.8.

In fact, only one of those Web pages clips correctly : Le Figaro. The New York Times does get extra blank lines too, only I had not noticed them because you have to scroll down to reach them.

I qualify my original post by saying that I did not inspect the Web pages html code, so I can't say for sure how many blank lines are actually there. What I mean is, the previous versions correctly interpreted the Web designers' intentions, and the reader's perception (those are two separate paragraphs here, the Web site put some white space between them => signal this by putting one blank line in the note).

Now it seems to be unpredictable. Temporarily reverting to v.3.8.

Just in case it is relevant, I use Firefox with No Script.

New version

Eurozine v.3.8.1.PNG
http://www85.zippyshare.com/v/ao9776WZ/file.html

Figaro Nice - v3.8.1.PNG
http://www85.zippyshare.com/v/aCTgmlfh/file.html

Le Monde ONU - v.3.8.1.PNG
http://www85.zippyshare.com/v/pApqw4Tx/file.html

NYT Romanovs - v.3.8.1.PNG
http://www85.zippyshare.com/v/fpzWddWW/file.html

Tass Syria - v.3.8.1.PNG
http://www85.zippyshare.com/v/SNY4rlnx/file.html

Wikipedia cat - v.3.8.1.PNG
http://www85.zippyshare.com/v/VNWoFB6L/file.html

Previous version

Eurozine v.3.8.PNG
http://www85.zippyshare.com/v/nsIKb0Vb/file.html

Figaro Nice - v3.8.PNG
http://www85.zippyshare.com/v/GygnaX7n/file.html

Le Monde ONU - v.3.8.PNG
http://www85.zippyshare.com/v/Jtrx8vrd/file.html

NYT Romanovs - v.3.8.PNG
http://www85.zippyshare.com/v/fucnXocX/file.html

Tass Syria - v.3.8.PNG
http://www85.zippyshare.com/v/EbDie7Wb/file.html

Wikipedia cat - v.3.8.PNG
http://www85.zippyshare.com/v/YWL5FeS7/file.html

Edit : is the attachment function broken ? It's the second time I try to upload some files with a post and this does not work. What happens is the green upload bars show up, something gets uploaded because the router light is flashing, the list of files shows up below the post, but there's a yellow triangle in the status column next to the files names. After submitting the post, the files disappear into thin air. Tried Firefox and Opera, to no avail.
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Thu Jul 13, 2017 7:23 am

Thanks for the screenshots!

I have different results when I clip from same pages, so probably you're using a different browser than me.
Please tell which one! This is very browser dependent because each browser puts stuff into clipboard in its own unique way.

About attachments on the forum: Indeed, something is broken. Thanks for drawing my attention to it. I'll look into it asap.
Alex
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Thu Jul 13, 2017 7:28 am

Please try now - attachments should be working!
Alex
Clairvaux
Posts: 46
Joined: Sun Dec 25, 2016 12:43 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby Clairvaux » Thu Jul 13, 2017 8:49 am

Firefox 54.0.1, 64-bit. With No Script and Adblock Plus.

Just checking :
Attachments
Kermode bear.jpg
Kermode bear.jpg (276.22 KiB) Viewed 573 times
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Fri Jul 14, 2017 7:45 am

Nice bear!)
Alex
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Wed Jul 19, 2017 10:39 am

Hi,

could you please try this version?
Should fix the Firefox issues.

CintaNotes_3_8_2_Beta1.zip
(4.4 MiB) Downloaded 44 times
Alex
Clairvaux
Posts: 46
Joined: Sun Dec 25, 2016 12:43 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby Clairvaux » Wed Jul 19, 2017 12:35 pm

Hi Alex,

I've tried this new version, and all the pages above now clip fine, plus a few extra ones I also used it on.

I also tried it with Opera v.46.0.2597.57 (which I did not experiment with CN v.3.8.1), and in one case (http://www.eurozine.com/germans-must-re ... r-own-sake) it did a slightly better job than Firefox, not adding extra white lines at the top of the note (see attachments).

That's not really a problem however, as those extra white lines seem to be generated by the Twitter and Facebook buttons, so getting rid of them is part of the expected bit of cleaning work that occurs after clipping (if you want your note to be tidy and nice).

The problem with CN v.3.8.1 was it added white lines in totally unexpected places and wasn't as good as previous versions in that respect, which now seems to be corrected.

Kind regards,

Clairvaux
Attachments
Opera clip.PNG
Opera clip.PNG (80.93 KiB) Viewed 509 times
Firefox clip.PNG
Firefox clip.PNG (72.98 KiB) Viewed 509 times
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Thu Jul 20, 2017 3:01 am

Thanks for taking time to try the new version out!
Well the main point was to get rid of the unexpected empty lines inside text blocks, which seems that now is achieved.
I'll take a look what can be done to further clear up the article beginning in FF, but it might be that the structure of HTML won't allow CN to make bold assumptions to clean up. For example, it just sees a DIV with an IMG inside, it can't possibly know that this is a Twitter button that can be stripped out.

I take it so that this version is definitely better than 3.8.1, the last question is - is it better than 3.8?
Alex
Clairvaux
Posts: 46
Joined: Sun Dec 25, 2016 12:43 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby Clairvaux » Thu Jul 20, 2017 11:33 am

Thank you for correcting this so fast.

Indeed, the nice thing about CN is it makes correct assumptions most of the time. I suppose assumptions is the best that can be hoped for when converting something as complex as a webpage to rich text, as opposed to a graphic object or something derived from html.

Also, different users might have different expectations regarding the design of their notes, so a core of common assumptions is what matters.

Regarding the progress between v.3.8.1 and v.3.8, others will be in a better position to chime in. I just jumped on this problem which was staring me in the face.
User avatar
CintaNotes Developer
Site Admin
Posts: 4644
Joined: Fri Dec 12, 2008 4:45 pm
Contact:

Re: V.3.8.1 adds extra blank lines when clipping Web pages

Postby CintaNotes Developer » Fri Jul 21, 2017 8:23 am

Well I guess we can consider the issue resolved for now.
However I'm definitely open to suggestions of any concrete HTML-to-rich text transformation algorithm improvements.
And huge thanks to you for taking time to report and test!
Alex

Return to “Bug Reports”