[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 --- Comment #12 from Caolán McNamara caol...@redhat.com 2012-08-28 09:29:55 UTC --- re comment #6 the CH_TXTATR_BREAKWORD there is a special case to filter out words which consist of a special internal placeholder character. So forget about that, the meat is in SwScanner::NextWord which determines the bounds of the words, backed mostly by icu's word break iterator -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 Caolán McNamara caol...@redhat.com changed: What|Removed |Added Status|NEW |ASSIGNED AssignedTo|libreoffice-b...@lists.free |caol...@redhat.com |desktop.org | --- Comment #13 from Caolán McNamara caol...@redhat.com 2012-08-28 16:14:08 UTC --- I have a plausible solution for much of this while exploring msword-alike word counts wrt endash and emdash -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 Astron heinzless...@gmail.com changed: What|Removed |Added Status|NEEDINFO|NEW --- Comment #11 from Astron heinzless...@gmail.com 2012-06-28 01:37:33 PDT --- Setting this to NEW. Can reproduce completely agree with James's proposal. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 --- Comment #10 from Jamee Mikell jamee.mik...@gmail.com 2012-04-20 09:21:49 PDT --- Comment on attachment 60399 -- https://bugs.freedesktop.org/attachment.cgi?id=60399 Example of outline spreadsheet described in comment 9. re: comment 8 It's possible I could figure out LO's macro language and object structures. I have developed (and currently maintain and enhance) macros in VBA for Excel, and made a living coding C and learned a little Python for a couple of small side projects I did, and used various other languages off and on, so it isn't like I have no programming background. But I'd rather be writing fiction, and I haven't found much to help me understand LO's macro language or object structure. I didn't find any extensions or such that offered more sophisticated word counting. The one or two examples I found rely on the underlying (flawed in my opinion) word count method or implement the same method in their own code. To be honest, though, if I did write a macro, I'd write it to look for a specific section break identifier (e.g., eol#eol) that would be entered as part of a dialog and then report word count data for each section. Or use bookmarks and count between bookmarks. Probably other options for defining breaks too. In the end, I decided this issue wasn't going to get solved in a way that meets my needs any time soon so I modified my outline spreadsheet for this writing project. See attachment OutlineExample.ods. Each time I edit a section (part of a chapter), I run the default word count and record the LO word count and the count of characters including spaces on the section's row in the spreadsheet. I sum up the two sets of numbers--LO words and characters--at the end of each set of sections and report LO words and char/6 and char/6 and char/X, where X is a value I determined was a reasonable estimate of characters per page based on sampling several comparable books. I can also estimate char/6 given char for any single section if I need to. This helps me balance the size of books and, to a lesser extent, sections and chapters. If someone would like to write a macro or extension or enhance LO to facilitate more sophisticated word counting methods that fiction writers and others who care about word count might find useful, I would be grateful. I occasionally make changes with find/replace across the whole document, which requires going recounting each section to update the count data. But as I noted, I'd rather write fiction than spend a lot of time figuring out how LO's macro language works and how the Writer objects work and such. If I already knew LO's macro language and objects, it would probably be a different story. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 --- Comment #9 from Jamee Mikell jamee.mik...@gmail.com 2012-04-20 09:20:10 PDT --- Created attachment 60399 -- https://bugs.freedesktop.org/attachment.cgi?id=60399 Example of outline spreadsheet described in comment 9. In case someone is interested in writing a macro to do more sophisticated word counting, this attachment illustrates the semi-manual solution I describe in comment 9. -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 sasha.libreoff...@gmail.com changed: What|Removed |Added CC||sasha.libreoff...@gmail.com --- Comment #8 from sasha.libreoff...@gmail.com 2012-03-01 03:05:52 UTC --- Thanks for so ground work in word counting. IMHO right version of word counting may be written on Basic macro Please, verify: may be among OpenOffice extensions already exist such macro -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs
[Libreoffice-bugs] [Bug 38983] Word Count counts incorrectly with dashes and other separators
https://bugs.freedesktop.org/show_bug.cgi?id=38983 Björn Michaelsen bjoern.michael...@canonical.com changed: What|Removed |Added Status|NEW |NEEDINFO --- Comment #7 from Björn Michaelsen bjoern.michael...@canonical.com 2011-12-23 12:26:44 PST --- [This is an automated message.] This bug was filed before the changes to Bugzilla on 2011-10-16. Thus it started right out as NEW without ever being explicitly confirmed. The bug is changed to state NEEDINFO for this reason. To move this bug from NEEDINFO back to NEW please check if the bug still persists with the 3.5.0 beta1 or beta2 prereleases. Details on how to test the 3.5.0 beta1 can be found at: http://wiki.documentfoundation.org/QA/BugHunting_Session_3.5.0.-1 more detail on this bulk operation: http://nabble.documentfoundation.org/RFC-Operation-Spamzilla-tp3607474p3607474.html -- Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email --- You are receiving this mail because: --- You are the assignee for the bug. ___ Libreoffice-bugs mailing list Libreoffice-bugs@lists.freedesktop.org http://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs