Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
2009/11/3 Evgeniy Ginzburg : > On Tue, Nov 3, 2009 at 4:06 AM, David Reyes Samblas Martinez > wrote: > [snip] >> I have find a process[1] I think it can be industrialized to transform >> any image of the wikipedia to one more or less good to the device is >> clear than we can expect a real time 3D zoomable render on the WR but >> I think results are quite promising > One option for such "industrialization" of images converting may be > something like this onliner using ImageMagic > > convert infile.png -geometry 240 +dither -colors 2 -colorspace gray > -contrast-stretch 0 -normalize outfile.pbm > > For reference see http://www.imagemagick.org/Usage/quantize/ > > [snip] > -- > So long, and thanks for all the fish. > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community GIMP also alows to work with bactch process and scripting , I have to find how but I know it can, we will choose the option than better results will give David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tue, Nov 3, 2009 at 6:55 PM, Michal Brzozowski wrote: > 2009/11/3 Evgeniy Ginzburg >> >> On Tue, Nov 3, 2009 at 4:06 AM, David Reyes Samblas Martinez >> wrote: >> [snip] >> > I have find a process[1] I think it can be industrialized to transform >> > any image of the wikipedia to one more or less good to the device is >> > clear than we can expect a real time 3D zoomable render on the WR but >> > I think results are quite promising >> One option for such "industrialization" of images converting may be >> something like this onliner using ImageMagic >> >> convert infile.png -geometry 240 +dither -colors 2 -colorspace gray >> -contrast-stretch 0 -normalize outfile.pbm >> >> For reference see http://www.imagemagick.org/Usage/quantize/ > > Ascii art could be nice too. And wouldn't require much hacking on the device > side :-) > I've just tried to view 240 pixel wide images in ASCII, cannot see nothing. Using .PBM let you see (in worst case) something. -- So long, and thanks for all the fish. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
2009/11/3 Evgeniy Ginzburg > On Tue, Nov 3, 2009 at 4:06 AM, David Reyes Samblas Martinez > wrote: > [snip] > > I have find a process[1] I think it can be industrialized to transform > > any image of the wikipedia to one more or less good to the device is > > clear than we can expect a real time 3D zoomable render on the WR but > > I think results are quite promising > One option for such "industrialization" of images converting may be > something like this onliner using ImageMagic > > convert infile.png -geometry 240 +dither -colors 2 -colorspace gray > -contrast-stretch 0 -normalize outfile.pbm > > For reference see http://www.imagemagick.org/Usage/quantize/ > Ascii art could be nice too. And wouldn't require much hacking on the device side :-) ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tue, Nov 3, 2009 at 4:06 AM, David Reyes Samblas Martinez wrote: [snip] > I have find a process[1] I think it can be industrialized to transform > any image of the wikipedia to one more or less good to the device is > clear than we can expect a real time 3D zoomable render on the WR but > I think results are quite promising One option for such "industrialization" of images converting may be something like this onliner using ImageMagic convert infile.png -geometry 240 +dither -colors 2 -colorspace gray -contrast-stretch 0 -normalize outfile.pbm For reference see http://www.imagemagick.org/Usage/quantize/ [snip] -- So long, and thanks for all the fish. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
David Reyes Samblas Martinez wrote: >> But having maps, flags, schematics and other low dynamic stuff makes >> total >> sense. > I see the flags more problematic than van Gough ... a lot of them > relies on colors to diferentiate each other so italian,french,irish, > and all the miriad trhee vertical colors flags will be very hard > differentiable You see that effect on cheap newspaper prints. They have fairly large 1bit pixels. It works good enough. It's ugly for pictures. But very well for diagrams or anything like that. You won't have absolute colours but that still works well. >> PPS: Apropos SVG. I guess we can keep them as some kind of vector format >> to save space. > With the sizes we are talking abuout (3-4Kb once compressed), rarely a > svg will be smaller than this, and I think reder a vector image is > more resouce hungry than just a plain bitmap, but if the device can > hold it it can be awesome as map viewer :) True. 1 bit images are probably smaller then vector graphics. What would be a miss could maybe 'scrolling' (Paging) ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
> I'm not sure if this is a desired workflow. But I don't think whis will be > a problem if everybody builds his own wikireder offline database. > > Meaning, Wikireader ships and maintains a database with all safe content. > And if you like more you do it yourself. If the viewer is already implemented you parse/render the whole Wikipedia to include the images links already rip off in the "official" version with a "--include-non-free" option in make or a unnoficial patch to avoid this filtering..., It seems a great idea to me. then is up the (advanced)user to include this image or not and he is not taking any more profit than enjoying the images . I think is a good aproach for the licencing issue. > > PS: I think it would be a good idea to only use pics with low dnamic in > the first place. There is no use to have a van Gough on a 1bit low res > screen. I not agree with this, is clear than you cannot appreciate the subtle mastering of colors or the smart use of lights in a 1 bit color depth 240px width image :P but you can see How it looks like and in WR for me is far from enough, > But having maps, flags, schematics and other low dynamic stuff makes total > sense. I see the flags more problematic than van Gough ... a lot of them relies on colors to diferentiate each other so italian,french,irish, and all the miriad trhee vertical colors flags will be very hard differentiable > I especially think about the huge amount of svg content. > I imagine, that this can be fairly easily detected. (Maybe just simply by > compression factor) or by his extension :P > > PPS: Apropos SVG. I guess we can keep them as some kind of vector format > to save space. With the sizes we are talking abuout (3-4Kb once compressed), rarely a svg will be smaller than this, and I think reder a vector image is more resouce hungry than just a plain bitmap, but if the device can hold it it can be awesome as map viewer :) > > PPPS: We need a mailinglist Meanwhile people tag the topic I feel confortable in the OM list for a OM device > > Tilman > > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! 2009/11/3 David Garabana Barro : > On Tuesday 03 November 2009 12:15:11 David Reyes Samblas Martinez wrote: >> Regarding compression, I believe lzma is already builded in the >> wikireader application and it compress the images a 50%. enough for >> start I guess. but I have to recongnize than the image on png looks >> really good do maybe it worth the meaning to implemente it on the >> device if it's not much resource hungr > > Both png and pbm are 1 bit images without lossy compression. > You can obtain exactly the same final image quality on both formats, but png > will have smaller disk size. As I said lzma compresed pbm files are about the same size like a png file so if same results can be achieved, I vote for stay on what's already implemented And seems this way isi commpressed a litte bit more I see the sample png file is 4263 bytes and the same pmb+lzma is about 2937 (sucotronic please can you email me with the name of "treshold tool" in spanish and post the values you chose if any?) > Final result only depends on RGB->1 bit indexed conversion method used. > > AFAIK png decompression is not resource hungry. Compression *IS*. well compresion is done on host so no problem on this side, in fact WR is decompresing huge amount of text in lzma quite fast so a tiny file of 2-4Kb will be no problem > > PS For minimal png archive size, you *MUST* convert image to 1 bit indexed > palette before saving it. > If you use "greyscale", RBG or more than 1 bit palette, png will waste space > saving palette or RBG/greyscale info. totally agree :) > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
Rui Miguel Silva Seabra wrote: > On Tue, Nov 03, 2009 at 12:15:11PM +0100, David Reyes Samblas Martinez > wrote: >> Also some way to not infringe the authoring and licencing text >> includings clauses must be used by the images viewer. but I guess it >> can be done by links to text as other wikipage more. > > The problem isn't so much about WikiMedia or OpenMoko, but that the > original authors did not free the images. > > As such, whilst maybe they can be on Wikipedia, which is on a non-profit > environment, distributing on the WikiReader (which is for-profit) may > be legally problematic. I'm not sure if this is a desired workflow. But I don't think whis will be a problem if everybody builds his own wikireder offline database. Meaning, Wikireader ships and maintains a database with all safe content. And if you like more you do it yourself. PS: I think it would be a good idea to only use pics with low dnamic in the first place. There is no use to have a van Gough on a 1bit low res screen. But having maps, flags, schematics and other low dynamic stuff makes total sense. I especially think about the huge amount of svg content. I imagine, that this can be fairly easily detected. (Maybe just simply by compression factor) PPS: Apropos SVG. I guess we can keep them as some kind of vector format to save space. PPPS: We need a mailinglist Tilman ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tuesday 03 November 2009 12:15:11 David Reyes Samblas Martinez wrote: > Regarding compression, I believe lzma is already builded in the > wikireader application and it compress the images a 50%. enough for > start I guess. but I have to recongnize than the image on png looks > really good do maybe it worth the meaning to implemente it on the > device if it's not much resource hungr Both png and pbm are 1 bit images without lossy compression. You can obtain exactly the same final image quality on both formats, but png will have smaller disk size. Final result only depends on RGB->1 bit indexed conversion method used. AFAIK png decompression is not resource hungry. Compression *IS*. PS For minimal png archive size, you *MUST* convert image to 1 bit indexed palette before saving it. If you use "greyscale", RBG or more than 1 bit palette, png will waste space saving palette or RBG/greyscale info. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tue, Nov 03, 2009 at 12:15:11PM +0100, David Reyes Samblas Martinez wrote: > Regarding licensing , well until OM or/and Wikipiedia doesn't say the > contrary (for example considering Wikireader as an extension of the > Wikipedia and allow all wikipedia image to be on Wikireader) we must > stay in the save side so only explicitly free licenced images will be > safe to use, I'm working on the > http://download.wikimedia.org/enwiki/latest/enwiki-latest-image.sql.gz > table to know how many pictures we are talking about. > Also some way to not infringe the authoring and licencing text > includings clauses must be used by the images viewer. but I guess it > can be done by links to text as other wikipage more. The problem isn't so much about WikiMedia or OpenMoko, but that the original authors did not free the images. As such, whilst maybe they can be on Wikipedia, which is on a non-profit environment, distributing on the WikiReader (which is for-profit) may be legally problematic. If there's a way to automatically determine if the image is safe to copy (for instance, being licensed with a good CC license like by, by-sa) then it's doable. If not... it requires a lot of human filtering... Rui ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
Regarding compression, I believe lzma is already builded in the wikireader application and it compress the images a 50%. enough for start I guess. but I have to recongnize than the image on png looks really good do maybe it worth the meaning to implemente it on the device if it's not much resource hungry Regarding licensing , well until OM or/and Wikipiedia doesn't say the contrary (for example considering Wikireader as an extension of the Wikipedia and allow all wikipedia image to be on Wikireader) we must stay in the save side so only explicitly free licenced images will be safe to use, I'm working on the http://download.wikimedia.org/enwiki/latest/enwiki-latest-image.sql.gz table to know how many pictures we are talking about. Also some way to not infringe the authoring and licencing text includings clauses must be used by the images viewer. but I guess it can be done by links to text as other wikipage more. Regarding machine needed to do so, due we just need at maximum of 240 pixel with we can tweak the Wikix to use the thumb url like this http://upload.wikimedia.org/wikipedia/commons/thumb/f/f9/HN_Pegasi_B.jpg/240px-HN_Pegasi_B.jpg instead of the full url http://upload.wikimedia.org/wikipedia/commons/f/f9/HN_Pegasi_B.jpg and this will save us a lot disk space and a step in the process :P Also Wikix must be tweaked to just download "free licenced images" using the info on the enwiki-latest-image.sql.gz file then sure we will save a lot more disk space. David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! 2009/11/3 David Garabana Barro : > On Tuesday 03 November 2009 09:46:44 Alexander Shulgin wrote: > >> Can we run zlib and, wait-wait... libpng on the device? :) > > Good point! > png compress 1 bit images a lot! > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
Davide wrote: > > On Tuesday 03 November 2009 09:46:44 Alexander Shulgin wrote: > >> Can we run zlib and, wait-wait... libpng on the device? :) > > Good point! > png compress 1 bit images a lot! > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > > You're right! One sample done with treshold tool in gimp and saved in png format: http://tinypic.com/r/mjs58m/4 -- View this message in context: http://n2.nabble.com/wikireader-Images-on-the-WR-not-so-imposible-P-was-wikireader-Error-on-parsing-the-spanish-wikipedia-tp3935879p3937629.html Sent from the Openmoko Community mailing list archive at Nabble.com. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tuesday 03 November 2009 09:46:44 Alexander Shulgin wrote: > Can we run zlib and, wait-wait... libpng on the device? :) Good point! png compress 1 bit images a lot! ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
2009/11/3 David Reyes Samblas Martinez > An now some numbers the average image are 10kb so with the hypotesis > than there are one image per article (yes I know there articles with > more than a image but there a lot of articles without images) there > will be about 3.000.000 images so 30Gb of images :P there are any > 32Gb uSD cards out there? > Your pbm files are not compressed. I've tried compressing one with gzip and it went down by 50%. If you use some smart image format you can probably go down much more. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
David Samblas Martinez wrote: > > 2009/10/31 Sean Moss-Pultz : >> On Fri, Oct 30, 2009 at 11:29 PM, Laszlo KREKACS >> wrote: >>> On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez >>> wrote: >>>> Are you uploading this changes to git? can I take a look? >>> >>> Btw is there any plan to implement images rendering? >> >> Math (images) are on our roadmap. Hopefully before the end of this >> year. The screen is only 1bit. So anything else would look kinda >> funny. >> >> -Sean > Well due I have clear than Internationalization and running other apps > are totally posible and in fact it can be done without much hacking, I > have spend some time investigating the posibility of include image > other than maths on the device and I think is at least more closer > than it can seems. > I have find a process[1] I think it can be industrialized to transform > any image of the wikipedia to one more or less good to the device is > clear than we can expect a real time 3D zoomable render on the WR but > I think results are quite promising > > Just some questions, is hard to do a image viewer able to scroll > vertically as we do in text? > Any good tutorial of scripting using gimp? > > An now some numbers the average image are 10kb so with the hypotesis > than there are one image per article (yes I know there articles with > more than a image but there a lot of articles without images) there > will be about 3.000.000 images so 30Gb of images :P there are any > 32Gb uSD cards out there? > I have to do a more in depth analisis on how many images(meaningful) > are there using the data on the dumps of wikipedia so we will see. > > [1]http://www.tuxbrain.com/en/content/images-wikireader-posible > >> >> ___ >> Openmoko community mailing list >> community@lists.openmoko.org >> http://lists.openmoko.org/mailman/listinfo/community >> > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > > This is an awasome idea David, but you have first to consider two things: 1- Not all English wikipedia images are under a cc license or similar. There're a lot of copyrighted images: logos, photograpsh, captured images from videogames... You've one warning in this wikipedia page: http://en.wikipedia.org/wiki/Wikipedia_database#Images_and_uploaded_files 2- It's possible to automatically download all the wikipedia images using a program called wikix (http://meta.wikimedia.org/wiki/Wikix) but someone tried it back in 2007 and the result had a size of ¡¡407 gb!! (http://yousefourabi.com/blog/2007/10/download-all-wikipedia-images-with-wikix/). Then, the task of downloading all the images and convert them should be done with a very good machine or cluster. PS: I know that for spanish wikipedia copyrighted images are not allowed and we don't have the point 2 problem :P -- View this message in context: http://n2.nabble.com/wikireader-Images-on-the-WR-not-so-imposible-P-was-wikireader-Error-on-parsing-the-spanish-wikipedia-tp3935879p3937360.html Sent from the Openmoko Community mailing list archive at Nabble.com. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
hi, the wish is clear:), but the images on wikipedia are slightly problematic, see http://en.wikipedia.org/wiki/Wikipedia:Database_download#Images_and_uploaded_files and http://en.wikipedia.org/wiki/Wikipedia:Copyrights#Non-free_materials_and_special_requirements on the other hand, see:): http://meta.wikimedia.org/wiki/Wikix -Jörn David Reyes Samblas Martinez wrote: > 2009/10/31 Sean Moss-Pultz : > >> On Fri, Oct 30, 2009 at 11:29 PM, Laszlo KREKACS >> wrote: >> >>> On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez >>> wrote: >>> Are you uploading this changes to git? can I take a look? >>> Btw is there any plan to implement images rendering? >>> >> Math (images) are on our roadmap. Hopefully before the end of this >> year. The screen is only 1bit. So anything else would look kinda >> funny. >> >> -Sean >> > Well due I have clear than Internationalization and running other apps > are totally posible and in fact it can be done without much hacking, I > have spend some time investigating the posibility of include image > other than maths on the device and I think is at least more closer > than it can seems. > I have find a process[1] I think it can be industrialized to transform > any image of the wikipedia to one more or less good to the device is > clear than we can expect a real time 3D zoomable render on the WR but > I think results are quite promising > > Just some questions, is hard to do a image viewer able to scroll > vertically as we do in text? > Any good tutorial of scripting using gimp? > > An now some numbers the average image are 10kb so with the hypotesis > than there are one image per article (yes I know there articles with > more than a image but there a lot of articles without images) there > will be about 3.000.000 images so 30Gb of images :P there are any > 32Gb uSD cards out there? > I have to do a more in depth analisis on how many images(meaningful) > are there using the data on the dumps of wikipedia so we will see. > > [1]http://www.tuxbrain.com/en/content/images-wikireader-posible > > >> ___ >> Openmoko community mailing list >> community@lists.openmoko.org >> http://lists.openmoko.org/mailman/listinfo/community >> >> > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > > smime.p7s Description: S/MIME Cryptographic Signature ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
On Tue, Nov 3, 2009 at 03:06, David Reyes Samblas Martinez wrote: > > An now some numbers the average image are 10kb so with the hypotesis > than there are one image per article (yes I know there articles with > more than a image but there a lot of articles without images) there > will be about 3.000.000 images so 30Gb of images :P there are any > 32Gb uSD cards out there? Can we run zlib and, wait-wait... libpng on the device? :) -- Alex ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
2009/11/3 David Reyes Samblas Martinez : > 2009/10/31 Sean Moss-Pultz : >> On Fri, Oct 30, 2009 at 11:29 PM, Laszlo KREKACS >> wrote: >>> On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez >>> wrote: Are you uploading this changes to git? can I take a look? >>> >>> Btw is there any plan to implement images rendering? >> >> Math (images) are on our roadmap. Hopefully before the end of this >> year. The screen is only 1bit. So anything else would look kinda >> funny. >> >> -Sean > Well due I have clear than Internationalization and running other apps > are totally posible and in fact it can be done without much hacking, I > have spend some time investigating the posibility of include image > other than maths on the device and I think is at least more closer > than it can seems. > I have find a process[1] I think it can be industrialized to transform > any image of the wikipedia to one more or less good to the device is > clear than we can expect a real time 3D zoomable render on the WR but > I think results are quite promising > > Just some questions, is hard to do a image viewer able to scroll > vertically as we do in text? > Any good tutorial of scripting using gimp? > > An now some numbers the average image are 10kb so with the hypotesis > than there are one image per article (yes I know there articles with > more than a image but there a lot of articles without images) there > will be about 3.000.000 images so 30Gb of images :P there are any > 32Gb uSD cards out there? > I have to do a more in depth analisis on how many images(meaningful) > are there using the data on the dumps of wikipedia so we will see. > > [1]http://www.tuxbrain.com/en/content/images-wikireader-posible > >> >> ___ >> Openmoko community mailing list >> community@lists.openmoko.org >> http://lists.openmoko.org/mailman/listinfo/community >> > [...]It's clear we can NOT expect 3D render[...] ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
[wikireader] Images on the WR not so imposible :P [was [wikireader]Error on parsing the spanish wikipedia]
2009/10/31 Sean Moss-Pultz : > On Fri, Oct 30, 2009 at 11:29 PM, Laszlo KREKACS > wrote: >> On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez >> wrote: >>> Are you uploading this changes to git? can I take a look? >> >> Btw is there any plan to implement images rendering? > > Math (images) are on our roadmap. Hopefully before the end of this > year. The screen is only 1bit. So anything else would look kinda > funny. > > -Sean Well due I have clear than Internationalization and running other apps are totally posible and in fact it can be done without much hacking, I have spend some time investigating the posibility of include image other than maths on the device and I think is at least more closer than it can seems. I have find a process[1] I think it can be industrialized to transform any image of the wikipedia to one more or less good to the device is clear than we can expect a real time 3D zoomable render on the WR but I think results are quite promising Just some questions, is hard to do a image viewer able to scroll vertically as we do in text? Any good tutorial of scripting using gimp? An now some numbers the average image are 10kb so with the hypotesis than there are one image per article (yes I know there articles with more than a image but there a lot of articles without images) there will be about 3.000.000 images so 30Gb of images :P there are any 32Gb uSD cards out there? I have to do a more in depth analisis on how many images(meaningful) are there using the data on the dumps of wikipedia so we will see. [1]http://www.tuxbrain.com/en/content/images-wikireader-posible > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Sat, Oct 31, 2009 at 2:46 AM, David Reyes Samblas Martinez wrote: > just an think I realized , all faulty articles the title starts with > the "~" simbol David No that's not a problem. That character gets removed in a later build stage. We had to add that because of a integer conversion issue with SQLite. It was automatically converting articles like "1984" into integers (not strings) and storing them in the database. SQLite, BTW, claims this is a "feature". Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 11:29 PM, Laszlo KREKACS wrote: > On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez > wrote: >> Are you uploading this changes to git? can I take a look? > > Btw is there any plan to implement images rendering? Math (images) are on our roadmap. Hopefully before the end of this year. The screen is only 1bit. So anything else would look kinda funny. -Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 11:22 PM, David Reyes Samblas Martinez wrote: > Are you uploading this changes to git? can I take a look? Yes. The latest commit fixes it. Have a look here: http://github.com/wikireader/wikireader Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
2009/10/30 Laszlo KREKACS : > On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez > wrote: >> Are you uploading this changes to git? can I take a look? > > Btw is there any plan to implement images rendering? > > If so, any time estimation? > > Best regards, > Laszlo > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > Some kind of renderer has been already implemented because keyboard, and the erase history dialog are images . I'm wrong? ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
just an think I realized , all faulty articles the title starts with the "~" simbol regards David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! 2009/10/30 David Reyes Samblas Martinez : > Are you uploading this changes to git? can I take a look? > > David Reyes Samblas Martinez > http://www.tuxbrain.com > Open ultraportable & embedded solutions > Openmoko, Openpandora, Arduino > Hey, watch out!!! There's a linux in your pocket!!! > > > > > 2009/10/30 Sean Moss-Pultz : >> On Fri, Oct 30, 2009 at 4:50 AM, David Reyes Samblas Martinez >> wrote: >>> Hi I'm trying to generate the file for a spainsh wikipedia on the WR , >>> after compiling succsesfuly the source on the git and solve some >>> annoyings with utf8 encoding on phyton error was somthing like this: >>> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in >>> position: ordinal not in range(128) >>> this was solved changing the default encode "ascii" to "utf8" int the >>> /usr/lib/python2.6/site.py file >>> after this I was hable to execute ok the instruction: >>> make DESTDIR=image WORKDIR=work >>> XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index >>> parse render combine >>> >>> Every thing seem fine for a couple(about 6-7h) of hours parsing the >>> 70 articles in spanish but then ... the horror >>> Count: 38 >>> Traceback (most recent call last): >>> File "./ArticleParser.py", line 224, in >>> main() >>> File "./ArticleParser.py", line 172, in main >>> process_article_text(title.encode('utf-8'), f.read(length), newf) >>> File "./ArticleParser.py", line 218, in process_article_text >>> newf.write(text + '\n') >>> IOError: [Errno 32] Broken pipe >>> make[1]: *** [parse] Error 1 >>> make[1]: se sale del directorio >>> `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' >>> make: *** [parse] Error 2 >> >> OK that's fixed now. Chris already checked in the code. Our build >> worked fine. We need to do a few more tweaks and then we can post a >> (super) early test image. Give us until early this coming week. >> >> -Sean >> >> ___ >> Openmoko community mailing list >> community@lists.openmoko.org >> http://lists.openmoko.org/mailman/listinfo/community >> > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 4:22 PM, David Reyes Samblas Martinez wrote: > Are you uploading this changes to git? can I take a look? Btw is there any plan to implement images rendering? If so, any time estimation? Best regards, Laszlo ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
Are you uploading this changes to git? can I take a look? David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! 2009/10/30 Sean Moss-Pultz : > On Fri, Oct 30, 2009 at 4:50 AM, David Reyes Samblas Martinez > wrote: >> Hi I'm trying to generate the file for a spainsh wikipedia on the WR , >> after compiling succsesfuly the source on the git and solve some >> annoyings with utf8 encoding on phyton error was somthing like this: >> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in >> position: ordinal not in range(128) >> this was solved changing the default encode "ascii" to "utf8" int the >> /usr/lib/python2.6/site.py file >> after this I was hable to execute ok the instruction: >> make DESTDIR=image WORKDIR=work >> XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index >> parse render combine >> >> Every thing seem fine for a couple(about 6-7h) of hours parsing the >> 70 articles in spanish but then ... the horror >> Count: 38 >> Traceback (most recent call last): >> File "./ArticleParser.py", line 224, in >> main() >> File "./ArticleParser.py", line 172, in main >> process_article_text(title.encode('utf-8'), f.read(length), newf) >> File "./ArticleParser.py", line 218, in process_article_text >> newf.write(text + '\n') >> IOError: [Errno 32] Broken pipe >> make[1]: *** [parse] Error 1 >> make[1]: se sale del directorio >> `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' >> make: *** [parse] Error 2 > > OK that's fixed now. Chris already checked in the code. Our build > worked fine. We need to do a few more tweaks and then we can post a > (super) early test image. Give us until early this coming week. > > -Sean > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 4:50 AM, David Reyes Samblas Martinez wrote: > Hi I'm trying to generate the file for a spainsh wikipedia on the WR , > after compiling succsesfuly the source on the git and solve some > annoyings with utf8 encoding on phyton error was somthing like this: > UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in > position: ordinal not in range(128) > this was solved changing the default encode "ascii" to "utf8" int the > /usr/lib/python2.6/site.py file > after this I was hable to execute ok the instruction: > make DESTDIR=image WORKDIR=work > XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index > parse render combine > > Every thing seem fine for a couple(about 6-7h) of hours parsing the > 70 articles in spanish but then ... the horror > Count: 38 > Traceback (most recent call last): > File "./ArticleParser.py", line 224, in > main() > File "./ArticleParser.py", line 172, in main > process_article_text(title.encode('utf-8'), f.read(length), newf) > File "./ArticleParser.py", line 218, in process_article_text > newf.write(text + '\n') > IOError: [Errno 32] Broken pipe > make[1]: *** [parse] Error 1 > make[1]: se sale del directorio > `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' > make: *** [parse] Error 2 OK that's fixed now. Chris already checked in the code. Our build worked fine. We need to do a few more tweaks and then we can post a (super) early test image. Give us until early this coming week. -Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 7:58 AM, Nelson Castillo wrote: > On Thu, Oct 29, 2009 at 6:54 PM, David Reyes Samblas Martinez > wrote: >> Great! :) good to see you are working on this!, please count on me for >> any testing to be done, I will try to make a look on the code myself >> to kill the bug but no time and nor expertise so no promises :P > > I haven't seen the code but if you don't feel like fixing it now you > can add a try/catch on the block that is processing each page so that > you have a wiki to play with while the error is fixed. Yeah we're trying exactly that Nelson. It's just a long process to render all this stuff. We actually have 9 quad-core systems running in parallel now. Each with at least six GB of ram :-) -Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Fri, Oct 30, 2009 at 7:54 AM, David Reyes Samblas Martinez wrote: > Great! :) good to see you are working on this!, please count on me for > any testing to be done, I will try to make a look on the code myself > to kill the bug but no time and nor expertise so no promises :P We'll get it working. Just give us a bit of time. And it would be super helpful if you could help test / QA. Thanks a lot for the offer! -Sean ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
On Thu, Oct 29, 2009 at 6:54 PM, David Reyes Samblas Martinez wrote: > Great! :) good to see you are working on this!, please count on me for > any testing to be done, I will try to make a look on the code myself > to kill the bug but no time and nor expertise so no promises :P I haven't seen the code but if you don't feel like fixing it now you can add a try/catch on the block that is processing each page so that you have a wiki to play with while the error is fixed. ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
Great! :) good to see you are working on this!, please count on me for any testing to be done, I will try to make a look on the code myself to kill the bug but no time and nor expertise so no promises :P David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! 2009/10/30 Sean Moss-Pultz : > David > > We're working on exactly the same thing now :-) > > I'll ask Chris to email the list once we get past it. I think the > problem is with the mixtures of different encodings (latin-1 and > UTF-8) in the Spanish Wikipedia and the way our code is handling this. > For some reason Python's print (at times) wants to default to ascii, > even after we explicitly tell it to use UTF-8. > > -Sean > > > On Fri, Oct 30, 2009 at 4:50 AM, David Reyes Samblas Martinez > wrote: >> >> Hi I'm trying to generate the file for a spainsh wikipedia on the WR , >> after compiling succsesfuly the source on the git and solve some >> annoyings with utf8 encoding on phyton error was somthing like this: >> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in >> position: ordinal not in range(128) >> this was solved changing the default encode "ascii" to "utf8" int the >> /usr/lib/python2.6/site.py file >> after this I was hable to execute ok the instruction: >> make DESTDIR=image WORKDIR=work >> XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index >> parse render combine >> >> Every thing seem fine for a couple(about 6-7h) of hours parsing the >> 70 articles in spanish but then ... the horror >> Count: 38 >> Traceback (most recent call last): >> File "./ArticleParser.py", line 224, in >> main() >> File "./ArticleParser.py", line 172, in main >> process_article_text(title.encode('utf-8'), f.read(length), newf) >> File "./ArticleParser.py", line 218, in process_article_text >> newf.write(text + '\n') >> IOError: [Errno 32] Broken pipe >> make[1]: *** [parse] Error 1 >> make[1]: se sale del directorio >> `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' >> make: *** [parse] Error 2 >> >> I have relaunched the process again with the (few)hope that was a >> temporary fault but If any one has a clue will be helpfull. >> >> BTW.- I documenting all this proccess to make a step by step howto on >> how to put the wikipedia in other languages on the wikireader. >> >> >> >> David Reyes Samblas Martinez >> http://www.tuxbrain.com >> Open ultraportable & embedded solutions >> Openmoko, Openpandora, Arduino >> Hey, watch out!!! There's a linux in your pocket!!! >> >> ___ >> Openmoko community mailing list >> community@lists.openmoko.org >> http://lists.openmoko.org/mailman/listinfo/community > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community > ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
Re: [wikireader]Error on parsing the spanish wikipedia
David We're working on exactly the same thing now :-) I'll ask Chris to email the list once we get past it. I think the problem is with the mixtures of different encodings (latin-1 and UTF-8) in the Spanish Wikipedia and the way our code is handling this. For some reason Python's print (at times) wants to default to ascii, even after we explicitly tell it to use UTF-8. -Sean On Fri, Oct 30, 2009 at 4:50 AM, David Reyes Samblas Martinez wrote: > > Hi I'm trying to generate the file for a spainsh wikipedia on the WR , > after compiling succsesfuly the source on the git and solve some > annoyings with utf8 encoding on phyton error was somthing like this: > UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in > position: ordinal not in range(128) > this was solved changing the default encode "ascii" to "utf8" int the > /usr/lib/python2.6/site.py file > after this I was hable to execute ok the instruction: > make DESTDIR=image WORKDIR=work > XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index > parse render combine > > Every thing seem fine for a couple(about 6-7h) of hours parsing the > 70 articles in spanish but then ... the horror > Count: 38 > Traceback (most recent call last): > File "./ArticleParser.py", line 224, in > main() > File "./ArticleParser.py", line 172, in main > process_article_text(title.encode('utf-8'), f.read(length), newf) > File "./ArticleParser.py", line 218, in process_article_text > newf.write(text + '\n') > IOError: [Errno 32] Broken pipe > make[1]: *** [parse] Error 1 > make[1]: se sale del directorio > `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' > make: *** [parse] Error 2 > > I have relaunched the process again with the (few)hope that was a > temporary fault but If any one has a clue will be helpfull. > > BTW.- I documenting all this proccess to make a step by step howto on > how to put the wikipedia in other languages on the wikireader. > > > > David Reyes Samblas Martinez > http://www.tuxbrain.com > Open ultraportable & embedded solutions > Openmoko, Openpandora, Arduino > Hey, watch out!!! There's a linux in your pocket!!! > > ___ > Openmoko community mailing list > community@lists.openmoko.org > http://lists.openmoko.org/mailman/listinfo/community ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community
[wikireader]Error on parsing the spanish wikipedia
Hi I'm trying to generate the file for a spainsh wikipedia on the WR , after compiling succsesfuly the source on the git and solve some annoyings with utf8 encoding on phyton error was somthing like this: UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position: ordinal not in range(128) this was solved changing the default encode "ascii" to "utf8" int the /usr/lib/python2.6/site.py file after this I was hable to execute ok the instruction: make DESTDIR=image WORKDIR=work XML_FILES=xml-file-samples/eswiki-latest-pages-articles.xml index parse render combine Every thing seem fine for a couple(about 6-7h) of hours parsing the 70 articles in spanish but then ... the horror Count: 38 Traceback (most recent call last): File "./ArticleParser.py", line 224, in main() File "./ArticleParser.py", line 172, in main process_article_text(title.encode('utf-8'), f.read(length), newf) File "./ArticleParser.py", line 218, in process_article_text newf.write(text + '\n') IOError: [Errno 32] Broken pipe make[1]: *** [parse] Error 1 make[1]: se sale del directorio `/OE/Proyectos/tuxbrain/productos/wikireader/wikireader/host-tools/offline-renderer' make: *** [parse] Error 2 I have relaunched the process again with the (few)hope that was a temporary fault but If any one has a clue will be helpfull. BTW.- I documenting all this proccess to make a step by step howto on how to put the wikipedia in other languages on the wikireader. David Reyes Samblas Martinez http://www.tuxbrain.com Open ultraportable & embedded solutions Openmoko, Openpandora, Arduino Hey, watch out!!! There's a linux in your pocket!!! ___ Openmoko community mailing list community@lists.openmoko.org http://lists.openmoko.org/mailman/listinfo/community