Re: [Dspace-tech] How to enrich metadata with special characters?
Bram Luyten schrieb: I was recently looking for a good resource on what's included in the unicode tables and what isn't. This one was one of the best I could find: http://barzilai.org/math_sym.htm That's a nice overview, but only a small subset of available unicode characters (see http://www.unicode.org/charts/symbols.html). But (repeating the previous posts), this doesn't cover all symbols necessary to express mathematical (or for example chemical) formulas. Especially when thinking about OAI, I'm really wondering if _any_ standards or agreements apply ? Does anyone know how they do this in big reference repositories ? In our library system there is a coding rule for the handling of sub- and superscripts. Carbon dioxide (HTML COsub2/sub) would be coded CO_2 and the Uranium Isotope 238 (HTML sup238/supsub92/supU) as 238_92_U. This is not useful for reading and understanding (because the coding results are ambiguous). But while coding the metadata this way it is possible to formulate (and transform) queries that can match the metadata. Maybe I was just unlucky after looking for a few references, but it doesn't seem that these submissions in Arxiv include special symbols: http://arxiv.org/abs/0811.1165 http://arxiv.org/abs/0811.0548 They use TeX and hope that anyone can understand it. Best Greetings Robert - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] How to enrich metadata with special characters?
Unfortunately there were no responds to my question. Whenever You include a markup-language in the field values, You will have problems during searching. When You search for example for carbon dioxide as 'CO2', and You encoded it as 'COsub2/sub', You will never find it! A possible way to solve the problem would be to split metadata values into two fields: one for harvesting / indexing and one for presenting - like dc.title and dc.title.display. Then it would (theoretically) be possible to include something like HTML or TeX in the presentation field and allow DSpace to render it, while doing the search on the normal field. But to do the rendering one must modify the DSpace code... We still have no practical solution for this problem. Best regards Robert Yuyun Wirawati ISHAK (LIBRIS) schrieb: Hi Robert I'm having the same question. Do you have any answer to this query yet? Thanks in advance, Yuyun NIE Library Singapore Hi, I just wonder how to add special characters to metadata. Unicode-Escapes like #x1D6C0; work - but there are things in the world which are not captured by the unicode tables. One example is a superscript infinity sign, like in http://elib.ub.uni-osnabrueck.de/publications/pics/Omega-S.gif http://elib.ub.uni-osnabrueck.de/publications/pics/Omega-S.gif . Would it be possible to allow MathML or LaTeX to be rendered? Or at least to allow a selected set of HTML-tags to handle these things, like sub, sup and img? Best Regards Robert PS: While playing around to insert unicode-entities, I discovered a bug: whenever ampersands occur in a field, the last one is escaped as amp; in the rendered item page, but displayed correct in the text field for metadata editing. After actualising / saving the item a second time the characters are displayed as expected. -- === Robert Roggenbuck Universitaetsbibliothek Osnabrueck Osnabrueck Germany [EMAIL PROTECTED] === - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] How to enrich metadata with special characters?
I was recently looking for a good resource on what's included in the unicode tables and what isn't. This one was one of the best I could find: http://barzilai.org/math_sym.htm But (repeating the previous posts), this doesn't cover all symbols necessary to express mathematical (or for example chemical) formulas. Especially when thinking about OAI, I'm really wondering if _any_ standards or agreements apply ? Does anyone know how they do this in big reference repositories ? Maybe I was just unlucky after looking for a few references, but it doesn't seem that these submissions in Arxiv include special symbols: http://arxiv.org/abs/0811.1165 http://arxiv.org/abs/0811.0548 In any case, interesting topic and hope there are some good solutions out there. regards, Bram Luyten @mire NV Romeinse Straat 18 3001 Heverlee Belgium +32 2 888 29 56 http://www.atmire.com - Institutional Repository Solutions http://www.togather.eu - Before getting together, get [EMAIL PROTECTED] On Fri, Nov 14, 2008 at 9:36 AM, Robert Roggenbuck [EMAIL PROTECTED] wrote: Unfortunately there were no responds to my question. Whenever You include a markup-language in the field values, You will have problems during searching. When You search for example for carbon dioxide as 'CO2', and You encoded it as 'COsub2/sub', You will never find it! A possible way to solve the problem would be to split metadata values into two fields: one for harvesting / indexing and one for presenting - like dc.title and dc.title.display. Then it would (theoretically) be possible to include something like HTML or TeX in the presentation field and allow DSpace to render it, while doing the search on the normal field. But to do the rendering one must modify the DSpace code... We still have no practical solution for this problem. Best regards Robert Yuyun Wirawati ISHAK (LIBRIS) schrieb: Hi Robert I'm having the same question. Do you have any answer to this query yet? Thanks in advance, Yuyun NIE Library Singapore Hi, I just wonder how to add special characters to metadata. Unicode-Escapes like #x1D6C0; work - but there are things in the world which are not captured by the unicode tables. One example is a superscript infinity sign, like in http://elib.ub.uni-osnabrueck.de/publications/pics/Omega-S.gif http://elib.ub.uni-osnabrueck.de/publications/pics/Omega-S.gif . Would it be possible to allow MathML or LaTeX to be rendered? Or at least to allow a selected set of HTML-tags to handle these things, like sub, sup and img? Best Regards Robert PS: While playing around to insert unicode-entities, I discovered a bug: whenever ampersands occur in a field, the last one is escaped as amp; in the rendered item page, but displayed correct in the text field for metadata editing. After actualising / saving the item a second time the characters are displayed as expected. -- === Robert Roggenbuck Universitaetsbibliothek Osnabrueck Osnabrueck Germany [EMAIL PROTECTED] === - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
[Dspace-tech] How to enrich metadata with special characters?
Hi, I just wonder how to add special characters to metadata. Unicode-Escapes like #x1D6C0; work - but there are things in the world which are not captured by the unicode tables. One example is a superscript infinity sign, like in http://elib.ub.uni-osnabrueck.de/publications/pics/Omega-S.gif . Would it be possible to allow MathML or LaTeX to be rendered? Or at least to allow a selected set of HTML-tags to handle these things, like sub, sup and img? Best Regards Robert PS: While playing around to insert unicode-entities, I discovered a bug: whenever ampersands occur in a field, the last one is escaped as amp; in the rendered item page, but displayed correct in the text field for metadata editing. After actualising / saving the item a second time the characters are displayed as expected. -- === Robert Roggenbuck Universitaetsbibliothek Osnabrueck Germany === - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech