[ 
http://issues.apache.org/jira/browse/SOLR-66?page=comments#action_12447901 ] 
            
Yonik Seeley commented on SOLR-66:
----------------------------------

> How to encode 'comma'?

For standard CSV, ytou could quote the entire field value...   "a,b"
I don't know if Commons CSV supports backslash escaping or not, but that would 
be another way.

> How to encode UTF-8?

Two ways... the user can define a charset for the file (and the file could 
actually be UTF-8),
and we can support unicode escapes \u1234

> Should we use Base64 and encode raw values?

I hadn't thought about binary fields (they aren't even supported in the XML 
update yet).
Doing Base64 would seem relatively easy though.


> bulk data loader
> ----------------
>
>                 Key: SOLR-66
>                 URL: http://issues.apache.org/jira/browse/SOLR-66
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Yonik Seeley
>         Assigned To: Yonik Seeley
>
> A way to efficiently load simple formatted text files, including CSV files.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to