Yibo Cai created ARROW-15878:
--------------------------------

             Summary: [C++] Optimize csv writer for string with quotes
                 Key: ARROW-15878
                 URL: https://issues.apache.org/jira/browse/ARROW-15878
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: Yibo Cai
            Assignee: Yibo Cai

Escaping a string that contains quotes (inserting an extra quote before each quote) is the hotspot of the CSV writer [1]. This can probably be improved. Possible approaches:

- Find the next quote with memchr, then memcpy the quote-free block that precedes it.
- Check 8 or 16 bytes at a time for quotes with SIMD; memcpy the chunk if it contains none, otherwise take the slow path.

We should make sure the new method doesn't degrade performance too much for strings with many quotes, or for short strings.
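
The two approaches could be sketched roughly as follows. This is only an illustration under assumptions, not the existing Arrow writer code: the names EscapeQuotesMemchr, EscapeQuotesChunked, HasQuote8 and EscapeToString are hypothetical, the output buffer is assumed to hold at least 2 * len bytes, and the 8-byte check uses a portable integer bit trick in place of real SIMD intrinsics.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string>
#include <string_view>

// Approach 1 (hypothetical helper): find the next quote with memchr and
// memcpy the quote-free run before it. `out` must have room for 2 * len bytes.
// Returns a pointer one past the last byte written.
char* EscapeQuotesMemchr(const char* in, size_t len, char* out) {
  assert(out != nullptr);
  const char* end = in + len;
  while (in < end) {
    const size_t remaining = static_cast<size_t>(end - in);
    const char* quote =
        static_cast<const char*>(std::memchr(in, '"', remaining));
    const size_t run = quote ? static_cast<size_t>(quote - in) : remaining;
    std::memcpy(out, in, run);  // bulk-copy the block without quotes
    out += run;
    in += run;
    if (quote) {
      *out++ = '"';  // extra quote
      *out++ = '"';  // original quote
      ++in;
    }
  }
  return out;
}

// Returns true if any of the 8 bytes at `p` equals '"' (0x22).
// XOR turns matching bytes into zero, then the classic "has zero byte"
// bit trick tests for one. A stand-in for a real SIMD comparison.
inline bool HasQuote8(const char* p) {
  uint64_t v;
  std::memcpy(&v, p, sizeof(v));
  const uint64_t x = v ^ 0x2222222222222222ULL;
  return ((x - 0x0101010101010101ULL) & ~x & 0x8080808080808080ULL) != 0;
}

// Approach 2 (hypothetical helper): test 8 bytes at a time, memcpy the chunk
// when it has no quote, otherwise escape it byte by byte (the slow path).
char* EscapeQuotesChunked(const char* in, size_t len, char* out) {
  assert(out != nullptr);
  auto slow_path = [&out](const char* p, size_t n) {
    for (size_t j = 0; j < n; ++j) {
      if (p[j] == '"') *out++ = '"';
      *out++ = p[j];
    }
  };
  size_t i = 0;
  for (; i + 8 <= len; i += 8) {
    if (!HasQuote8(in + i)) {
      std::memcpy(out, in + i, 8);
      out += 8;
    } else {
      slow_path(in + i, 8);
    }
  }
  slow_path(in + i, len - i);  // tail shorter than 8 bytes
  return out;
}

// Convenience wrapper for trying either function on a std::string_view.
using EscapeFn = char* (*)(const char*, size_t, char*);
std::string EscapeToString(std::string_view in, EscapeFn fn) {
  std::string out(2 * in.size(), '\0');  // worst case: every byte is a quote
  char* end = fn(in.data(), in.size(), out.data());
  out.resize(static_cast<size_t>(end - out.data()));
  return out;
}
```

For example, EscapeToString("a\"b", EscapeQuotesMemchr) yields a""b. Which variant wins for short strings or strings with many quotes would have to be measured with the CSV writer benchmark; the sketch makes no claim about that.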