I have used the following configuration with the DIH in Solr 5.0, and it runs very fast:

    <dataSource name="URL" type="FileDataSource" />
    <document name="map">
      <entity name="sites"
              processor="org.apache.solr.handler.dataimport.LineEntityProcessor"
              dataSource="URL"
              url="/opt/solr/solrfiles/descriptions.csv"
              rootEntity="false"
              separator=","
              transformer="RegexTransformer">
        <field column="rawLine" regex="([^,]+),([^,]+),(.*)$" groupNames="id,mydata," />
      </entity>
    </document>

LineEntityProcessor puts each line of the file into the rawLine column, and RegexTransformer then splits it into the fields listed in groupNames. You can try this to get your fields from the regex groups.

Regards,
Swaraj Kumar
Senior Software Engineer I MakeMyTrip.com
+91-9811774497
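To see what the regex in the config above extracts from a line, here is a minimal standalone sketch using plain java.util.regex. It is not Solr code and only approximates what RegexTransformer does. The class name RawLineSplit, the split helper, and the sample line are made up for illustration. It also assumes that the empty third entry in groupNames means the third group is not mapped to any field.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RawLineSplit {
    // Same pattern as the regex attribute in the config above.
    private static final Pattern LINE = Pattern.compile("([^,]+),([^,]+),(.*)$");

    // Mirrors groupNames="id,mydata," : the third name is empty,
    // so (by assumption) the third group is not stored anywhere.
    private static final String[] GROUP_NAMES = {"id", "mydata", ""};

    // Hypothetical helper: maps each named group to its captured text.
    static Map<String, String> split(String rawLine) {
        Map<String, String> fields = new LinkedHashMap<>();
        Matcher m = LINE.matcher(rawLine);
        if (!m.find()) {
            return fields; // no match: nothing extracted
        }
        for (int i = 0; i < GROUP_NAMES.length; i++) {
            if (!GROUP_NAMES[i].isEmpty()) {
                fields.put(GROUP_NAMES[i], m.group(i + 1));
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        // Made-up sample line; the remainder after the second comma may itself contain commas.
        System.out.println(split("42,Beach resort,free text, with commas"));
        // prints {id=42, mydata=Beach resort}
    }
}
```

Because the first two groups are `[^,]+`, a line needs at least two commas and non-empty first and second columns to match; lines that do not match produce no fields in this sketch.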