Also if you're not yet creating rels (i.e. read your writes you should also be able to up the periodic commit to 50k)
Michael Am 05.03.2014 um 08:29 schrieb Michael Hunger <michael.hun...@neopersistence.com>: > Yep, > > it would be also interesting how you ran this? With neo4j-shell? Against a > running server? > Did you configure any RAM or memory mapping setting in neo4j.properties? > > Check out this blog post for some hints on memory config: > http://blog.bruggen.com/2014/02/some-neo4j-import-tweaks-what-and-where.html?view=sidebar > Note that on windows the heap settings include the mmio settings unlike other > OS'es. > > Michael > > Am 04.03.2014 um 17:22 schrieb Mark Needham <m.h.need...@gmail.com>: > >> Hi Aram, >> >> * Do you have any other information of the spec of the machine you're >> running this on? e.g. how much RAM etc >> * Have you tried upping the value to PERIODIC COMMIT? Perhaps try it out >> with a smaller subset of the data to measure the impact - try it with values >> of 1,000 / 10,000 perhaps. >> * I think it would be interesting to pull out some other things as nodes as >> well - might lead to more interesting queries e.g. CEO, Location, Registered >> Agent, DOS Process, Jurisdiction could all be nodes that link back to a DOS. >> >> Let me know if any of that doesn't make sense. >> Mark >> >> >> On 4 March 2014 15:54, Aram Chung <aramol...@gmail.com> wrote: >> Hi, >> >> I was asked to post this here by Mark Needham (@markhneedham) who thought my >> query took longer than it should. >> >> I'm trying to see how graph databases could be used in investigative >> journalism: I was loading in New York State's Active Corporations: Beginning >> 1800 data from >> https://data.ny.gov/Economic-Development/Active-Corporations-Beginning-1800/n9v6-gdp6 >> as a 1964486-row csv (and deleted all U+F8FF characters, because I was >> getting "[null] is not a supported property value"). The Cypher query I used >> was >> >> USING PERIODIC COMMIT 500 >> LOAD CSV >> FROM >> "file://path/to/csv/Active_Corporations___Beginning_1800__without_header__wonky_characters_fixed.csv" >> AS company >> CREATE (:DataActiveCorporations >> { >> DOS_ID:company[0], >> Current_Entity_Name:company[1], >> Initial_DOS_Filing_Date:company[2], >> County:company[3], >> Jurisdiction:company[4], >> Entity_Type:company[5], >> >> DOS_Process_Name:company[6], >> DOS_Process_Address_1:company[7], >> DOS_Process_Address_2:company[8], >> DOS_Process_City:company[9], >> DOS_Process_State:company[10], >> DOS_Process_Zip:company[11], >> >> CEO_Name:company[12], >> CEO_Address_1:company[13], >> CEO_Address_2:company[14], >> CEO_City:company[15], >> CEO_State:company[16], >> CEO_Zip:company[17], >> >> Registered_Agent_Name:company[18], >> Registered_Agent_Address_1:company[19], >> Registered_Agent_Address_2:company[20], >> Registered_Agent_City:company[21], >> Registered_Agent_State:company[22], >> Registered_Agent_Zip:company[23], >> >> Location_Name:company[24], >> Location_Address_1:company[25], >> Location_Address_2:company[26], >> Location_City:company[27], >> Location_State:company[28], >> Location_Zip:company[29] >> } >> ); >> >> Each row is one node so it's as close to the raw data as possible. The idea >> is loosely that these nodes will be linked with new nodes representing >> people and addresses verified by reporters. >> >> This is what I got: >> >> +-------------------+ >> | No data returned. | >> +-------------------+ >> Nodes created: 1964486 >> Properties set: 58934580 >> Labels added: 1964486 >> 4550855 ms >> >> Some context information: >> Neo4j Milestone Release 2.1.0-M01 >> Windows 7 >> java version "1.7.0_03" >> >> Best, >> Aram >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Neo4j" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to neo4j+unsubscr...@googlegroups.com. >> For more options, visit https://groups.google.com/groups/opt_out. >> >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Neo4j" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to neo4j+unsubscr...@googlegroups.com. >> For more options, visit https://groups.google.com/groups/opt_out. > -- You received this message because you are subscribed to the Google Groups "Neo4j" group. To unsubscribe from this group and stop receiving emails from it, send an email to neo4j+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/groups/opt_out.