Best way to process this dataset

Raymond Xie Mon, 18 Jun 2018 19:29:50 -0700

I have a 3.6GB csv dataset (4 columns, 100,150,807 rows), my environment is
20GB ssd harddisk and 2GB RAM.


The dataset comes with
User ID: 987,994
Item ID: 4,162,024
Category ID: 9,439
Behavior type ('pv', 'buy', 'cart', 'fav')
Unix Timestamp: span between November 25 to December 03, 2017

I would like to hear any suggestion from you on how should I process the
dataset with my current environment.

Thank you.

*------------------------------------------------*
*Sincerely yours,*


*Raymond*

Best way to process this dataset

Reply via email to