*Hi,* *Kindly let me know if you are comfortable on below position.*
*Position: Big Data or Data Scientist* *Location: Wilmington, DE* *Length – 1 to 2 years* *Interview process: Onsite Interview only (might be able to do skype )* *Rate: Open * *Notes: * This is a Data Scientist position Understands how to showcase data to the business - been through a Hadoop transformation! - machine learning experience with Java experience needed - must have solid Kafka, Spark, Hive, Solr, Scala - Must have 10/10 communication skills *Job description * Data scientists are big data wranglers. They take an enormous mass of messy data points (unstructured and structured) and use their formidable skills in math, statistics and programming to clean, massage and organize them. Then they apply all their analytic powers – industry knowledge, contextual understanding, skepticism of existing assumptions – to uncover hidden solutions to business challenges. - Conduct undirected research and frame open-ended industry questions - Extract huge volumes of data from multiple internal and external sources - Employ sophisticated analytics programs, machine learning and statistical methods to prepare data for use in predictive and prescriptive modeling - Thoroughly clean and prune data to discard irrelevant information - Explore and examine data from a variety of angles to determine hidden weaknesses, trends and/or opportunities - Devise data-driven solutions to the most pressing challenges - Invent new algorithms to solve problems and build new tools to automate work - Communicate predictions and findings to management and IT departments through effective data visualizations and reports - Recommend cost-effective changes to existing procedures and strategies *Technical Skills* - Math (e.g. linear algebra, calculus and probability) - Statistics (e.g. hypothesis testing and summary statistics) - Machine learning tools and techniques (e.g. k-nearest neighbors, random forests, ensemble methods, etc.) - Software engineering skills (e.g. distributed computing, algorithms and data structures) - Data mining - Data cleaning and munging - Data visualization (e.g. ggplot and d3.js) and reporting techniques - Unstructured data techniques - SQL databases and database querying languages - Python (most common),Java, Perl - Big data platforms like Hadoop, Hive & Pig - Cloud tools like Amazon S3 is a plus Regards *Mayank* 978-558-4666 x 103 *may...@teknavigators.com* <may...@teknavigators.com> *TekNavigators LLC* -- You received this message because you are subscribed to the Google Groups "Citrix and Sap problems" group. To unsubscribe from this group and stop receiving emails from it, send an email to citrix-and-sap-problems+unsubscr...@googlegroups.com. To post to this group, send email to citrix-and-sap-problems@googlegroups.com. Visit this group at https://groups.google.com/group/citrix-and-sap-problems. For more options, visit https://groups.google.com/d/optout.