Purvil Dave

Udacity, Machine Learning Nanodegree, Data Science




• Previously worked as a data scientist and software engineer at SeeScan for about 5 years. I was one of the employees who started a data science team at SeeScan from the ground up.
• Strong Python and Java programming skills with thorough knowledge of data structures and algorithms.
• In-depth knowledge of machine learning, distributed systems, and data analysis.
• 3+ years of experience using Python to develop machine learning models and perform data analysis.
• Experience across the complete Software Development Life Cycle (SDLC): proposing solutions to clients, gathering user requirements, system analysis and design, architecture and implementation, testing, deployment, and technical documentation.
• Technical Skills
  – Programming: Python, Java
  – Data analysis and visualization: NumPy, Pandas, Seaborn, Matplotlib, Tableau, Jupyter
  – Machine learning: Scikit-learn, MLlib, TensorFlow, Keras
  – Big data tools: Hadoop, Spark, Storm, Kafka
  – Databases: relational databases, Cassandra, MongoDB, Redis


  • Software Engineer


    March 2015 – Present (4 years 7 months), Greater San Diego Area

    ● Started the company's first data science team from the ground up. Built a linear regression model to estimate battery usage by device component, which helped increase battery life by 12% in the single-battery system and by 18% in the dual-battery system.
    ● Analyzed user logs to understand frequently used features and customer demands. Performed exploratory data analysis and visualized usage patterns to identify the most frequently captured media type, the average video duration, and the factors that lead to media corruption or sync failure.
    ● Used Qt to create the product's user interface and conducted A/B testing to choose the best UI design and flow, which reduced the bounce rate by 30%.
    ● Implemented a failure prediction model and used sequence mining to derive rules for estimating machine failure in the field from sensor data, which reduced warranty costs by $300k. Detected the correlation between temperature and fan speed and its effect on the processor clock rate.
    ● Built a recommendation system that suggests new camera heads and reels based on the user's routine reel-length usage and camera sensor readings, which increased sales of reels and cameras by 27%.
    ● Led the transition from a non-internet-based device to a Wi-Fi-enabled device, based on data from user logs and multiple sensors.
    ● Performed Linux kernel development for a Yocto- and Freescale i.MX6-based pipeline inspection tool. Developed device drivers to control the camera, dual-battery system, fan, and heat management.

    Technologies: Python, Scikit-learn, Pandas, NumPy, SQL, C++, C, Qt, Linux


  • Udacity

    Machine Learning Nanodegree, Data Science

    2019 – 2019

    Python, Scikit-learn, PyTorch, Pandas, NumPy, Matplotlib, Seaborn

  • Udacity

    Data Analyst Nanodegree, Data Science

    2019 – 2019

    Python, SQL, NumPy, Pandas, SciPy, Matplotlib, Seaborn, Descriptive and Inferential Statistics, A/B testing

    Projects:
    – TMDb movie data analysis
    – WeRateDogs Twitter data wrangling

  • San Diego State University-California State University

    Master’s Degree, Computer Science

    2015 – 2017

    Coursework: Machine learning; Big data tools and methods; Data mining and knowledge; Database theory and implementation; Applied computer vision; Advanced object-oriented design and programming; Theory of parallel algorithms; Wireless networks

  • Ganpat University

    Bachelor’s Degree, Computer Engineering

    2010 – 2014

    Coursework: Operating systems; Data structures and algorithms; Computer programming; Object-oriented programming; Database management systems; Computer networking

  • J L High School

    High School



  • English

    Full professional proficiency

  • Hindi

    Native or bilingual proficiency

  • Gujarati

    Native or bilingual proficiency

Skills & Expertise

  • Seaborn
  • Device Drivers
  • Algorithms
  • Hive
  • Git
  • Keras
  • Qt
  • MapReduce
  • Data Analysis
  • Data Science
  • C (Programming Language)
  • Embedded Linux
  • Microsoft Office
  • Linux
  • HiveQL
  • Embedded Systems
  • Tableau
  • TensorFlow
  • Apache Storm
  • Apache Spark
  • Big Data Analytics
  • Debugging
  • Python (Programming Language)
  • Algorithm Design
  • MySQL
  • C++
  • Big Data
  • PyTorch
  • Amazon Web Services (AWS)
  • Linux Kernel
  • Scikit-Learn
  • Java
  • Matplotlib
  • Hadoop
  • Pandas
  • Software Development
  • NumPy
  • SQL


  • Introduction to Big Data

    Coursera Course Certificates, License ZHQZE3PZMDFL

    December 2016

  • Programming for Everybody (Getting Started with Python)

    Coursera Course Certificates, License SD9NERXV4Y2X

    December 2016

  • Python Data Structures

    Coursera, License 3J6GHSKB694Z

    April 2017

  • Using Python to Access Web Data

    Coursera, License 6698333K44M7

    January 2018

  • Data Science Orientation

    Coursera, License

    February 2019

  • Python for Data Science

    Coursera, License ZQNKYPG3AUUU

    February 2019

  • The Data Scientist’s Toolbox

    Coursera, License X4QAB98CEVCJ

    March 2018

  • Mathematics for Machine Learning: Linear Algebra

    Coursera, License 6TB58YHUQGUJ

    March 2018

  • SQL for Data Science

    Coursera, License S5ZXALSW2KWL

    March 2019

  • Mathematics for Machine Learning: PCA

    Coursera, License 6UBPYNWMSXKB

    April 2019

  • Cloud Computing Applications, Part 1: Cloud Systems and Infrastructure

    Coursera, License NAJ42C7HFR5X

    May 2019

  • Understanding and Visualizing Data with Python

    Coursera, License L4K9YXQC7EPA

    May 2019

  • Python Programming

    DataCamp, License 101641

  • Data Manipulation with Python

    DataCamp, License 101930

  • Hive to ADVANCE Hive

    Udemy, License UC-CH8QZKLQ

    August 2019

  • Apache Storm

    Udemy, License UC-062IVSGS

    August 2019

  • Specialization: Introduction to Data Science

    Coursera, License URPWPZCGG3J3

    March 2019


San Diego State University-California State University

  • Advanced Object-Oriented Programming
  • Data Mining and Knowledge
  • Intro to Big Data: Tools and Methods
  • Computer Security
  • Machine Learning
  • Wireless Networks
  • Theory of Parallel Algorithms
  • Spatial Database

Ganpat University

  • Data Structures and Algorithms
  • Operating Systems
  • Object-Oriented Programming
  • Software Engineering
  • Computer Networks

Volunteer Experience & Causes

Causes Purvil cares about:

  • Animal Welfare
  • Education
  • Poverty Alleviation
  • Science and Technology