Data Scientist

Contract to Hire

Mandatory Skills

  • 3+ years of professional experience writing code in Python and PySpark
  • Experience with Python libraries such as Pandas and NumPy
  • Experience writing high-quality Python code and applying code refactoring techniques (e.g., IDEs – PyCharm, Visual Studio Code; linters and formatters – Pylint, pycodestyle, pydocstyle, Black)
  • Deep understanding of data structures, algorithms, and excellent problem-solving skills
  • Experience working in the Big Data engineering ecosystem (e.g., Hadoop, Spark, ETL pipelines)
  • Experience building data processing pipelines on at least one cloud platform (GCP, AWS, or Azure)
  • Strong SQL skills and experience with databases and data warehouses
  • Experience in building scalable data processing pipelines using Spark and Dask
  • Experience in Exploratory Data Analysis (EDA), Feature Engineering, Data Visualisation
  • Experience with data visualisation libraries (e.g., Matplotlib, Seaborn, ggplot2)
  • Ability to identify, analyse, and interpret trends or patterns in complex data sets
  • Ability to interpret data, analyse results using statistical techniques, and provide ongoing reports
  • Experience with machine learning libraries such as scikit-learn and XGBoost
  • Experience in building models for ML tasks (Regression, Classification, Time Series)
  • Excellent understanding of machine learning techniques and algorithms, such as Regression, k-NN, Naive Bayes, SVM, Decision Trees, and Random Forests
  • Familiarity with Dockerizing models and creating model endpoints (REST or gRPC)
  • Strong working knowledge of source code control tools such as Git and Bitbucket
  • Strong drive to learn and master new technologies and techniques
  • Strong communication and collaboration skills
  • Positive attitude and self-motivation