Data Science & Analytics

Hundreds of PB of data

Diverse technology stack

We leverage a wide array of technologies, including statistical software such as Python, R, SAS, Stata, Eviews, SPSS and Machine Learning packages such as Stan, Keras, Tensorflow, Pytorch

Reporting

Great Expectations, Databricks, Data Quality Dashboards

OpenMetadata, Airflow

Analysis

Spark, Presto/Athena, Flink, Beam, Airflow, Prefect, Kafka, Kinesis

Modeling

SageMaker, MlFlow, Kubeflow

Generative AI

data modeling, dimensional modeling, normal forms, wide tables, dbt

Interpretabile, Ethical, and Responsibile AI

Creation of API Client libraries, streaming data integration, DBT unit test libraries, open source contribution to DBT Athena

Experimentation Design

Creation of API Client libraries, streaming data integration, DBT unit test libraries, open source contribution to DBT Athena

Sample past projects

  • Data Quality Monitoring, Reporting and alerting - Great Expectations, Grafan
  • SLA Alerting & Monitoring implementation - Airflow