Hundreds of PB of data
We leverage a wide array of technologies, including statistical software such as Python, R, SAS, Stata, Eviews, SPSS and Machine Learning packages such as Stan, Keras, Tensorflow, Pytorch
Great Expectations, Databricks, Data Quality Dashboards
OpenMetadata, Airflow
Spark, Presto/Athena, Flink, Beam, Airflow, Prefect, Kafka, Kinesis
SageMaker, MlFlow, Kubeflow
data modeling, dimensional modeling, normal forms, wide tables, dbt
Creation of API Client libraries, streaming data integration, DBT unit test libraries, open source contribution to DBT Athena
Creation of API Client libraries, streaming data integration, DBT unit test libraries, open source contribution to DBT Athena