Pachyderm is an enterprise-grade, open-source data science platform that makes explainable, repeatable, and scalable ML/AI a reality. Its platform brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines while empowering users to use any language, framework, or tool they want.
Pachyderm is “Git for Data Science.” It offers complete version control for data and gives data science teams the same first-class development tools that software developers have. Pachyderm is well suited to building machine learning pipelines and ETL workflows because it traces every model and output back to the raw input datasets that produced it, a property known as provenance.
B2C, B2B
26 to 50
Series B
$28,120,000
Scaling Up
2014
Computer Software
N/A
N/A
IT and Security
Service
Yes
Active
Machine Learning
Natural Language Processing
N/A
Software
Sr. Python Engineer - Integrations
New York, New York
Staff Engineer
New York, New York
Sr. Engineering Manager
San Francisco, California
Distributed Systems Engineer
San Francisco, California
DevOps Engineer
San Francisco, California