Data Machines Corp. is a small company dedicated to data analytic research.

We release our code as open source, participate meaningfully in our communities through working groups and sponsorships, and give part of our profit to charity.

Continue reading to learn more about who we are, what we do, and how we do it.


Open Source

Open software and standards are a foundation for innovative and important work in science, education, and industry. The customers who value our professional services help us contribute to these communities.

Technology Agnostic

There is no one answer for every challenge. We are unbiased in the selection of components or services that contribute to a solution.

Full Stack

Good developers that are familiar with the entire technology stack know how to make better solutions and life easier for those around them. Our teams have a genuine interest in all layers of software and hardware technology.


Creating and managing automatic intelligent behavior is necessary to work elastically and securely at scale. We build systems that adapt to unpredictable change and simplify their intrinsic complexity for operators and users.


Read Employee Biographies here.


Data Machines Corp. (DMC) is a small business which specializes in designing, building, and using cloud architectures to engineer and share solutions to difficult problems in data analytics, dev-ops, machine intelligence, and data science. 

DMC is developing new data architecture technologies as part of eight unique and challenging DARPA research programs where our systems and code are used daily by over 2,500 researchers, data scientists, and research transition partners.

DMC maintains a library of higher math based Cyber and Network Defense big data analytics tailored to detecting APT and complex cyber threats. This portfolio is comprised of code paired with white-papers which explain it's application, strengths, and weaknesses.

DMC maintains libraries of code and techniques for scalable data enrichment, transformation, and analysis (computer vision, filters, tagging, tokenization, summarization, matrix completion, missing value inference, principal component analysis, prediction, ...etc). We use these libraries to conduct Exploratory Data Analysis (EDA) on customer data sets, internal and external training, as well as improvement of our own technical solutions.

DMC maintains a suite of tools and techniques for managing architectures and data. Examples of our tool capabilities including utilities for securely managing global Internet of Things communication and orchestration, data transport, architecture deployment, application security, identity management, and data security. 

DMC is skilled at taking cutting edge machine learning and artificial intelligence code and engineering scalable production deployment and operations strategies for them. We bring extensive experience and first-hand knowledge of using the latest tools in production systems to ensure project success. We have worked with everything from robust production ready code to brittle research code.

DMC has a structured approach to securing scaled production deployments of cutting edge software. We are experienced at meeting security and compliance goals by making appropriate and documented risk decisions while maintaining the maximum possible functionality and performance of cutting edge machine learning and artificial intelligence code.

DMC has extensive experience working with broad research teams comprised of government, industry, and academia to do white-label transition of important technologies to make meaningful impacts on challenges such as threat-finance, human trafficking, cyber security, decision support, healthcare, investing, litigation, and more.

Access our Open Source Code on GitHub.


DMC maintains an extensive library of curated public and private data sets used to support research and seed challenge problems for collaborative research efforts. We also maintain a library of data gathering and curation tools which are often utilized to generate new data sets for use in research challenges or to power observational statistical analysis in response to specific data science requests.

View our curated list of open data sets.






Machine Learning




2018-09-20 Data Machines Corp. Director of Scientific Computing Dr. Martial Michel is a panelist in the "Edge Computing: Shaping the Named Data Edge" session at the "Named Data Networking Community Meeting 2018" in Gaithersburg MD.

2018-09-13 Data Machines Corp. staff spends an afternoon volunteering at a Second Story youth center. Staff cooked food, helped kids with reading and math homework, played games, and helped sort and organize clothing and food donations.

2018-09-10 Data Machines Corp. Director of Scientific Computing Dr. Martial Michel leads the OpenStack Project Team Gathering's Scientific Special Interest Group session in Denver CO.

2018-07-24 Data Machines Corp. Director of Scientific Computing Dr. Martial Michel moderates a panel on Transforming Scientific Computing with Cloud Technologies at the 2018 Practice and Experience in Advanced Research Computing (PEARC18) conference in Pittsburg PA.

2018-06-26 Data Machines Corp. sponsors AutoML 2018, The International Workshop on Automatic Machine Learning in Stockholm, Sweden.

2018-05-23 Data Machines Corp. Director of Scientific Computing Dr. Martial Michel moderates a panel on High Performance Computing, GPU, and Artificial Intelligence at the OpenStack Summit in Vancouver, BC.

2018-05-12 Data Machines Corp. BG (Ret) Thompson presents on Human Capital Risk Management Through Human Resource Threat Assessments at the Fredericksburg, VA  Society for Human Resource Management Chapter meeting.

2018-04-18 DARPA Media Forensics on NBC News.

2018-04-17 DARPA Media Forensics on CBS News.

2018-03-20 Data Machines Corp. supports and sponsors the Open Research Cloud Alliance (ORCA) workshop in Gaithersburg MD to discuss Enabling Intercloud Interoperability and Support for Scientific Research Collaboration. DMC Director of Scientific Computing Dr. Martial Michel  supports as Vice Chair.

2018-03-05 Data Machines Corp. is honored to have Burt Thompson (Brigadier General, US Army Ret.) join our leadership team. 

2018-02-15 Data Machine Corp. Director of Scientific Computing Dr. Martial Michel is announced as Vice Chair of the IEEE P2302 Intercloud Interoperability and Federation Working Group

2018-01-05 Data Machines Corp. publishes our 2017 Giving Report. 

2017-11-06 Data Machine Corp. Director of Cloud Computing Mike May presents at the OpenStack Summit in Sydney Australia.  

2017-10-28 Data Machines Corp. supports and sponsors the first ever Conference on Applied Machine Learning for Information Security (CAMLIS). Data Machines Corp. Director of Data Science and Analytics Dr. Nathan Danneman presents. 


DMC is still a very young company and in the process of formalizing processes for how we approach the balance of (what we feel is) the necessity of generous giving and the responsible management and funding of our own strategic goals as a business. These annual reports summarize our efforts.

2017 end of year giving report.

The 2018 end of year giving report will be released soon.


General inquiries: info at datamachines.io

Address: Data Machines Corp. Suite 110, 44933 George Washington Blvd, Ashburn, VA 20147-6301

To apply for employment, click here. 


Data Machines Corp. is developing curriculum related to data science, machine learning, programming, and cloud architectures. We hope to be able to share our knowledge through outreach and structured training seminars. If you're interested in attending or learning more about these topics, please reach out to us at training@datamachines.io

If you're new to programming, you can take our online Python Training Course to get started. 

More to follow.