Implementing a detector at GitGuardian: a use case with MongoDB credentials

Company

GitGuardian

Date Published

March 19, 2021

Author

Guardians

Word count

1868

Language

English

Hacker News points

None

URL

blog.gitguardian.com/mongodb-credentials-detector

Summary

This blog post by Pierre Lalanne, a data scientist at GitGuardian, explains how the company develops and refines detectors for MongoDB credentials leaked in source code. GitGuardian's detection engine, which monitors over 10 million GitHub documents daily, is designed to balance high recall and precision by focusing on "multimatch" secrets, a method that reduces false positives and alert fatigue. The article describes three main detection methods for MongoDB credentials: variable assignments, URI connection strings, and shell commands, each with its own strategy for identifying and validating potential leaks. Pre-validation discards irrelevant files, and post-validation refines the results, for example by filtering out common placeholder values and checking that the detected credentials are related to one another. This approach lets the engine maintain high precision while raising over 3,000 alerts weekly for leaked MongoDB credentials.
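
To make the pre-validation, matching, and post-validation stages more concrete, here is a minimal Python sketch for the URI connection string case only. It is not GitGuardian's actual detector: the keyword list, the regular expression, the placeholder set, and the function names (`prevalidate`, `is_placeholder`, `find_mongodb_uris`) are all assumptions made for illustration.

```python
import re

# Assumed pre-validation keyword: skip documents that never mention MongoDB.
PREVALIDATION_KEYWORDS = ("mongodb",)

# Assumed placeholder values that usually indicate documentation, not a leak.
PLACEHOLDERS = {"username", "user", "password", "pass", "xxx", "changeme"}

# Illustrative pattern for mongodb:// and mongodb+srv:// URIs that embed
# a username, a password, and a host in a single string.
URI_PATTERN = re.compile(
    r"mongodb(?:\+srv)?://"
    r"(?P<username>[^:@/\s]+):(?P<password>[^@/\s]+)"
    r"@(?P<host>[^/?\s'\"]+)"
)


def prevalidate(document: str) -> bool:
    """Cheap check run before any regex: does the document mention MongoDB?"""
    lowered = document.lower()
    return any(keyword in lowered for keyword in PREVALIDATION_KEYWORDS)


def is_placeholder(value: str) -> bool:
    """Treat values such as <password> or ${PASSWORD} as placeholders."""
    return value.strip("<>{}$").lower() in PLACEHOLDERS


def find_mongodb_uris(document: str) -> list[dict[str, str]]:
    """Return credential parts found in MongoDB URIs, after post-validation."""
    if not prevalidate(document):
        return []
    results = []
    for match in URI_PATTERN.finditer(document):
        parts = match.groupdict()
        # Post-validation: drop matches whose username or password is a placeholder.
        if is_placeholder(parts["username"]) or is_placeholder(parts["password"]):
            continue
        results.append(parts)
    return results
```

Because the username, password, and host all come from one URI match, they are related by construction. For variable assignments or shell commands, where each part is matched separately, a relatedness check would need an additional rule (for instance, proximity within the file), which this sketch does not attempt.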