There’s no single data science project documentation recipe. Rather, your documentation needs will vary by project, team, organization, and industry.
And it’s not just about producing data science model documentation. Instead, think broader and ask – What do I need to document and why?
Once you’ve thought this through and have goals in place, you can then set a repeatable plan for how to document a data science project.
Let’s start with three guiding principles.
Before you build out your documentation, ask:
Think broadly and don’t take a “one-size-fits-all” approach. Rather, you should create various artifacts that best serve each set of stakeholder’s needs.
Don’t forget your users! Rather, be sure they know how to use your system.
If you have a user interface, a great practice is to put a help menu link in the upper right of your screen so that the user can navigate to find items such as:
Another common output for a model is via an API. In this case, write technical documentation so that the receiving-end engineers can build on top of your API. Include items such as definitions, endpoints, parameters, data formats, and response times.
There are a few templates that can help get you started.
The reality is that your project, team, and organizational needs will deviate from the above templates. As such, use these as starting points toward creating your own data science documentation templates.
Congrats! You made it to the end. But your work is just getting started. Remember that these data science project documentation best practices do not apply to all circumstances. And your situation will likely require some additional practices not mentioned here. So to review:
Best of luck and reach out if you have some additional pointers you found useful.