The splink_demos project is a collection of Jupyter Notebook files organized into subdirectories by topic. The main directories in the project are:
- examples: This directory contains examples of how to use splink for various data linkage tasks. It is further divided into subdirectories for different databases, such as sqlite and duckdb. Each subdirectory contains a dashboards directory that holds the notebooks for that specific database.
- tutorials: This directory contains tutorials that guide users through the process of using splink for data linkage. The tutorials are designed to be self-contained and easy to follow, with step-by-step instructions and explanations.
- data: This directory contains sample datasets that are used in the examples and tutorials.
- docs: This directory contains documentation for the project, including installation instructions, a user guide, and API references.
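To get an overview of a layout like the one above, the notebooks can be grouped by their top-level directory. The following is a minimal sketch using only the Python standard library; the function name `notebooks_by_directory` and the file names used to try it out are illustrative assumptions, not part of the project.

```python
from collections import defaultdict
from pathlib import Path


def notebooks_by_directory(root: Path) -> dict[str, list[str]]:
    """Group .ipynb files under `root` by their top-level directory."""
    groups: dict[str, list[str]] = defaultdict(list)
    for path in sorted(root.rglob("*.ipynb")):
        relative = path.relative_to(root)
        # Skip the autosave copies Jupyter keeps in .ipynb_checkpoints.
        if ".ipynb_checkpoints" in relative.parts:
            continue
        # Notebooks directly in `root` are grouped under ".".
        top = relative.parts[0] if len(relative.parts) > 1 else "."
        groups[top].append(relative.as_posix())
    return dict(groups)


if __name__ == "__main__":
    # Point this at a local checkout; "." is only a placeholder.
    for directory, notebooks in notebooks_by_directory(Path(".")).items():
        print(f"{directory}: {len(notebooks)} notebook(s)")
```

Grouping by the first path component keeps nested layouts, such as a per-database subdirectory under examples, attributed to their parent topic.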
Each Jupyter Notebook in the project follows a similar structure, consisting of markdown cells that provide explanations and code cells that contain executable Python code. The code cells are used to demonstrate the concepts discussed in the markdown cells, and users can modify and run the code to experiment with different configurations and parameters.
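The alternation of markdown and code cells is visible in the notebook file itself, which is a JSON document. The sketch below builds a minimal notebook in the version 4 notebook format and extracts its code cells; the helper names `make_notebook` and `code_sources` and the cell contents are hypothetical and chosen only for illustration.

```python
import json


def make_notebook(cells: list[tuple[str, str]]) -> dict:
    """Build a minimal version-4 notebook dict from (cell_type, source) pairs."""
    nb_cells = []
    for cell_type, source in cells:
        cell = {"cell_type": cell_type, "metadata": {}, "source": source}
        if cell_type == "code":
            # Code cells additionally carry an execution counter and outputs.
            cell["execution_count"] = None
            cell["outputs"] = []
        nb_cells.append(cell)
    return {"cells": nb_cells, "metadata": {}, "nbformat": 4, "nbformat_minor": 4}


def code_sources(notebook: dict) -> list[str]:
    """Return the source text of every code cell, in document order."""
    # "source" may be a single string or a list of lines; join handles both.
    return [
        "".join(cell["source"])
        for cell in notebook["cells"]
        if cell["cell_type"] == "code"
    ]


if __name__ == "__main__":
    notebook = make_notebook([
        ("markdown", "## Step 1: a short explanation"),
        ("code", "print(1 + 1)"),
    ])
    print(json.dumps(notebook, indent=1))
    print(code_sources(notebook))
```

Writing the output of `json.dumps` to a file with an `.ipynb` extension should give a notebook that Jupyter can open, with the explanation cell followed by the runnable code cell.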
The splink_demos project also includes a Makefile with targets for building the notebooks, running tests, and creating distribution packages.
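The contents of the Makefile are not shown here, so the following is only a guess at what a "build the notebooks" step could look like: executing each notebook in place with the `jupyter nbconvert` command-line tool. The function name `execute_commands` and the example path are assumptions; the sketch builds the commands without running them.

```python
from pathlib import Path


def execute_commands(notebooks: list[Path]) -> list[list[str]]:
    """Return one `jupyter nbconvert` command per notebook, without running it."""
    return [
        # --execute runs every cell; --inplace overwrites the notebook
        # with its executed version instead of writing a copy.
        ["jupyter", "nbconvert", "--to", "notebook", "--execute", "--inplace", nb.as_posix()]
        for nb in notebooks
    ]


if __name__ == "__main__":
    # Hypothetical path, used only to show the shape of the command.
    for command in execute_commands([Path("tutorials/example.ipynb")]):
        print(" ".join(command))
```

Each command can be passed to `subprocess.run(command, check=True)`, so that a notebook that fails to execute stops the run with a non-zero exit status.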
Overall, the structure of the splink_demos project is designed to be modular and easy to navigate, with clear directories for each topic and well-organized Jupyter Notebook files. The project’s documentation and examples provide users with a clear and concise introduction to the splink library and its capabilities.