It’s the end of June and I want to take a step back and reflect on my open source contributions since the beginning of the year. I’ve made 770 public commits on GitHub in the first semester.
I’m working at Etalab, the French government administration in charge of data policy and I’m involved in the Public Interest Entrepreneurs program (hiring talents to work in the administration). 95% of the code I work on is open source. This is a huge privilege and I love it. I get to talk about my work, collaborate with people and help other open source enthusiasts.
Main projects
Here are the main projects I’ve been contributing to (in term of commits):
- etalab/schema.data.gouv.fr: the website schema.data.gouv.fr collects, validates and displays schemas for open data datasets
- entrepreneur-interet-general/bulletins: a weekly retrospective tool for multiple projects or teams I developed for the Public Interest Entrepreneurs program
- entrepreneur-interet-general/site-eig: the Jekyll website of the Public Interest Entrepreneurs program
- etalab/data-codes-sources-fr: an initiative to automatically collect metadata about code repositories published by public administrations. I blogged about it.
- AntoineAugusti/table-schema-to-markdown: a Python library to convert a TableSchema file (describing a tabular file schema) to a Markdown documentation
- etalab/csv-gg: a VueJS application to generate forms from and validate against a Table Schema
- AntoineAugusti/vacances-scolaires and AntoineAugusti/vacances-scolaires-france: a dataset and a Python library about French school holidays
- DISIC/politique-de-contribution-open-source: the French open source contribution policy
- etalab/vuepress-theme-gouv-fr: a VuePress theme for the French administration
- Repos to automatically collect data: AntoineAugusti/tweets-rerb about the RER B transit line, AntoineAugusti/antennes-free outages for mobile antennas, AntoineAugusti/glyphosate about uses of pesticide and snosan-tools/avurnavs-fichiers for navigational warnings in France.
Overall I’ve contributed to more than 50 public repositories.
Data extraction
I extracted this data thanks to Google BigQuery, which provides a public dataset storing GitHub events. This was the best way I found to extract my public activity on GitHub. Unfortunately GitHub doesn’t provide a way to get this through the API (at the people level) or through a personal export. Here is the SQL request used to extract only my public commits:
SELECT repo.name, count(1) as nb FROM ( TABLE_DATE_RANGE([githubarchive:day.], TIMESTAMP('2019-01-01'), TIMESTAMP('2019-06-31') )) WHERE actor.login = 'AntoineAugusti' and type = 'PushEvent' GROUP BY repo.name
Community
These open source contributions have been a great opportunity to work with others. Open source lets public administrations interact with other administrations, companies, nonprofits and other governments on a daily basis. It’s been a privilege to answer questions, explain how things work and inspire others. Of course I’ve relied on many open source projects and have been helped a lot by other contributors.
Thanks to all the community, see you online for the next semester!