CodeCommons aims to provide a centralized repository of essential resources, including code, documentation, and metadata, to facilitate the creation of smaller, more effective datasets for the next generation of AI tools.
CodeCommons is a two-year project building on the Software Heritage archive. Here’s an overview of the projects we and our partners are working on.
A look at what our amazing Ambassadors did in 2024 – and how you can get involved.
Thomas Aynaud joins Software Heritage as CTO, bringing extensive experience in search and open source. Learn more about his background.
CodeCommons, a two-year project funded by the French government, is building on Software Heritage—the world’s largest public source code archive—to create higher-quality datasets for responsible artificial intelligence.
The newest Software Heritage ambassador is a full-stack developer and contributor to open-source JavaScript projects.
Mozilla named Software Heritage co-founder Roberto Di Cosmo a Rise25 Honoree in the ‘builder’ category. Here’s how he got there.
Our newest champion's toolkit spans four languages (English, German, Kurdish, and Arabic) and technical tools including Linux, Python, XSLT, Java, and SQL. Learn more about him and how you can connect.
How are libraries supporting the growing importance of software in research? Frédéric Saby of the Université Grenoble-Alpes provides answers.
Karim Boualem, head of research support, highlights key initiatives aimed at empowering researchers with Software Heritage.