PyTux

Trips of a curious penguin.

Hello, time traveler! You are reading an article that is almost ten years old. The world has changed, and so have I and my opinions. There is a good chance what's below is not current, correct, or secure anymore, and maybe it never was. This page is preserved because I am an archivist at heart, but you have been warned.

Archive your GitHub repo and data

GitHub is a service we all trust, so this is not a “get your data off that cloud before it explodes!”-style post, but sometimes you want to take an offline copy of your or somebody’s work.

Here is a quick and dirty Python script that will help you clone all the repositories, the Gists and some metadata that can be fetched over the API. Be warned, it only fetches public repos and data and there’s no error checking.

I wrote and used this to archive Aaron Swartz GitHub account on archive.org. R.I.P.