I’m launching a new side project today! Indie Map is a public IndieWeb social graph and dataset. It’s a complete crawl of 2300 of the most active IndieWeb sites, sliced and diced and rolled up in a few useful ways:
- Social graph API and interactive map.
- SQL queryable dataset and GUI analytics.
- Raw crawl data in WARC format: 2300 sites, 5.7M pages, 380GB HTML + mf2.
The IndieWeb’s raison d’être is to do social networking on individual personal web sites instead of centralized silos. Some parts have been fairly straightforward to decentralize – publishing, reading, interacting – but others are more difficult. Social graph and data mining fall squarely in the latter camp, which is why the community hasn’t tackled them much so far. I hope this inspires us to do more!
Indie Map was announced at IndieWeb Summit 2017. Check out the slide deck and video of the talk!
380gb :) of … “The IndieWeb’s raison d’être is to do social networking on individual personal web sites instead of centralized silos”
Love this map of the IndieWeb. Great visualization of how the entire web can be the best social network.
Cool
Love this map of the IndieWeb. Great visualization of how the entire web can be the best social network. snarfed.org/2017-06-24_new…
… O.o
I realise I’m the only one, and so hopefully I just don’t understand. But… the whole point of “not Facebook” and “not Google” is not to be “datamined”. This makes me afraid there will have to be a whole new reset of the social web after this one. Happy to be talked sense into, though.
Tom mentioned this Article on herestomwiththeweather.com.
Amazing
Awesome Ryu!