seems like they used some pretty outdated microformats; @schnarfed personally came up with a list of some 3000 domains that had h-entry

# Standard