Since the HTML is parsed anyway, it may make sense to recognize common micro formats in it like addresses or contacts.