RASH
From Just Solve the File Format Problem
RASH (Research Articles in Simplified HTML) is a stripped-down version of HTML using only a subset of its elements in order to provide consistently marked-up academic papers which are readable in normal web browsers but are more easily processable by tools than normal HTML which can have a much greater variability and sometimes a messy structure.