This is a pure Dart html5 parser. It's a port of html5lib from Python. Since it's 100% Dart you can use it safely from a script or server side app.
Eventually the parse tree API will be compatible with dart:html, so the same code will work on the client or the server.
This library is not finished. These files from the html5lib directory still need to be ported:
html5parser.py
ihatexml.py
sanitizer.py
filters/*
serializer/*
treebuilders/*
treewalkers/*
- most of
tests
Dependencies are installed using the Pub Package Manager.
pub install
# Run command line tests
#export DART_SDK=path/to/dart/sdk
tests/run.sh