Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Try out libxml2 for faster XML & HTML parsing #25

Open
uogbuji opened this issue Apr 30, 2023 · 1 comment
Open

Try out libxml2 for faster XML & HTML parsing #25

uogbuji opened this issue Apr 30, 2023 · 1 comment

Comments

@uogbuji
Copy link
Owner

uogbuji commented Apr 30, 2023

Possibly via Cython? Check on cross-platform deployment issues in any such case.

libxml API docs

Idea popped into my head when I stumbled across this thread.

@uogbuji
Copy link
Owner Author

uogbuji commented May 11, 2023

Actually, this library might be a better option: html5-parser

Historical reference: The now defunct gumbo-parser

Worth noting that the WHATWG html5 parser has some recent updates, though it's still not heavily maintained (perhaps no need).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant