Question 1

How to scrape HTML from a website using Hickory in Clojure?

Accepted Answer

Fetch HTML with a library like clj-http, parse it using hickory.core/parse, convert to Hickory format with as-hickory, then use hickory.select/select with CSS-style selectors to extract nodes. The README shows an example extracting race dates from Formula 1's site.

Question 2

Hickory vs Enlive for HTML processing in Clojure?

Accepted Answer

Hickory excels at lossless parsing and round-trip conversion with dual formats, ideal for data extraction and manipulation. Enlive is better for template transformation and live scraping but may not preserve all HTML details like comments. Choose based on whether you need completeness or templating focus.

Question 3

How to parse HTML fragments without a root element in Hickory?

Accepted Answer

Use the parse-fragment function which returns a list of parsed nodes, then apply as-hiccup or as-hickory to each element via map. The README demonstrates this with fragment parsing examples showing separate handling.

Question 4

Does Hickory support malformed HTML?

Accepted Answer

Yes, it uses HTML5 parsers (Jsoup on JVM, browser DOM in ClojureScript) that fix up malformed HTML into well-formed documents automatically, as mentioned in the parsing section for robust handling.

Question 5

How to install and configure Hickory for Node.js ClojureScript?

Accepted Answer

Add the Hickory dependency, then install a DOM library like jsdom or xmldom from npm and set js/document or js/DOMParser in your code. The README provides snippets but warns about figwheel compatibility issues.

Question 6

What are the performance implications of using Hickory for large HTML files?

Accepted Answer

Hickory adds layers for parsing and format conversion, which can slow down processing of very large documents compared to lightweight parsers. However, it trades some speed for functional purity and ease of manipulation in Clojure.

Hickory

What is Hickory?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions