jsoup

From Wikipedia, the free encyclopedia
jsoup Java HTML Parser
Developer(s)Jonathan Hedley
Stable release
1.17.2 / December 29, 2023; 2 months ago (2023-12-29)[1]
Repository
Written inJava
Operating systemCross-platform
PlatformJava (JVM)
TypeHTML parser
LicenseMIT license
Websitejsoup.org

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.

History[edit]

jsoup was created in 2009 by Jonathan Hedley. It is distributed it under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup[edit]

jsoup is used in a number of current projects,[2] including Google's OpenRefine data-wrangling tool.

See also[edit]

References[edit]

  1. ^ "jsoup Java HTML Parser release 1.17.2". Retrieved 29 Dec 2023.
  2. ^ "Jsoup". MVNRepository / F. Rodriguez. 2015-03-08.

External links[edit]