Zero Signup Tools: Free browser tools

Developer Tools

HTML Attribute Extractor

Extract every href, src, alt, data-*, aria-*, class, id, or any custom attribute value from pasted HTML. Output as a list, CSV, TSV, JSON, or Markdown.

Format: tag.attribute. Use * as a wildcard on either side of the dot. End an attribute name with * (for example, data-*) to match every attribute that shares that prefix.
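The selector rule can be sketched in a few lines of Python. This is only an illustration of the matching behavior described above, not the tool's own code; the function names `parse_selector`, `name_matches`, and `selector_matches` are hypothetical, and the sketch assumes that names are compared in lowercase and that the selector is split at its first dot.

```python
def parse_selector(selector: str) -> tuple[str, str]:
    """Split 'tag.attribute' at the first dot and lowercase both halves."""
    tag, sep, attr = selector.strip().lower().partition(".")
    if not sep or not tag or not attr:
        raise ValueError(f"expected tag.attribute, got {selector!r}")
    return tag, attr


def name_matches(pattern: str, name: str) -> bool:
    """'*' matches anything, 'data-*' matches by prefix, anything else exactly."""
    if pattern == "*":
        return True
    if pattern.endswith("*"):
        return name.startswith(pattern[:-1])
    return pattern == name


def selector_matches(selector: str, tag: str, attr: str) -> bool:
    """True when both the tag and the attribute name satisfy the selector."""
    tag_pattern, attr_pattern = parse_selector(selector)
    return name_matches(tag_pattern, tag.lower()) and name_matches(
        attr_pattern, attr.lower()
    )
```

For example, `selector_matches("*.data-*", "div", "data-track-id")` is true, while `selector_matches("a.href", "link", "href")` is false because the tag side does not match.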

Nothing leaves your browser. The HTML is parsed locally with the built-in DOMParser; scripts inside the pasted HTML are never run.

Selector reference

  • *.href: Every href on every tag (a, link, area, base).
  • a.href: Only href values on anchor tags.
  • *.data-*: Every data-* attribute on every tag.
  • *.aria-*: Every aria-* attribute, useful for accessibility audits.
  • img.alt: Every alt text on img tags.
  • img.*: Every attribute on every img tag.
  • meta.content: Every content attribute on meta tags.
  • *.*: Every attribute on every tag in the document.
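To see how selectors like these behave on real markup, here is a rough offline equivalent in Python. It is a sketch under stated assumptions: it uses the standard library's `html.parser` rather than the browser DOMParser the tool relies on, so edge-case parsing may differ, and the names `AttributeCollector` and `extract` are hypothetical. Like the tool, it only reads the markup and never runs scripts.

```python
from html.parser import HTMLParser


def _name_matches(pattern: str, name: str) -> bool:
    # '*' matches anything, a trailing '*' matches by prefix, otherwise exact.
    if pattern == "*":
        return True
    if pattern.endswith("*"):
        return name.startswith(pattern[:-1])
    return pattern == name


class AttributeCollector(HTMLParser):
    """Collects (tag, attribute, value) triples in document order."""

    def __init__(self, tag_pattern: str, attr_pattern: str) -> None:
        super().__init__()
        self.tag_pattern = tag_pattern
        self.attr_pattern = attr_pattern
        self.matches: list[tuple[str, str, str]] = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        if not _name_matches(self.tag_pattern, tag):
            return
        for name, value in attrs:
            if _name_matches(self.attr_pattern, name):
                # Valueless attributes (e.g. 'disabled') arrive as None.
                self.matches.append((tag, name, value or ""))


def extract(html: str, selector: str) -> list[tuple[str, str, str]]:
    tag_pattern, _, attr_pattern = selector.strip().lower().partition(".")
    collector = AttributeCollector(tag_pattern, attr_pattern)
    collector.feed(html)
    collector.close()
    return collector.matches


SAMPLE = (
    '<a href="/home" data-track-id="nav-1">Home</a>'
    '<img src="/logo.png" alt="Logo">'
    '<a href="/docs">Docs</a>'
)

print(extract(SAMPLE, "a.href"))    # both anchor hrefs, in document order
print(extract(SAMPLE, "img.*"))     # src and alt of the image
print(extract(SAMPLE, "*.data-*"))  # the single data-track-id
```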

How to use

  1. Paste an HTML document or fragment into the input box. Use Load sample if you want to see how the extractor behaves on a typical page.
  2. Pick a preset like All href, Image alt, All data-*, or All aria-*, or type your own tag.attribute selector (for example a.href, img.*, *.data-track-id).
  3. Choose an output format: a plain list, a unique list, CSV or TSV for spreadsheets, JSON for tooling, or a Markdown table for docs.
  4. Tweak sort order and the comparison toggles (trim values, drop empty, case-insensitive uniqueness) to match how you plan to use the result.
  5. Use the Copy output button, or scan the preview table and Most frequent values panel to spot duplicates, missing alt text, or repeated tracking IDs.
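The export step can be approximated with Python's standard library. The sketch below assumes each match is a (tag, attribute, value) tuple and uses the tag, attribute, and value columns the page describes for CSV and TSV; the JSON shape and the function names `to_delimited`, `to_json`, and `to_markdown` are assumptions made for illustration, not the tool's actual output.

```python
import csv
import io
import json

Row = tuple[str, str, str]  # (tag, attribute, value)
HEADER = ["tag", "attribute", "value"]


def to_delimited(rows: list[Row], delimiter: str = ",") -> str:
    """CSV by default; pass delimiter='\\t' for TSV."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerow(HEADER)
    writer.writerows(rows)  # fields containing the delimiter get quoted
    return buffer.getvalue()


def to_json(rows: list[Row]) -> str:
    """One object per match, keyed by column name."""
    return json.dumps([dict(zip(HEADER, row)) for row in rows], indent=2)


def to_markdown(rows: list[Row]) -> str:
    """A pipe table; literal pipes in values are escaped."""
    lines = ["| tag | attribute | value |", "| --- | --- | --- |"]
    for tag, attr, value in rows:
        escaped = value.replace("|", "\\|")
        lines.append(f"| {tag} | {attr} | {escaped} |")
    return "\n".join(lines)


rows = [("a", "href", "/search?q=a,b"), ("img", "alt", "Logo")]
print(to_delimited(rows))
print(to_delimited(rows, delimiter="\t"))
print(to_json(rows))
print(to_markdown(rows))
```

Using `csv.writer` rather than joining strings by hand means values that contain commas or quotes are quoted correctly when pasted into a spreadsheet.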

About this tool

HTML Attribute Extractor pulls any attribute value out of pasted HTML so you can audit, migrate, or export it without writing a one-off script. Type a simple tag.attribute selector and the tool walks the parsed document with the browser's built-in DOMParser, then reports every match in document order. Wildcards work on both sides of the dot, so you can target one specific attribute (a.href), every attribute on a single tag (img.*), one attribute across every tag (*.title), an attribute family on every tag (*.data-* or *.aria-*), or every attribute on every tag at once (*.*). Twelve preset buttons cover the most common needs: every href, every src, every img alt, every data-*, every aria-*, every class, every id, meta content, input name, and a few more.

The result panel offers six output formats: a plain list of values, a unique list with duplicates removed, CSV and TSV with tag, attribute, and value columns ready to paste into Sheets or Excel, JSON for downstream tooling, and a Markdown table for docs or pull requests. Sorting can keep document order, sort A to Z or Z to A, or rank by frequency so the most common values float to the top, which is handy for spotting repeated tracking IDs, recurring URLs, or duplicate alt text. Optional value trimming, empty-value filtering, and case-insensitive uniqueness keep the output clean. A live preview table shows the first 200 matches with tag and attribute context, and a Most frequent values panel surfaces any value that appears more than once.

Useful workflows include auditing every href on a page for an internal link review, pulling every src for a CDN migration, listing every data-track-id for an analytics QA pass, dumping every aria-* attribute for an accessibility check, exporting alt text for a localization request, or grabbing every input name for a form refactor.

The HTML is parsed locally in your browser using an inert document; scripts inside the pasted markup are never executed, and nothing you paste is uploaded.
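The cleanup and ranking options can be illustrated with a short Python sketch. It assumes the extracted values are already available as a list of strings; the helper names `clean_values`, `unique_values`, and `rank_by_frequency` are hypothetical, and the tool's exact tie-breaking and case-folding rules are not documented here, so treat those details as assumptions.

```python
from collections import Counter


def clean_values(values: list[str], trim: bool = True, drop_empty: bool = True) -> list[str]:
    """Optionally strip surrounding whitespace and drop empty values."""
    cleaned = []
    for value in values:
        if trim:
            value = value.strip()
        if drop_empty and not value:
            continue
        cleaned.append(value)
    return cleaned


def unique_values(values: list[str], case_insensitive: bool = False) -> list[str]:
    """Keep the first occurrence of each value, preserving document order."""
    seen: set[str] = set()
    unique = []
    for value in values:
        key = value.casefold() if case_insensitive else value
        if key not in seen:
            seen.add(key)
            unique.append(value)
    return unique


def rank_by_frequency(values: list[str]) -> list[tuple[str, int]]:
    """Most common values first; ties stay in first-seen order."""
    return Counter(values).most_common()


hrefs = [" /home ", "/docs", "/home", "", "/Docs"]
cleaned = clean_values(hrefs)
ranked = rank_by_frequency(cleaned)
repeated = [value for value, count in ranked if count > 1]

print(unique_values(cleaned, case_insensitive=True))
print(ranked)
print(repeated)
```

The `repeated` list mirrors the idea behind the Most frequent values panel: anything that appears more than once is worth a second look.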

Free to use. Works in your browser. No signup, no login.
