Home  ·  Glossary
Glossary

Web scraping in
plain English.

Definitions for 40+ terms — scraping, SERPs, email enrichment, MX records, catch-all, the lot. Bookmarkable. Linkable. Updated when the industry shifts.

A

API (Application Programming Interface)
A structured way for software to talk to software. Livescraper exposes a REST API so you can run scrapes from your own code instead of the dashboard. See our API docs →
Application form
The user-facing page where someone signs up. We collect email + password + (optionally) company; nothing else.

B

B2B (Business-to-Business)
Companies selling to other companies. Most Livescraper customers run B2B lead gen — pulling contact data on businesses, not consumers.
Bounce / hard bounce
An email that fails permanently — wrong address, dead domain. A "soft bounce" fails temporarily (mailbox full, server down). High bounce rates hurt sender reputation. Check before you send →

C

CAPTCHA
A test (typically image-based) to tell humans from bots. Most modern scraping involves either avoiding pages with CAPTCHAs or solving them via a third-party service. Livescraper handles both — invisibly.
Catch-all email
A domain configured to accept mail to any address (anything@example.com). Sends won't bounce, but they often go nowhere. Catch-all detection is part of email validation.
CCPA
California Consumer Privacy Act. US analogue to GDPR. Livescraper is CCPA-compliant — public business data only.
CID (Customer ID)
Google's internal identifier for a place. Looks like 1234567890123456789. Useful for re-fetching the same place reliably. Extract one from a URL →
Cold email
Outbound email to someone who hasn't opted in. Legal in most jurisdictions if relevant + has a clear unsubscribe; deliverability requires careful list hygiene.
Conversion rate
The percentage of visitors who take a desired action — sign up, buy, click. The whole point of measuring lead gen.
CSV (Comma-Separated Values)
The simplest tabular file format. Every row is a line, every column separated by commas. Livescraper exports default to CSV; opens in Excel, Google Sheets, anywhere.

D

Data extraction
Pulling structured information out of unstructured pages. The job all six Livescraper tools do.
Deduplication / dedup
Removing duplicate rows from a dataset. Critical when you combine multiple sources. Free CSV dedup tool →
Disposable email
A throwaway address from services like Mailinator or 10MinuteMail. People use them to dodge real signups. Livescraper's validator filters them out.
DPA (Data Processing Agreement)
A legal contract required under GDPR for any vendor processing personal data on your behalf. We sign one with EU customers on request.

E

Enrichment
Adding fields to existing records. "Enrich a domain list" = "give me back emails, phones, socials for these domains". Email Scraper →
E.164 format
The international standard for phone numbers: +[country][national number]. Always 15 digits or fewer. Check any number →
ETL (Extract, Transform, Load)
Classic data pipeline pattern: pull data, clean it up, push it into your warehouse. Livescraper handles the Extract step.

F

Featured snippet
The "Position 0" answer box at the top of Google's results. Tracked by our SERP Scraper.
Fingerprinting
A technique sites use to identify a browser uniquely (canvas, fonts, plugin list). Defeating fingerprinting is part of why Livescraper takes care of all the scraping infra for you.

G

GDPR
EU regulation governing personal data. Public business data is generally fine to scrape; never scrape personal/protected categories. Livescraper is GDPR-aligned. Read more →
Geocoding
Converting an address into latitude/longitude coordinates. Reverse geocoding does the opposite. Maps Scraper does both.

H

Headless browser
A browser running without a visual interface — used by scrapers to render JavaScript-heavy pages. Livescraper uses headless browsers for harder targets, invisibly.
HTTP / HTTPS
The transport protocols of the web. HTTPS is encrypted; HTTP is not. Livescraper only fetches over HTTPS.

I

Intent data
Signals that suggest a buyer is researching your category — recent careers postings, tech-stack changes, expansion announcements. We surface several public intent signals.
IP rotation
Cycling through a pool of IP addresses to avoid rate limits. Livescraper does this for you — you never see an IP.

J

JSON
The default modern data interchange format. Every Livescraper API response is JSON; CSV is a flat-file rendering of it.

L

Lead
A potential customer. "Lead gen" = the process of finding and qualifying them. Most Livescraper customers use it for B2B lead gen.
Local pack / 3-pack
The map + 3 business listings Google shows for "near me" queries. Position 1 in the local pack is enormously valuable. Track yours →

M

MX record
The DNS record that tells the world which mail server accepts mail for a domain. No MX = no email = certain bounce. Part of every email validation pass.

N

NAP (Name · Address · Phone)
The three pieces of business identity that local SEO depends on. Inconsistent NAP across the web hurts rank. Maps Scraper helps you audit.

P

PaaS / SaaS
Platform-as-a-Service vs Software-as-a-Service. Livescraper is SaaS — usable from a browser, no install.
Parsing
Reading raw HTML and extracting structured fields. The hardest, most maintenance-heavy part of any scraper. We do all of it.
Place ID
Google's stable identifier for a place. Survives renames, address changes. Different from CID (older) and from the hex ID inside maps URLs.
Proxy
An intermediate server that requests pages on your behalf. Used to rotate IPs and avoid blocks. Livescraper handles all proxy infrastructure invisibly.

R

Rate limit
How many requests a server allows in a window. Most scraping pain comes from rate limits. We absorb them with our proxy + retry layer.
REST API
An HTTP-based API style using verbs (GET / POST) on URL resources. Our API is REST. Endpoints →
Role account
A shared inbox like info@ or sales@. Lower deliverability than personal addresses; cold outreach should usually exclude.

S

SERP (Search Engine Results Page)
The page Google returns for a query. Tracking your SERP rank weekly is the foundation of local + organic SEO. SERP Scraper →
SMTP probe
Asking a mail server "do you accept mail for this address?" without actually sending. The most reliable single email validation check.
SOC 2
An audit framework for service providers. SOC 2 Type II = audited annually. We've held SOC 2 Type II since 2023. More →

T

Terms of service (TOS)
The contract you agree to when using a service. Scraping public data generally doesn't violate site TOS, but specific platforms vary. We only ever scrape what's publicly visible.

U

UTM parameters
URL tags (?utm_source=…) that let analytics tools attribute visits. We support UTM tagging on every Livescraper outbound link.

V

Validation
Confirming a piece of data is well-formed and likely real (email syntax + MX + SMTP + catch-all checks). Email Validation →

W

Webhook
A URL on your server that Livescraper POSTs to when an event happens (task completed, schedule fired). Lets you wire scrapes directly into your stack. Webhook docs →

Looking for a term we don't have? Tell us.

The glossary grows with reader requests. Email hello@livescraper.com with the term you want defined.