Website analysis scrapes a company’s website and uses AI to extract structured data: people, addresses, contact information, industry classification, and more.
Enabling website analysis
Add websiteAnalysis: true to your request. Providing the website URL is strongly recommended — if omitted, the API attempts to discover it automatically, which adds time.
{
"name": "Shopify Inc",
"website": "https://shopify.com",
"websiteAnalysis": true
}
Results appear in the website field of the response:
{
"website": {
"status": "success",
"url": "https://shopify.com",
"scrapedAt": "2024-01-15T10:30:00Z",
"extracted": {
"legalName": "Shopify Inc.",
"descriptions": [
"Shopify is a commerce platform that allows anyone to set up an online store and sell their products."
],
"addresses": [
{
"label": "Headquarters",
"street": "151 O'Connor Street",
"city": "Ottawa",
"province": "Ontario",
"postalCode": "K2P 2L8",
"country": "Canada"
}
],
"people": [
{ "name": "Tobias Lütke", "title": "Chief Executive Officer" }
],
"contactInfo": {
"emails": ["support@shopify.com"],
"phoneNumbers": ["+18006578057"]
},
"registrations": [
{ "label": "GST/HST Number", "value": "123456789RT0001" }
]
},
"industry": {
"naicsCode": "454110",
"naicsDescription": "Electronic Shopping and Mail-Order Houses",
"reasoning": "Shopify operates an e-commerce platform..."
},
"domain": {
"url": "https://shopify.com",
"createdAt": "1996-04-28T00:00:00Z"
}
}
}
| Field | Description |
|---|
extracted.legalName | Legal name found on the website |
extracted.descriptions | Business description text |
extracted.addresses | Physical addresses found on the site |
extracted.people | Named individuals with titles (executives, contacts) |
extracted.contactInfo.emails | Contact email addresses |
extracted.contactInfo.phoneNumbers | Phone numbers in E.164 format |
extracted.registrations | Registration numbers mentioned on the site (GST, BN, etc.) |
industry.naicsCode | NAICS industry classification code |
industry.naicsDescription | Human-readable industry description |
domain.createdAt | Domain registration date |
Status values
| Status | Meaning |
|---|
success | Website was scraped and data extracted |
skipped | Website analysis was not requested |
error | Scraping failed (site unreachable, blocked, etc.) |
empty | Site was reachable but no structured data could be extracted |
People deduplication
People found on the website are merged with people found in registry data. The people top-level field in the response consolidates everyone across all sources — you won’t see duplicates between website.extracted.people and people.directors etc.
Website analysis adds 10–20 seconds to the response time. For the best results, provide the website URL directly rather than relying on automatic discovery.