Website analysis scrapes a company’s website and uses AI to extract structured data: people, addresses, contact information, industry classification, and more.

Enabling website analysis

Add websiteAnalysis: true to your request. Providing the website URL is strongly recommended: if it is omitted, the API attempts to discover the URL automatically, which adds to the response time.
{
  "name": "Shopify Inc",
  "website": "https://shopify.com",
  "websiteAnalysis": true
}
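As a minimal sketch, the request body above could be assembled in Python as follows. The helper name build_request_body is hypothetical and not part of the API; the endpoint and authentication are not covered in this section, so the sketch only serializes the body.

```python
import json
from typing import Optional


def build_request_body(name: str, website: Optional[str] = None) -> str:
    """Serialize a request body with website analysis enabled.

    Hypothetical helper: only the field names (name, website,
    websiteAnalysis) come from the documented request example.
    """
    body = {"name": name, "websiteAnalysis": True}
    if website:
        # Supplying the URL lets the API skip automatic discovery.
        body["website"] = website
    return json.dumps(body)


print(build_request_body("Shopify Inc", "https://shopify.com"))
```

Leaving website out produces a body with only name and websiteAnalysis, in which case the API falls back to automatic discovery.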

What gets extracted

Results appear in the website field of the response:
{
  "website": {
    "status": "success",
    "url": "https://shopify.com",
    "scrapedAt": "2024-01-15T10:30:00Z",
    "extracted": {
      "legalName": "Shopify Inc.",
      "descriptions": [
        "Shopify is a commerce platform that allows anyone to set up an online store and sell their products."
      ],
      "addresses": [
        {
          "label": "Headquarters",
          "street": "151 O'Connor Street",
          "city": "Ottawa",
          "province": "Ontario",
          "postalCode": "K2P 2L8",
          "country": "Canada"
        }
      ],
      "people": [
        { "name": "Tobias Lütke", "title": "Chief Executive Officer" }
      ],
      "contactInfo": {
        "emails": ["support@shopify.com"],
        "phoneNumbers": ["+18006578057"]
      },
      "registrations": [
        { "label": "GST/HST Number", "value": "123456789RT0001" }
      ]
    },
    "industry": {
      "naicsCode": "454110",
      "naicsDescription": "Electronic Shopping and Mail-Order Houses",
      "reasoning": "Shopify operates an e-commerce platform..."
    },
    "domain": {
      "url": "https://shopify.com",
      "createdAt": "1996-04-28T00:00:00Z"
    }
  }
}

Extracted fields

| Field | Description |
| --- | --- |
| extracted.legalName | Legal name found on the website |
| extracted.descriptions | Business description text |
| extracted.addresses | Physical addresses found on the site |
| extracted.people | Named individuals with titles (executives, contacts) |
| extracted.contactInfo.emails | Contact email addresses |
| extracted.contactInfo.phoneNumbers | Phone numbers in E.164 format |
| extracted.registrations | Registration numbers mentioned on the site (GST, BN, etc.) |
| industry.naicsCode | NAICS industry classification code |
| industry.naicsDescription | Human-readable industry description |
| domain.createdAt | Domain registration date |
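The field paths listed here can be read from a parsed response along these lines. This is a sketch, not a client library: summarize_website is a hypothetical helper, and it assumes that any field may be absent or null when the site did not expose it, which this section does not confirm.

```python
from typing import Any, Dict


def summarize_website(response: Dict[str, Any]) -> Dict[str, Any]:
    """Collect a few documented fields, tolerating missing keys."""
    website = response.get("website") or {}
    extracted = website.get("extracted") or {}
    contact = extracted.get("contactInfo") or {}
    industry = website.get("industry") or {}
    domain = website.get("domain") or {}
    return {
        "legal_name": extracted.get("legalName"),
        "people": [
            (p.get("name"), p.get("title"))
            for p in extracted.get("people") or []
        ],
        "emails": contact.get("emails") or [],
        "phone_numbers": contact.get("phoneNumbers") or [],
        "naics_code": industry.get("naicsCode"),
        "domain_created_at": domain.get("createdAt"),
    }


# Trimmed copy of the example response shown earlier.
sample = {
    "website": {
        "status": "success",
        "extracted": {
            "legalName": "Shopify Inc.",
            "people": [
                {"name": "Tobias Lütke", "title": "Chief Executive Officer"}
            ],
            "contactInfo": {
                "emails": ["support@shopify.com"],
                "phoneNumbers": ["+18006578057"],
            },
        },
        "industry": {"naicsCode": "454110"},
        "domain": {"createdAt": "1996-04-28T00:00:00Z"},
    }
}

summary = summarize_website(sample)
print(summary["legal_name"], summary["naics_code"])
```

Note that industry and domain sit beside extracted under website, not inside it, so they are read from the website object directly.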

Status values

| Status | Meaning |
| --- | --- |
| success | Website was scraped and data extracted |
| skipped | Website analysis was not requested |
| error | Scraping failed (site unreachable, blocked, etc.) |
| empty | Site was reachable but no structured data could be extracted |
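One way a caller might branch on these status values is sketched below. The function name extracted_or_none and the choice to raise on error are illustrative assumptions, not behavior prescribed by the API; only the four status strings come from the list above.

```python
from typing import Any, Dict, Optional


def extracted_or_none(response: Dict[str, Any]) -> Optional[Dict[str, Any]]:
    """Return website.extracted on success, None when there is nothing to use.

    Hypothetical helper: raises RuntimeError when scraping failed and
    ValueError for a status string that is not documented.
    """
    website = response.get("website") or {}
    status = website.get("status")
    if status == "success":
        return website.get("extracted") or {}
    if status in ("skipped", "empty"):
        # Not requested, or reachable but nothing structured was found.
        return None
    if status == "error":
        raise RuntimeError("website analysis failed (site unreachable, blocked, etc.)")
    raise ValueError(f"unexpected website status: {status!r}")
```

Treating skipped and empty alike keeps the calling code simple; a caller that wants to retry with an explicit website URL may prefer to distinguish them.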

People deduplication

People found on the website are merged with people found in registry data. The top-level people field in the response consolidates everyone across all sources, so you won’t see duplicates between website.extracted.people and people.directors, etc.
Website analysis adds 10–20 seconds to the response time. For the best results, provide the website URL directly rather than relying on automatic discovery.