Case studies
Internal tools

B2B lead scraping & email verification

n8n-native prospecting pipeline that reads targeting inputs from Google Sheets, builds Apollo searches, scrapes via Apify with dedupe and normalization, keeps only verified emails, and appends clean lists back to Sheets—with rate limits tuned for stability.

n8nApolloApifyGoogle SheetsWeb scrapingLead generation
B2B lead scraping & email verification product screenshot

Results at a glance

Dynamic Apollo URL construction from sheet-driven criteria

Duplicate suppression and verified-email-only outputs

Rate limiting and field normalization baked into the graph

Challenge

Growth teams were exporting brittle CSVs from multiple tools, losing dedupe discipline, and burning slots on bad emails. They needed a repeatable graph that marketing could parameterize without engineering babysitting every pull.

What Habrig built

  • Google Sheets as the control plane for titles, company filters, and batch sizing familiar to operators
  • Scheduled or on-demand triggers pulling inputs, looping titles, and assembling crawler-ready payloads
  • Apify-powered scraping nodes with JSON shaping before Apollo-specific enrichment steps
  • Hard gates that drop unverified addresses and collapse duplicates prior to append operations
  • Limiter nodes coordinated with third-party quotas to avoid bans or partial exports
  • Append-only Sheet targets with schema validation so downstream CRM imports do not corrupt existing rows

Outcomes

  • Cleaner lead files ready for CRM import or sequencer tools
  • Less time reconciling “which sheet is canonical?” across reps
  • Operational transparency via n8n execution history when a batch underfills expectations

Technology

frontend

Sheets-first UX for inputs and outputs; optional notifications when jobs finish

backend

n8n loops, HTTP tooling toward Apollo/Apify endpoints, and deterministic cleanup subgraphs

database

Google Sheets as the staging warehouse; optional downstream sync hooks left adapter-friendly

infrastructure

Hosted automation tier sized for nightly pulls with credential rotation support

monitoring

Row-count sanity checks, runtime alerts on repeated HTTP failures, and archived run logs

cicd

Workflow exports reviewed before credential swaps; test sheets isolated from production tabs

Execution detail

Product & frontend

  • Google Sheets as the control plane for titles, company filters, and batch sizing familiar to operators

Backend & data

  • Scheduled or on-demand triggers pulling inputs, looping titles, and assembling crawler-ready payloads
  • Apify-powered scraping nodes with JSON shaping before Apollo-specific enrichment steps
  • Hard gates that drop unverified addresses and collapse duplicates prior to append operations

Platform & delivery

  • Limiter nodes coordinated with third-party quotas to avoid bans or partial exports
  • Append-only Sheet targets with schema validation so downstream CRM imports do not corrupt existing rows

Gallery

Project images 1
1 / 2

Plan your next release

Tell us what shipped, what is at risk, and what success looks like. We will respond with a practical path.

Book a consultation