What You'll Learn
  • How to connect 12 data sources into a single fully automated weekly workflow
  • The 4-phase architecture: collection, AI analysis, assembly, and personalized delivery
  • Prompt engineering decisions that make AI reporting reliable and action-oriented
  • Why department-level analysis beats one giant LLM call — and how to split it
  • The 3 failures we hit in production and exactly how we fixed them
Table of Contents
  1. The Problem We Were Solving
  2. Data Sources Connected
  3. The n8n Workflow Architecture
  4. The Cost to Run It
  5. What Didn't Work the First Time
  6. Replicating This for Your Business

A 7-person reporting team. 12 data sources. Weekly board presentations. By Monday at 7am, every stakeholder had a tailored, context-aware summary in their inbox — assembled by an n8n workflow and GPT-4o. Nobody sent it. Nobody even reviewed it before it went out. Here's exactly how it works.

"The best automation isn't the one that does what humans do. It's the one that does what humans were too burned out to do properly."

The Problem We Were Solving

Our client — a mid-market e-commerce company — had a dedicated analytics and reporting function. Three full-time analysts and four supporting roles. Their weekly output: a 40-page PDF that most executives didn't read and a 20-minute Monday standup that usually ran over 45 minutes.

The core dysfunction: data was available in near-real-time, but insight delivery lagged by a week. By the time the report landed, decisions had already been made on gut feel. The automation didn't just replace people — it made the reports actually useful by making them timely and personalized.

Data Sources Connected

The workflow ingests from 12 sources via API or webhook. Here's the full stack:

12data sources
7amMonday delivery
0manual steps

The n8n Workflow Architecture

The workflow runs on a self-hosted n8n instance. It triggers every Sunday at 11pm via a Cron node. Here's the high-level flow:

Phase 1: Data Collection (11pm–1am)

All 12 API nodes fire in parallel using n8n's parallel execution. Each node has error handling with a fallback to cached data from the previous week. If an API returns an error, the workflow logs it but continues — a missing Gorgias block doesn't kill the Shopify analysis.

Data is normalized into a consistent JSON schema by a Code node running JavaScript. This normalization step is what makes the AI analysis reliable — GPT-4o sees consistent field names regardless of which source the data came from.

Phase 2: AI Analysis (1am–3am)

The normalized data is split by department: Revenue, Marketing, Operations, SEO. Each department has its own GPT-4o call with a specialized system prompt written for that function's stakeholders.

Key prompt engineering decisions that made this work:

Phase 3: Report Assembly (3am–4am)

Department summaries are assembled into a structured report by a second GPT-4o call acting as an editor. Its job: remove redundancy between departments, identify cross-functional implications, and write an executive summary for the CEO.

The output is a clean HTML email template — not a PDF. We learned early that PDFs don't get read. A well-formatted email with clear sections, colour-coded indicators, and embedded sparkline charts gets a 94% open rate within the first hour of delivery.

Phase 4: Personalized Distribution (4am–7am)

The final stage uses a stakeholder database in Airtable. Each stakeholder has a profile: role, department, KPIs they own, preferred level of detail, and any custom metrics they've flagged as high priority.

A final GPT-4o pass filters and reorders the report content for each recipient. The CEO gets a 2-page executive view with the 5 most important actions. The head of paid media gets the full marketing section plus relevant operational context. The supply chain manager gets inventory and ops only.

Personalization isn't a nice-to-have — it's what determines whether your automation gets used or ignored. Generic reports get deleted. Relevant ones get acted on.

The Cost to Run It

People always ask about API costs. Here's the real breakdown for a weekly run covering all 12 sources and 15 stakeholders:

The payback period was under 48 hours after deployment.

What Didn't Work the First Time

This wasn't a smooth build. Three major failures before the system worked reliably:

  1. Token overflow — passing all 12 data sources to a single GPT-4o call exceeded context limits and produced degraded output. Solution: department-level splitting with separate calls.
  2. Hallucinated metrics — GPT-4o occasionally invented plausible-sounding numbers when data was ambiguous. Solution: adding a validation step that cross-checks any mentioned metric against the source data before the report goes out.
  3. API rate limits at scale — firing all data collection nodes simultaneously hit rate limits on 3 of the 12 sources. Solution: staggered retry logic with exponential backoff per node.

Replicating This for Your Business

This architecture isn't client-specific. The pattern — collect, normalize, analyze by domain, assemble, personalize, distribute — works for any business with regular reporting needs. The key variables to customize are the data sources, the stakeholder profiles, and the analysis prompts for your specific KPIs.

If you have 3 or more people spending time on weekly reporting, this automation has a meaningful ROI on day one. If you want help scoping and building it, reach out — this is a standard engagement for us.

Key Takeaways
  • Split AI analysis by department — one GPT-4o call per domain prevents token overflow and produces sharper, more focused output
  • Anomaly-first prompting: ask for deviations, not summaries — if something's normal, "on track" in one sentence is the right answer
  • Always validate AI-mentioned metrics against source data before delivery — hallucinated numbers in executive reports destroy trust fast
  • HTML email beats PDF — well-formatted email with colour-coded indicators gets 94% open rate; PDFs get filed and forgotten
  • Under $1.50/week in API costs vs. $35,000/month team — the ROI conversation takes about 30 seconds
Next Programmatic SEO in 2026: Scale to 10,000 Pages Without Getting Penalized