Skip to content

Commit f56b15c

Browse files
raphaela-nawaclaude
andcommitted
Fix industry field extraction to show actual industries
- Add explicit **Industry:** field to all project READMEs (Days 1-10) - Update README updater to extract from **Industry:** field instead of stakeholder role - Replace stakeholder roles with actual industry sectors: * Day 01: Marketing/Advertising (not "Marketing consultant") * Day 02: Creator Economy/Social Media * Day 03: Legal/Compliance (not "Legal/Compliance Team") * Day 04: Crypto/Blockchain (not "Blockchain/Crypto Analyst") * Day 05: Cultural Heritage/Museums (not "Data Analyst") * Day 06: SaaS/Software (not "C-level") * Day 07: Hospitality/Travel (not "Pousada Owner") * Day 08: SaaS/Software (not "MBA, Strategy") * Day 09: Hospitality/Property Management (not "Property Manager") * Day 10: Wealth Management/Finance (not "Wealth Planning Specialist") Result: Main README now displays actual industry sectors for better portfolio categorization. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
1 parent 1e9fce6 commit f56b15c

File tree

12 files changed

+26
-31
lines changed

12 files changed

+26
-31
lines changed

README.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -23,16 +23,16 @@ Each one ships with full code and documentation.
2323

2424
| Day | Pillar | Project | Industry | Status | Code |
2525
|-----|--------|---------|----------|--------|------|
26-
| 1 | Ingestion | GA4 + Google Ads → BigQuery Pipeline | Marketing consultant | ✅ Complete | [Day 01](./day01) |
27-
| 2 | Ingestion | Creator Intelligence System | TBD | ✅ Complete | [Day 02](./day02) |
28-
| 3 | Ingestion | GDPR Lead Ingestion Pipeline | Legal/Compliance Team | ✅ Complete | [Day 03](./day03) |
29-
| 4 | Ingestion | Cardano Blockchain Transparency Pipeline | Blockchain/Crypto Analyst | ✅ Complete | [Day 04](./day04) |
30-
| 5 | Ingestion | Museu Ipiranga Cultural Data Pipeline | Paula (Cultural Data Analyst) | ✅ Complete | [Day 05](./day05) |
31-
| 6 | Modeling | SaaS Health Metrics Foundation | SaaS Executive (C-level) | ✅ Complete | [Day 06](./day06) |
32-
| 7 | Modeling | Hospitality LTV & Cohort Model | Carol (Pousada Owner) | ✅ Complete | [Day 07](./day07) |
33-
| 8 | Modeling | SaaS Growth Funnel & Cohort Analysis (dbt) | Patrick (MBA, Strategy) | ✅ Complete | [Day 08](./day08) |
34-
| 9 | Modeling | Property Operations Data Warehouse (dbt) | Jo (Independent Property Manager) | ✅ Complete | [Day 09](./day09) |
35-
| 10 | Modeling | Family Office Asset Management Data Warehouse | Rafael (Cross-Border Wealth Planning Specialist) | ✅ Complete | [Day 10](./day10) |
26+
| 1 | Ingestion | GA4 + Google Ads → BigQuery Pipeline | Marketing/Advertising | ✅ Complete | [Day 01](./day01) |
27+
| 2 | Ingestion | Creator Intelligence System | Creator Economy/Social Media | ✅ Complete | [Day 02](./day02) |
28+
| 3 | Ingestion | GDPR Lead Ingestion Pipeline | Legal/Compliance | ✅ Complete | [Day 03](./day03) |
29+
| 4 | Ingestion | Cardano Blockchain Transparency Pipeline | Crypto/Blockchain | ✅ Complete | [Day 04](./day04) |
30+
| 5 | Ingestion | Museu Ipiranga Cultural Data Pipeline | Heritage/Museums | ✅ Complete | [Day 05](./day05) |
31+
| 6 | Modeling | SaaS Health Metrics Foundation | SaaS/Software | ✅ Complete | [Day 06](./day06) |
32+
| 7 | Modeling | Hospitality LTV & Cohort Model | Hospitality/Travel | ✅ Complete | [Day 07](./day07) |
33+
| 8 | Modeling | SaaS Growth Funnel & Cohort Analysis (dbt) | SaaS/Software | ✅ Complete | [Day 08](./day08) |
34+
| 9 | Modeling | Property Operations Data Warehouse (dbt) | Hospitality/Property Management | ✅ Complete | [Day 09](./day09) |
35+
| 10 | Modeling | Family Office Asset Management Data Warehouse | Wealth Management/Finance | ✅ Complete | [Day 10](./day10) |
3636
| 11 | Orchestration | TBD | TBD | 🚧 Planned | [Day 11](./day11) |
3737
| 12 | Orchestration | TBD | TBD | 🚧 Planned | [Day 12](./day12) |
3838
| 13 | Orchestration | TBD | TBD | 🚧 Planned | [Day 13](./day13) |

common/utils/update_readme.py

Lines changed: 5 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -134,20 +134,12 @@ def scan_project(self, day: int) -> ProjectInfo:
134134

135135
# Extract industry (look for various patterns)
136136
industry_patterns = [
137-
# Pattern 1: **For:** Role/Industry | (highest priority, new template format)
138-
r'\*\*For:\*\*\s*(.+?)\s*\|',
139-
# Pattern 2: **For:** Name (Role/Industry)
140-
r'\*\*For:\*\*\s*[^(]+\(([^)]+)\)',
141-
# Pattern 3: **Stakeholder:** Name - Role/Industry
142-
r'\*\*Stakeholder:\*\*\s*[^-]+-\s*([^(]+?)(?:\s+who\s+|$)',
143-
# Pattern 4: **Industry:** or **Industry **
144-
r'\*\*Industry[:\s]+\*\*\s*(.+)',
145-
# Pattern 5: **Built For:** ... **Role/Context:**
146-
r'\*\*Built For[:\s]+\*\*[^\n]*\n\*\*Role/Context[:\s]+\*\*\s*(.+)',
147-
# Pattern 6: Table format
137+
# Pattern 1: **Industry:** Explicit field (highest priority)
138+
r'\*\*Industry:\*\*\s*([^|\n]+)',
139+
# Pattern 2: **For:** ... | **Industry:** ...
140+
r'\*\*Industry:\*\*\s*(.+?)\s*\|',
141+
# Pattern 3: Table format
148142
r'\|\s*\d+\s*\|\s*\w+\s*\|\s*[^|]+\|\s*(.+?)\s*\|',
149-
# Pattern 7: One-line pitch or business problem with industry context
150-
r'\*\*Business Problem:\*\*\s*([^.]+)',
151143
]
152144
for pattern in industry_patterns:
153145
industry_match = re.search(pattern, readme_content, re.IGNORECASE)

day01/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@
22

33
**Project 1A - Ingestion Pillar**
44
**Stakeholder:** Daud - Marketing consultant who needs automated GA4 + Google Ads reports for agency clients
5+
**Industry:** Marketing/Advertising
56

67
## 📋 Overview
78

day02/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,8 @@
22

33
**Instagram Business Analytics & Growth Intelligence Platform**
44

5+
**Industry:** Creator Economy/Social Media
6+
57
An end-to-end analytics system for Instagram creators, built to help a 100K-follower account reach 200K in 6 months through data-driven insights and AI-powered recommendations.
68

79
---

day03/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414

1515
**Business Impact:** Automated compliance enforcement reduces legal risk and ensures 100% of leads have documented consent status and retention dates.
1616

17-
**For:** Legal/Compliance Team | **Time:** 3 hours | **Status:** ✅ Complete
17+
**For:** Legal/Compliance Team | **Industry:** Legal/Compliance | **Time:** 3 hours | **Status:** ✅ Complete
1818

1919
---
2020

day04/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414

1515
**Business Impact:** Enables data-driven verification of blockchain values - decentralization proof (3000 pools vs Bitcoin's 5), accessibility proof (avg $0.17 fees vs Ethereum's $5-50), and real adoption metrics.
1616

17-
**For:** Blockchain/Crypto Analyst | **Time:** 3 hours | **Status:** ✅ Complete
17+
**For:** Blockchain/Crypto Analyst | **Industry:** Crypto/Blockchain | **Time:** 3 hours | **Status:** ✅ Complete
1818

1919
---
2020

day05/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@
2727

2828
**Business Impact:** Reduced artifact research time from 4+ hours of manual listening to 5 minutes of querying structured data, enabling cultural metadata enrichment and content-catalog linking.
2929

30-
**For:** Paula (Cultural Data Analyst) | **Time:** 3 hours | **Status:** ✅ Complete
30+
**For:** Paula (Cultural Data Analyst) | **Industry:** Cultural Heritage/Museums | **Time:** 3 hours | **Status:** ✅ Complete
3131

3232
---
3333

day06/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@
2727

2828
**Business Impact:** Reduced monthly reporting time from 4 hours (manual Excel) to 5 minutes (automated SQL queries), enabling weekly instead of quarterly reviews of key SaaS metrics.
2929

30-
**For:** SaaS Executive (C-level) | **Time:** 3 hours | **Status:** ✅ Complete
30+
**For:** SaaS Executive (C-level) | **Industry:** SaaS/Software | **Time:** 3 hours | **Status:** ✅ Complete
3131

3232
---
3333

day07/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88

99
**Business Impact:** Identified that top 10% of guests generate 3-5x more revenue, with winter cohorts showing 25% higher retention than summer cohorts. System enables data-driven decisions on marketing budget allocation and VIP engagement.
1010

11-
**For:** Carol (Pousada Owner) | **Time:** 3 hours | **Status:** ✅ Complete
11+
**For:** Carol (Pousada Owner) | **Industry:** Hospitality/Travel | **Time:** 3 hours | **Status:** ✅ Complete
1212

1313
---
1414

day08/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@
2727

2828
**Business Impact:** Automated cohort analysis reduces weekly reporting time from 4 hours (manual SQL + spreadsheets) to 2 minutes (single dbt run), enabling daily instead of weekly growth reviews.
2929

30-
**For:** Patrick (MBA, Strategy) | **Time:** 3 hours | **Status:** ✅ Complete
30+
**For:** Patrick (MBA, Strategy) | **Industry:** SaaS/Software | **Time:** 3 hours | **Status:** ✅ Complete
3131

3232
---
3333

0 commit comments

Comments
 (0)