Skip to content

Commit 6dbb5d4

Browse files
committed
frist draft finish
1 parent 960a201 commit 6dbb5d4

File tree

3 files changed

+284
-4
lines changed

3 files changed

+284
-4
lines changed

claude/restructure-report.md

Lines changed: 188 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,188 @@
1+
# Documentation Restructure Report
2+
3+
## Overview
4+
This report analyzes the documentation restructuring changes in the [PR#34](https://github.com/DataRecce/recce-docs/pull/34), comparing production URLs with the new structure and categorizing content changes.
5+
6+
## 1. URL Structure Comparison (Production → New)
7+
8+
### Navigation Structure Changes
9+
10+
**Production Structure:**
11+
- Overview
12+
- Getting Started
13+
- Web UI
14+
- Command Line Interface
15+
- Advanced Features
16+
- Scenarios
17+
- Architecture
18+
- Privacy & Terms
19+
- Cloud (Beta)
20+
21+
**New Structure:**
22+
- What's Recce (1-whats-recce)
23+
- Getting Started (2-getting-started)
24+
- Visualized Change (3-visualized-change)
25+
- Downstream Impacts (4-downstream-impacts)
26+
- Data Diffing (5-data-diffing)
27+
- Collaborate Validation (6-collaboration)
28+
- CI/CD (7-cicd)
29+
- Technical Concepts (8-technical-concepts)
30+
31+
### Complete URL Mapping
32+
33+
| Status | Production URL | New URL | Notes |
34+
|--------|---------------|---------|-------|
35+
| **SAME** | `/` (external redirect) | `/` (external redirect) | Home link unchanged |
36+
| **SAME** | `/blog/` (external) | `/blog/` (external) | Blog link unchanged |
37+
| **MODIFIED** | `/` (Overview) | `/1-whats-recce/` | Main landing page restructured |
38+
| **ARCHIVED** | `/demo/` | `/archived/demo/` | Demo page archived |
39+
| **MODIFIED** | `/get-started/` | `/2-getting-started/oss-vs-cloud/` | Split into multiple pages |
40+
| **ARCHIVED** | `/configure-diff/` | `/archived/configure-diff/` | Configuration guide archived |
41+
| **MODIFIED** | `/get-started-jaffle-shop/` | `/2-getting-started/get-started-jaffle-shop/` | Moved to section 2 |
42+
| **ARCHIVED** | `/start-with-dbt-cloud/` | `/archived/start-with-dbt-cloud/` | DBT Cloud guide archived |
43+
| **MODIFIED** | `/installation/` | `/2-getting-started/installation/` | Moved to section 2 |
44+
| **MODIFIED** | `/features/lineage/` | `/3-visualized-change/lineage/` | Reorganized under visualized change |
45+
| **MODIFIED** | `/features/query/` | `/5-data-diffing/query/` | Moved to data diffing section |
46+
| **MODIFIED** | `/features/checklist/` | `/6-collaboration/checklist/` | Moved to collaboration section |
47+
| **MODIFIED** | `/features/recce-debug/` | `/7-cicd/recce-debug/` | Moved to CI/CD section |
48+
| **ARCHIVED** | `/features/recce-run/` | `/archived/recce-cloud/recce-run/` | Archived under cloud |
49+
| **MODIFIED** | `/features/recce-summary/` | `/7-cicd/recce-summary/` | Moved to CI/CD section |
50+
| **MODIFIED** | `/features/state-file/` | `/8-technical-concepts/state-file/` | Moved to technical concepts |
51+
| **MODIFIED** | `/features/preset-checks/` | `/7-cicd/preset-checks/` | Moved to CI/CD section |
52+
| **ARCHIVED** | `/features/node-selection/` | `/archived/` | Integrated into other pages |
53+
| **MODIFIED** | `/features/impact-radius/` | `/4-downstream-impacts/impact-radius/` | New section created |
54+
| **MODIFIED** | `/features/breaking-change-analysis/` | `/4-downstream-impacts/breaking-change-analysis/` | Moved to downstream impacts |
55+
| **ARCHIVED** | `/features/column-level-lineage/` | `/3-visualized-change/column-level-lineage/` | Moved to visualized change |
56+
| **MODIFIED** | `/reference/configuration/` | `/8-technical-concepts/configuration/` | Moved to technical concepts |
57+
| **MODIFIED** | `/guides/scenario-dev/` | `/7-cicd/scenario-dev/` | Moved to CI/CD section |
58+
| **MODIFIED** | `/guides/scenario-pr-review/` | `/7-cicd/scenario-pr-review/` | Moved to CI/CD section |
59+
| **MODIFIED** | `/guides/scenario-ci/` | `/7-cicd/scenario-ci/` | Moved to CI/CD section |
60+
| **MODIFIED** | `/guides/best-practices-prep-env/` | `/7-cicd/best-practices-prep-env/` | Moved to CI/CD section |
61+
| **ARCHIVED** | `/architecture/overview/` | `/archived/overview/` | Architecture overview archived |
62+
| **ARCHIVED** | `/agreement/privacy-policy/` | External link | Now links to reccehq.com |
63+
| **ARCHIVED** | `/agreement/cookies-policy/` | External link | Now links to reccehq.com |
64+
| **ARCHIVED** | `/agreement/terms-of-use/` | External link | Now links to reccehq.com |
65+
| **ARCHIVED** | `/recce-cloud/*` | `/archived/recce-cloud/*` | Most cloud docs archived |
66+
67+
### New Pages Created
68+
69+
| New URL | Description |
70+
|---------|-------------|
71+
| `/1-whats-recce/community-support/` | New community support page |
72+
| `/2-getting-started/oss-vs-cloud/` | New comparison page |
73+
| `/2-getting-started/start-free-with-cloud/` | New cloud onboarding |
74+
| `/2-getting-started/cloud-5min-tutorial/` | New quick tutorial |
75+
| `/3-visualized-change/code-change/` | New code change visualization |
76+
| `/3-visualized-change/multi-models/` | New multi-model documentation |
77+
| `/4-downstream-impacts/metadata-first/` | New metadata approach guide |
78+
| `/4-downstream-impacts/transformation-types/` | New transformation types guide |
79+
| `/5-data-diffing/row-count-diff/` | New row count diffing guide |
80+
| `/5-data-diffing/profile-diff/` | New profile diffing guide |
81+
| `/5-data-diffing/value-diff/` | New value diffing guide |
82+
| `/5-data-diffing/topK-diff/` | New top-K diffing guide |
83+
| `/5-data-diffing/histogram-diff/` | New histogram diffing guide |
84+
| `/6-collaboration/share/` | New sharing documentation |
85+
| `/7-cicd/setup-cd/` | New CD setup guide |
86+
87+
## 2. Content Change Analysis
88+
89+
### New Content Requiring Full Review
90+
91+
| Page | Status | Review Required |
92+
|------|--------|----------------|
93+
| `1-whats-recce/index.md` | **NEW CONTENT** | ✅ Full review needed (completely rewritten introduction) |
94+
| `1-whats-recce/community-support.md` | **NEW PAGE** | ✅ Full review needed |
95+
| `2-getting-started/oss-vs-cloud.md` | **NEW PAGE** | ✅ Full review needed (detailed comparison table) |
96+
| `2-getting-started/start-free-with-cloud.md` | **NEW PAGE** | ✅ Full review needed |
97+
| `2-getting-started/cloud-5min-tutorial.md` | **NEW PAGE** | ✅ Full review needed |
98+
| `3-visualized-change/code-change.md` | **NEW PAGE** | ✅ Full review needed |
99+
| `3-visualized-change/multi-models.md` | **NEW PAGE** | ✅ Full review needed |
100+
| `4-downstream-impacts/impact-radius.md` | **NEW PAGE** | ✅ Full review needed |
101+
| `4-downstream-impacts/metadata-first.md` | **NEW PAGE** | ✅ Full review needed |
102+
| `4-downstream-impacts/transformation-types.md` | **NEW PAGE** | ✅ Full review needed |
103+
| `5-data-diffing/row-count-diff.md` | **NEW PAGE** | ✅ Full review needed (detailed tutorial with GIFs) |
104+
| `5-data-diffing/profile-diff.md` | **NEW PAGE** | ✅ Full review needed |
105+
| `5-data-diffing/value-diff.md` | **NEW PAGE** | ✅ Full review needed |
106+
| `5-data-diffing/topK-diff.md` | **NEW PAGE** | ✅ Full review needed |
107+
| `5-data-diffing/histogram-diff.md` | **NEW PAGE** | ✅ Full review needed |
108+
| `6-collaboration/share.md` | **NEW PAGE** | ✅ Full review needed |
109+
| `7-cicd/setup-cd.md` | **NEW PAGE** | ✅ Full review needed |
110+
111+
### Partial New Content / Enhanced Pages
112+
113+
| Page | Change Type | Review Required |
114+
|------|-------------|----------------|
115+
| `2-getting-started/installation.md` | **ENHANCED** | 🔄 Partial review (expanded from basic pip install) |
116+
| `3-visualized-change/lineage.md` | **ENHANCED** | 🔄 Partial review (significantly enhanced with new sections, tips, and explanations) |
117+
| `3-visualized-change/column-level-lineage.md` | **MOVED + MINOR EDITS** | 🔄 Minor review (moved from features/, minimal content changes) |
118+
| `6-collaboration/checklist.md` | **ENHANCED** | 🔄 Partial review (expanded with detailed workflow steps) |
119+
120+
### Minor Changes / Terminology Updates
121+
122+
| Page | Change Type | Review Required |
123+
|------|-------------|----------------|
124+
| `5-data-diffing/query.md` | **MINOR EDITS** | ⚪ Minimal review (mostly same content, minor formatting) |
125+
| `4-downstream-impacts/breaking-change-analysis.md` | **MINOR EDITS** | ⚪ Minimal review (removed icon, minor text changes) |
126+
127+
### Restructured Only (Same Content, Different URL)
128+
129+
| Page | Change Type | Review Required |
130+
|------|-------------|----------------|
131+
| `2-getting-started/get-started-jaffle-shop.md` | **MOVED** | ⚪ No content review (just moved) |
132+
| `7-cicd/best-practices-prep-env.md` | **MOVED** | ⚪ No content review (moved from guides/) |
133+
| `7-cicd/preset-checks.md` | **MOVED** | ⚪ No content review (moved from features/) |
134+
| `7-cicd/recce-debug.md` | **MOVED** | ⚪ No content review (moved from features/) |
135+
| `7-cicd/recce-summary.md` | **MOVED** | ⚪ No content review (moved from features/) |
136+
| `7-cicd/scenario-ci.md` | **MOVED** | ⚪ No content review (moved from guides/) |
137+
| `7-cicd/scenario-dev.md` | **MOVED** | ⚪ No content review (moved from guides/) |
138+
| `7-cicd/scenario-pr-review.md` | **MOVED** | ⚪ No content review (moved from guides/) |
139+
| `8-technical-concepts/configuration.md` | **MOVED** | ⚪ No content review (moved from reference/) |
140+
| `8-technical-concepts/state-file.md` | **MOVED** | ⚪ No content review (moved from features/) |
141+
142+
### Archived Content
143+
144+
| Page | Status | Action |
145+
|------|--------|--------|
146+
| All `/archived/*` files | **ARCHIVED** | ⚪ No review needed (preserved for reference) |
147+
148+
## 3. Summary Statistics
149+
150+
- **Total Pages Analyzed**: 47 active pages + 11 archived pages
151+
- **New Pages**: 16 pages requiring full content review
152+
- **Enhanced Pages**: 4 pages requiring partial review
153+
- **Minor Changes**: 2 pages requiring minimal review
154+
- **Restructured Only**: 10 pages (no content changes)
155+
- **Archived Pages**: 11 pages moved to archived/
156+
- **External Links**: 3 pages converted to external links
157+
158+
## 4. Review Priority Matrix
159+
160+
### High Priority (New Content)
161+
- Section 1: What's Recce (2 pages)
162+
- Section 2: Getting Started (3 new pages)
163+
- Section 3: Visualized Change (2 new pages)
164+
- Section 4: Downstream Impacts (3 new pages)
165+
- Section 5: Data Diffing (5 new pages)
166+
- Section 6: Collaboration (2 pages)
167+
- Section 7: CI/CD (1 new page)
168+
169+
### Medium Priority (Enhanced Content)
170+
- Enhanced installation guide
171+
- Enhanced lineage documentation
172+
- Moved but enhanced column-level lineage
173+
174+
### Low Priority (Restructured Only)
175+
- All moved pages with no content changes
176+
- Archived pages (for reference only)
177+
178+
## 5. Recommendations
179+
180+
1. **Content Review Focus**: Prioritize the 18 completely new pages for thorough review
181+
2. **Navigation Testing**: Test all URL redirects to ensure no broken links
182+
3. **Image Assets**: Verify all image paths have been updated correctly (significant reorganization in assets/)
183+
4. **SEO Impact**: Consider 301 redirects for changed URLs to maintain search rankings
184+
5. **User Experience**: The numbered learning path (1-8) provides better progressive learning structure
185+
186+
---
187+
188+
*Report generated from PR changes analysis comparing feature/pla-488-update-doc-structure-with-existing-content branch with main branch.*

claude/terminology.md

Lines changed: 93 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,15 @@ This guide helps maintain consistent, data-team-friendly language across all Rec
2020
| **code change** | code modification | Specific term for data team workflows |
2121
| **validation workflow** | testing workflow, QA process | How teams use Recce |
2222
| **diff** | comparison, delta | Data teams familiar with git diff, use freely |
23+
| **state file** | session file, cache | Stores validation results and checks for later use |
24+
| **preset checks** | default tests, template checks | Pre-configured validation rules for projects |
25+
| **lineage diff** | dependency comparison | Visual comparison of model relationships |
26+
| **breaking change analysis** | impact assessment | Automated detection of schema changes |
27+
| **value diff** | data comparison | Row-by-row data validation between environments |
28+
| **profile diff** | summary statistics | Column-level statistical comparisons |
29+
| **histogram diff** | distribution analysis | Numeric data distribution comparisons |
30+
| **top-K diff** | frequency analysis | Categorical data frequency comparisons |
31+
| **row count diff** | record count validation | Quick data volume verification |
2332
| **Recce**| recce | this is the brand name, should always use "Recce" |
2433
| **dbt** | DBT | this is the brand name, should always use lower cases|
2534

@@ -37,6 +46,13 @@ This guide helps maintain consistent, data-team-friendly language across all Rec
3746
| **automated validation** | CI/CD pipeline | Data processing vs deployment automation |
3847
| **change review** | code review | Reviewing data changes vs code changes |
3948
| **diff** | comparison | Data teams understand diff from git/version control |
49+
| **artifacts** | build outputs | dbt manifest.json, catalog.json files |
50+
| **run dbt** | execute dbt, build models | Preferred: "run dbt" not "execute" or "build" |
51+
| **materialization** | table creation strategy | How dbt creates tables (table, view, incremental) |
52+
| **dbt target** | environment config | profiles.yml target configuration |
53+
| **warehouse connection** | database connection | Data team context for connecting to warehouse |
54+
| **development workspace** | dev environment | Where data analysts work on models |
55+
| **production workspace** | prod environment | Live data warehouse with production data |
4056

4157
### Business Impact Language
4258

@@ -47,6 +63,11 @@ This guide helps maintain consistent, data-team-friendly language across all Rec
4763
| **confident releases** | successful deployments | User empowerment over technical success |
4864
| **team collaboration** | workflow integration | People-first over tool-first |
4965
| **validate changes** | test modifications | Active validation vs passive testing |
66+
| **validation checklist** | test suite | Collaborative validation workflow for teams |
67+
| **share findings** | distribute results | Team collaboration on validation results |
68+
| **review together** | collaborate on validation | Team-based change review process |
69+
| **check status** | test results | Current state of validation checks |
70+
| **approve changes** | sign off on deployment | Business approval vs technical deployment |
5071

5172
## Terms That Confuse Data Teams
5273

@@ -72,6 +93,16 @@ This guide helps maintain consistent, data-team-friendly language across all Rec
7293
- **Software team thinks**: Unit/integration tests for code
7394
- **✅ Use instead**: "validation" or "data quality checks"
7495

96+
**Build**
97+
- **Data team thinks**: "dbt run" - executing transformations
98+
- **Software team thinks**: Compile/package application
99+
- **✅ Use instead**: "run models" or "execute transformations"
100+
101+
**Server**
102+
- **Data team thinks**: Database server (Snowflake, BigQuery)
103+
- **Software team thinks**: Web server or application server
104+
- **✅ Use instead**: "Recce UI" or "Recce interface" for the tool
105+
75106
### ⚠️ Medium Confusion Terms
76107

77108
**Model**
@@ -89,6 +120,21 @@ This guide helps maintain consistent, data-team-friendly language across all Rec
89120
- **Software context**: Deployment target or goal
90121
- **✅ Clarify**: "dbt target" when referring to profiles.yml
91122

123+
**Artifacts**
124+
- **Data context**: dbt-generated metadata files (manifest.json, catalog.json)
125+
- **Software context**: Build outputs, compiled binaries
126+
- **✅ Clarify**: "dbt artifacts" or "metadata files"
127+
128+
**Workspace**
129+
- **Data context**: Development area in data warehouse
130+
- **Software context**: IDE or development environment
131+
- **✅ Clarify**: "development workspace" vs "local workspace"
132+
133+
**Instance**
134+
- **Data context**: Recce UI session or warehouse instance
135+
- **Software context**: Running application or server
136+
- **✅ Clarify**: "Recce session" or "warehouse connection"
137+
92138
## Terminology Alert System
93139

94140
When reviewing documentation, flag confusing terms with this format:
@@ -141,6 +187,48 @@ While software teams deploy applications, data teams release model changes to th
141187
Configure your dbt target (the warehouse connection in profiles.yml) to point to your development environment.
142188
```
143189

190+
## Conflicting Usage Patterns
191+
192+
These terms are used inconsistently across the documentation, creating potential confusion. Document conflicts here without changing the content immediately.
193+
194+
### 🔀 Terms Used Inconsistently
195+
196+
**"Environment" vs "Target" vs "Workspace"**
197+
- **Data team perspective**: "Target" refers to dbt profiles.yml config, "Environment" means warehouse type (Snowflake vs BigQuery)
198+
- **Software team perspective**: "Environment" means dev/staging/prod deployment context
199+
- **Current usage in docs**: Mixed usage - sometimes "environment" for dbt targets, sometimes for deployment stages
200+
- **Recommendation**: Use "dbt target" for profiles.yml, "development workspace" for where analysts work
201+
202+
**"Check" vs "Validation" vs "Test"**
203+
- **Data team perspective**: "Validation" sounds more professional than "testing", "checks" are specific validations
204+
- **Software team perspective**: "Tests" are automated code verification, "checks" are CI/CD gates
205+
- **Current usage in docs**: Mixed - "preset checks", "validation workflow", "test your models"
206+
- **Recommendation**: Standardize on "validation" for data quality, "checks" for specific validation rules
207+
208+
**"Build" vs "Run" vs "Execute"**
209+
- **Data team perspective**: "Run dbt" is the standard command, "build" sounds like compiling code
210+
- **Software team perspective**: "Build" means compile/package, "execute" means run program
211+
- **Current usage in docs**: Mixed usage - "build your models", "run dbt", "execute transformations"
212+
- **Recommendation**: Use "run" for dbt operations, avoid "build" and "execute"
213+
214+
**"Review" vs "Audit" vs "Validation"**
215+
- **Data team perspective**: "Review" is collaborative, "audit" is formal compliance checking
216+
- **Software team perspective**: "Review" is code review process, "audit" is security scanning
217+
- **Current usage in docs**: "Review mode", "change review", "validation workflow"
218+
- **Recommendation**: Use "review" for collaborative processes, "validation" for automated checks
219+
220+
**"Pipeline" vs "Workflow"**
221+
- **Data team perspective**: "Pipeline" means data transformation flow (dbt models)
222+
- **Software team perspective**: "Pipeline" means CI/CD automation pipeline
223+
- **Current usage in docs**: "CI/CD pipeline", "validation workflow", "automation workflow"
224+
- **Recommendation**: "Data pipeline" for transformations, "automation workflow" for CI/CD
225+
226+
**"Deploy" vs "Release" vs "Publish"**
227+
- **Data team perspective**: "Release" means making changes live, "deploy" sounds like infrastructure
228+
- **Software team perspective**: "Deploy" means publish application, "release" means version release
229+
- **Current usage in docs**: Mixed - "deploy changes", "release changes", "publish results"
230+
- **Recommendation**: Use "release" for data changes, "deploy" only for infrastructure
231+
144232
## Maintenance Guidelines
145233

146234
### Adding New Terms
@@ -174,6 +262,11 @@ Before publishing, verify:
174262
| "Production deployment" | "Production release" | Making changes live |
175263
| "Development environment" | "Development warehouse" | Where you develop |
176264
| "Code review" | "Change review" | Reviewing data modifications |
265+
| "Build models" | "Run models" | dbt model execution |
266+
| "Execute dbt" | "Run dbt" | Standard dbt command usage |
267+
| "Check results" | "Validation status" | Status of data validation |
268+
| "Recce server" | "Recce interface" | The UI tool itself |
269+
| "Database connection" | "Warehouse connection" | Data team context |
177270

178271
---
179272

claude/writing-principles.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -143,13 +143,12 @@ Avoid Uncertainty Language, like:
143143
Maintain consistency with the 9-section structure:
144144
1. **What's Recce** - Introduction, value proposition, use cases
145145
2. **Getting Started** - Installation, tutorials, first steps
146-
3. **View Modified** - Core functionality, diff analysis
146+
3. **Visualized Change** - Core functionality, diff analysis
147147
4. **Downstream Impacts** - Impact analysis, dependency mapping
148148
5. **Data Diffing** - Query comparison, validation techniques
149149
6. **Collaborate Validation** - Team workflows, checklist management
150-
7. **Workflow Integration** - CI/CD, automation, best practices
151-
8. **Lots Models** - Scale considerations, performance optimization
152-
9. **Technical Concepts** - Architecture, configuration, security
150+
7. **CI/CD** - CI/CD, automation, best practices
151+
8. **Technical Concepts** - Architecture, configuration
153152

154153
These structure is to align users adoption process. Start from using in PR, colloaborate with team, and using in dev time. And know more technical behind it.
155154

0 commit comments

Comments
 (0)