Commit 63a0984

sc-protegrity and protegrity-gheuser authored
push to pre-release branch (#33) (#34)
Co-authored-by: protegrity-gheuser <[email protected]>
1 parent 0f14995 commit 63a0984

46 files changed (+84070 -92 lines changed)

CHANGELOG.md

Lines changed: 138 additions & 5 deletions
@@ -2,7 +2,143 @@
22

33
All notable changes to the Protegrity Developer Edition project will be documented in this file.
44

5-
## [Current Release]
5+
## [1.1.0] - 2025-12-15
6+
7+
### 🎉 Major New Features
8+
9+
#### General Enhancements
10+
- **README Improvements**: Added badges for improved visibility and quick access to key resources
11+
- **Repository Restructuring**: Reorganized folders for better organization of samples and source code
12+
- **Documentation Updates**: Comprehensive updates to getting started guides and feature documentation
13+
14+
#### Data Discovery v1.1.1
15+
- **Structured Text Classification**: Added support for structured data classification
16+
- **Harmonized Classifications**: Introduced categorized "harmonized" entity classifications for consistent data element mapping
17+
- **Performance Improvements**: General enhancements to classification accuracy and speed
18+
- **Enhanced Entity Mapping**: Updated entity-to-data-element mapping to align with Data Discovery 1.1
19+
20+
#### Semantic Guardrails v1.1
21+
- **Richer Examples**: Included more comprehensive examples in sample files for easier understanding
22+
- **Vertical-Specific Models**: Added pre-trained support for additional industry verticals (Finance and Healthcare)
23+
- **Jupyter Notebook Sample**: New interactive notebook for seamless evaluation and execution (`samples/python/sample-app-semantic-guardrails/`)
24+
- **Port Updates**: Service now runs on port 8581 with updated image paths
25+
26+
#### Synthetic Data Generation (NEW)
27+
- **Synthetic Data Feature**: New capability for generating synthetic test data to support testing and experimentation
28+
- **Jupyter Notebook Sample**: Interactive notebook for synthetic data generation (`samples/python/sample-app-synthetic-data/`)
29+
- **Docker Compose Profile**: New `synthetic` profile for orchestrating Synthetic Data services
30+
- **Service Integration**: Seamless integration with existing Developer Edition infrastructure
31+
32+
#### Expanded Language & Platform Support
33+
- **Java SDK Samples**: Complete Java implementation with CLI scripts for all major workflows
34+
  - Data discovery, classification, protection, and redaction
35+
  - Full source code provided for customization and compilation
36+
  - Cross-platform compatibility (Linux, macOS, Windows)
37+
- **Python SDK Updates**: Enhanced Python samples with better error handling and documentation
38+
- **Dual Language Support**: Maintained feature parity between Python and Java implementations
39+
- **Java 11+ Compatibility**: Ensured compatibility with modern Java versions
40+
- **Python 3.12+ Support**: Updated minimum Python version requirement
41+
42+
### 🏗️ Architecture & Structure Changes
43+
44+
#### Repository Structure Enhancements
45+
- **New Java Samples Directory**: Added `samples/java/` with comprehensive sample applications
46+
  - `sample-app-find.sh` - PII discovery CLI
47+
  - `sample-app-find-and-redact.sh` - Discovery and redaction workflow
48+
  - `sample-app-find-and-protect.sh` - Discovery and protection workflow
49+
  - `sample-app-find-and-unprotect.sh` - Discovery and unprotection workflow
50+
  - `sample-app-protection.sh` - Direct protection/unprotection CLI
51+
  - Windows `.bat` equivalents for all scripts
52+
- **Enhanced Python Samples**: Updated `samples/python/` structure
53+
  - New semantic guardrails Jupyter notebook
54+
  - New synthetic data Jupyter notebook
55+
- **Sample Data Organization**: Improved organization of configuration files and test data
56+
- **Cross-Platform Scripts**: Ensured sample scripts run on Linux, macOS, and Windows (shell scripts plus `.bat` equivalents)
57+
58+
#### Docker Compose Evolution
59+
- **Multi-Profile Support**: Enhanced `docker-compose.yml` with profile-based orchestration
60+
  - Default profile: Classification and Semantic Guardrail services
61+
  - `synthetic` profile: Adds Synthetic Data generation services
62+
- **Service Dependencies**: Proper orchestration and startup order management
63+
- **Resource Optimization**: Improved container download and deployment efficiency
64+
65+
#### Service Endpoints
66+
- **Classification API**: `http://localhost:8580/pty/data-discovery/v1.1/classify`
67+
- **Semantic Guardrail API**: `http://localhost:8581/pty/semantic-guardrail/v1.1/conversations/messages/scan`
68+
- **Synthetic Data API**: New endpoints for synthetic data generation (when using synthetic profile)
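
The Classification API endpoint listed above could be called roughly as follows. This is a minimal sketch, not the project's documented client: the URL is taken from the changelog, while the helper name `build_classify_request` and the `text/plain` request body are assumptions that should be checked against the Data Discovery documentation.

```python
import urllib.request

# Endpoint URL as listed in the changelog entry above.
CLASSIFY_URL = "http://localhost:8580/pty/data-discovery/v1.1/classify"


def build_classify_request(text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the Classification API.

    Assumption: the service accepts the raw text as a UTF-8 text/plain body.
    """
    return urllib.request.Request(
        CLASSIFY_URL,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_classify_request("Jane Doe lives at 12 Main Street.")
    print(req.get_method(), req.full_url)
    # Sending the request requires the Developer Edition containers to be running:
    # with urllib.request.urlopen(req, timeout=10) as resp:
    #     print(resp.read().decode("utf-8"))
```

Building the request separately from sending it keeps the sketch usable without running containers.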
69+
70+
### 🔧 Enhanced Configuration & Service Features
71+
72+
#### Configuration Updates
73+
- **Expanded Entity Mapping**: Enhanced `config.json` with additional entity types
74+
- **Simplified Schema**: Streamlined configuration keys for easier customization
75+
- **Java Configuration Support**: Added `config.ini` format for Java samples
76+
77+
#### Service Health & Logging
78+
- **Improved Health Checks**: Enhanced service health verification procedures
79+
- **Better Logging**: Improved logging options and error messages across all services
80+
- **Restart Procedures**: Documented comprehensive docker compose management commands
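
A simple reachability check can complement the health verification mentioned above. The sketch below only tests whether a TCP connection can be opened; it is not the project's own health check, and the helper name `is_port_open` is hypothetical. Ports 8580 and 8581 are the ones listed under Service Endpoints.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Ports taken from the Service Endpoints section of this changelog.
    services = {"classification": 8580, "semantic guardrail": 8581}
    for name, port in services.items():
        state = "reachable" if is_port_open("localhost", port) else "not reachable"
        print(f"{name} (port {port}): {state}")
```

An open port only shows that something is listening; it does not confirm that the service behind it is ready to answer requests.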
81+
82+
### 🧑‍💻 Sample Applications Evolution
83+
84+
#### Java Sample Applications (NEW)
85+
- Complete Java implementation of all Python sample workflows
86+
- Maven-based build system with wrapper scripts
87+
- Fat JAR generation for easy distribution
88+
- Shell and batch scripts for cross-platform execution
89+
- Full source code available for customization
90+
91+
#### Python Sample Enhancements
92+
- Enhanced semantic guardrails samples with richer examples
93+
- New Jupyter notebooks for interactive exploration
94+
- Improved error handling and user feedback
95+
- Better documentation and inline comments
96+
97+
#### Jupyter Notebook Integration
98+
- **Semantic Guardrails Notebook**: Step-by-step guide for conversation scanning and risk assessment
99+
- **Synthetic Data Notebook**: Interactive guide for generating synthetic test data
100+
- **Prerequisites Documentation**: Clear instructions for Jupyter Lab setup
101+
102+
### 🤖 GenAI & AI Integration
103+
104+
#### Advanced AI Security Features
105+
- **Improved Risk Scoring**: Enhanced semantic guardrail capabilities for multi-turn conversations
106+
- **PII Scanning**: Advanced PII detection across conversation history
107+
- **Privacy in Conversational AI**: Better support for securing LLM interactions
108+
- **Prompt Sanitization**: Enhanced capabilities for cleaning LLM prompts
109+
110+
### 📚 Documentation & Developer Experience
111+
112+
#### Improved Getting Started Guides
113+
- **Python Setup**: Updated prerequisites and installation instructions
114+
- **Java Setup**: New comprehensive Java environment setup guide
115+
- **Feature Documentation**: Detailed documentation for all new features
116+
- **Troubleshooting**: Enhanced debugging guidance for common issues
117+
118+
#### Community Support
119+
- **Issue Reporting**: Clear guidelines for reporting issues with sample scripts
120+
- **Log Requirements**: Specified log snippet requirements for better issue resolution
121+
- **Example Code**: More comprehensive code examples across documentation
122+
123+
### ⚙️ Infrastructure & Operations
124+
125+
#### Docker Compose Improvements
126+
- **Profile-Based Orchestration**: Use `--profile synthetic` to enable synthetic data services
127+
- **Optimized Downloads**: Reduced container download times
128+
- **Better Resource Management**: Improved memory and CPU allocation
129+
- **Port Configuration**: Flexible port management with environment variable support
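
The profile-based startup described above can be scripted. The following sketch assembles the `docker compose` command line with an optional `--profile` flag; the helper name `compose_up_command` is hypothetical, and the detached `up -d` invocation is an assumption about how the services are typically started.

```python
from typing import List, Optional


def compose_up_command(profile: Optional[str] = None) -> List[str]:
    """Return the argv list for starting the stack, optionally with a profile."""
    cmd = ["docker", "compose"]
    if profile:
        # --profile is a global docker compose option and precedes the subcommand.
        cmd += ["--profile", profile]
    cmd += ["up", "-d"]
    return cmd


if __name__ == "__main__":
    # Default services only.
    print(" ".join(compose_up_command()))
    # Default services plus the synthetic data services.
    print(" ".join(compose_up_command("synthetic")))
```

The returned list can be passed to `subprocess.run(..., check=True)` from the directory that contains `docker-compose.yml`.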
130+
131+
### 🔄 Dependencies
132+
- Updated `requirements.txt` with latest compatible versions
133+
- Enhanced Maven dependencies for Java samples
134+
- Updated Docker image references to latest stable versions
135+
136+
### ⚠️ Breaking Changes
137+
None. This release maintains backward compatibility with 1.0.0.
138+
139+
---
140+
141+
## [1.0.0] - 2025-09-30
6142

7143
### 🎉 Major New Features
8144

@@ -152,7 +288,7 @@ export DEV_EDITION_API_KEY="<your_api_key>"
152288

153289
---
154290

155-
## [Previous Release] - README.md Baseline
291+
## [0.9.0] - README.md Baseline
156292

157293
### Features (Baseline)
158294
- Basic unstructured text classification and PII redaction
@@ -169,6 +305,3 @@ export DEV_EDITION_API_KEY="<your_api_key>"
169305
- Limited configuration options
170306
- Basic docker compose setup
171307

172-
---
173-
174-
*Note: This release represents a major evolution from a simple data discovery and redaction tool to a comprehensive data protection and AI security platform with advanced semantic guardrail capabilities, authentication systems, and multiple workflow options.*
