feat: Add Prompt Ops platform pages and reorganize PromptFooAI namespace

codekiln · codekiln · commit 0a6079dbbeb6 · 2025-11-06T15:50:12.000-05:00
- Add pages for PromptLayerAI, BraintrustAI, HumanloopAI, and PromptFooAI
- Create AI/LLM/Observability/Platform and AI/LLM/Ops/Platform concept pages
- Create AI/Security/Attack/Prompt/Injection page with alias
- Move PromptFooAI to its own top-level namespace (like LangSmith)
- Tag all Prompt Ops platforms with observability and ops platform tags
- Update journal entry with PromptFooAI reference
diff --git a/journals/2025_11_06.md b/journals/2025_11_06.md
@@ -1,13 +1,13 @@
-## [[LangSmith Evals]]
+## Langsmithery
 	- [[LangSmith/Eval/Q/Can an Annotation Queue be attached to an Evaluator]]
 	- Discovered that Custom Output Rendering is possible on a [[LangSmith/Annotation/Queue]]; see [[LangSmith/Docs/Custom Output Rendering]]
 	- [[LangSmith/Eval/Idea/Store One Annotation Queue Per Behavior]]
 	- [[LangSmith/Resource/Tag/Q/What types of resource tags are there]]
 	- [[LangSmith/Annotation/Queue/Q/What goes in the default dataset]]
-- ## The GitHub World
+- ## GitHubbery
 	- [[GitHub/Codespace/Q/Can OIDC Grant Access to AWS Bedrock as in GitHub Actions for Claude Code]]
-- ## [[AWS]] Resources
+- ## AWSery
 	- [[AWS/Solutions Library/Sample/Guidance for Claude Code with Amazon Bedrock]] created to document the AWS Solutions Library sample repository that demonstrates secure enterprise authentication for Amazon Bedrock using OIDC identity providers and AWS services
 	- [[AWS/IAM/Identity Center/Technique/Long-Lived Temporary Credentials]] - Notes on using AWS IAM Identity Center as an alternative to role chaining for longer-lived temporary credentials (1-12 hours vs 1 hour limit)
-- ## [[JIRA]] Resources
-	- [[JIRA/How To/Customize Email Notifications by Space]] - Documented that `/jira/settings/personal/notifications` is the path where one can customize email notification preferences by space in Atlassian products
+- ## AI Security
+	- [[PromptFooAI]] - Testing framework for LLM applications that helps detect prompt injection attacks and other security vulnerabilities
diff --git a/pages/AI___LLM___Observability___Platform.md b/pages/AI___LLM___Observability___Platform.md
@@ -0,0 +1,18 @@
+tags:: [[AI/LLM/Observability]]
+
+- # LLM Observability Platform
+	- Platforms that provide observability, monitoring, and tracing capabilities for LLM applications
+	- Key features typically include:
+		- Real-time monitoring of LLM usage
+		- Latency and performance tracking
+		- Cost tracking and analysis
+		- Log aggregation and search
+		- Trace visualization
+		- Error tracking and debugging
+	- ## Platforms
+		- [[PromptLayerAI]] - Prompt management, evaluations, and LLM observability
+		- [[BraintrustAI]] - AI observability platform with real-time monitoring
+		- [[HumanloopAI]] - LLM evaluation platform with observability features
+		- [[LangSmith]] - LLM observability and evaluation platform
+		- [[PromptFooAI]] - Testing framework with evals for LLM applications
+
diff --git a/pages/AI___LLM___Ops___Platform.md b/pages/AI___LLM___Ops___Platform.md
@@ -0,0 +1,21 @@
+tags:: [[AI/LLM/Ops]]
+
+- # LLM Ops Platform (Prompt Ops)
+	- Platforms that help with LLM operations, including prompt engineering, evaluations, versioning, and deployment
+	- Also known as "Prompt Ops" tools
+	- Key capabilities typically include:
+		- Prompt management and versioning
+		- Prompt engineering workflows
+		- Evaluation and testing frameworks
+		- A/B testing and experimentation
+		- Regression testing
+		- Dataset management
+		- Prompt deployment and rollback
+		- Collaboration tools for non-technical stakeholders
+	- ## Platforms
+		- [[PromptLayerAI]] - Your workbench for AI engineering with prompt management, evaluations, and observability
+		- [[BraintrustAI]] - AI observability platform with prompt engineering and batch testing
+		- [[HumanloopAI]] - LLM evaluation platform with prompt management
+		- [[PromptFooAI]] - Testing framework for detecting prompt injection attacks and security vulnerabilities
+		- [[LangSmith]] - LLM observability and evaluation platform
+
diff --git a/pages/AI___Security___Attack___Prompt___Injection.md b/pages/AI___Security___Attack___Prompt___Injection.md
@@ -0,0 +1,18 @@
+tags:: [[AI/Security/Attack]]
+alias:: [[Prompt Injection]]
+
+- # Prompt Injection
+	- A security vulnerability where malicious input is crafted to manipulate or override the intended behavior of an LLM application
+	- Attackers inject malicious instructions into prompts to:
+		- Bypass safety measures
+		- Extract sensitive information
+		- Manipulate the AI's behavior
+		- Gain unauthorized access to systems
+	- ## Types
+		- **Direct Prompt Injection**: Malicious input is directly inserted into user prompts
+		- **Indirect Prompt Injection**: Malicious content is embedded in data sources that the AI processes (e.g., web pages, documents, GitHub issues)
+	- ## Detection and Testing
+	- [[PromptFooAI]] - Testing framework for detecting prompt injection attacks
+- ## Related Concepts
+	- [[AI/Security/Attack/Toxic Agent Flow]] - Use of indirect prompt injection to trigger malicious tool use sequences
+
diff --git a/pages/BraintrustAI.md b/pages/BraintrustAI.md
@@ -0,0 +1,14 @@
+tags:: [[AI/Security]], [[AI/LLM/Observability/Platform]], [[AI/LLM/Ops/Platform]]
+
+- # BraintrustAI
+	- [Braintrust](https://www.braintrust.dev/) - AI observability platform
+		- Platform for building quality AI products
+		- Provides infrastructure for evaluating AI applications, monitoring performance, and ensuring reliable outputs
+		- Features include:
+			- Prompt engineering
+			- Batch testing
+			- Real-time monitoring
+			- Automated and human scoring
+			- Scalable log ingestion
+		- Helps teams iterate, evaluate, and deploy AI applications effectively
+
diff --git a/pages/HumanloopAI.md b/pages/HumanloopAI.md
@@ -0,0 +1,12 @@
+tags:: [[AI/Security]], [[AI/LLM/Observability/Platform]], [[AI/LLM/Ops/Platform]]
+
+- # HumanloopAI
+	- [Humanloop](https://humanloop.com/) - LLM evaluation platform
+		- Development platform for LLM applications
+		- Focused on enabling the safe and rapid adoption of AI
+		- Features include:
+			- Prompt management
+			- Evaluation
+			- Observability
+		- In 2025, Humanloop joined Anthropic to further their mission of enabling the safe and rapid adoption of AI
+
diff --git a/pages/PromptFooAI.md b/pages/PromptFooAI.md
@@ -0,0 +1,11 @@
+tags:: [[AI/Security]], [[AI/Security/Attack/Prompt/Injection]], [[AI/LLM/Observability/Platform]], [[AI/LLM/Ops/Platform]]
+
+- # PromptFooAI
+	- [PromptFoo](https://www.promptfoo.dev/) - Testing framework for LLM applications
+		- PromptFoo is a testing framework for LLM applications that helps detect prompt injection attacks and other security vulnerabilities
+		- Has evals (like [[LangSmith]])
+	- ## Mentions
+		- [[Person/Shawn @swyx Wang]] mentioned PromptFoo in [[Latent Space/Pod]] episode about Sander Schulhoff's The Prompt Report
+			- [Consistently use the OpenAI playground for prompting. (50sec)](https://share.snipd.com/snip/8f14b85f-3deb-4640-83f3-00c201541caf)
+			- swyx called out specialists in this area: **[[PromptLayerAI]], [[BraintrustAI]], [[PromptFooAI]], [[HumanloopAI]]**
+
diff --git a/pages/PromptLayerAI.md b/pages/PromptLayerAI.md
@@ -0,0 +1,18 @@
+tags:: [[AI/Security]], [[AI/LLM/Observability/Platform]], [[AI/LLM/Ops/Platform]]
+
+- # PromptLayerAI
+	- [PromptLayer](https://www.promptlayer.com/) - Your workbench for AI engineering
+		- Platform for prompt management, evaluations, and LLM observability
+		- Enables teams to version, test, and monitor prompts and agents with robust evals, tracing, and regression sets
+		- Empowers domain experts to collaborate in the visual editor
+		- Features include:
+			- Prompt management and versioning
+			- Visual prompt editor (no-code)
+			- A/B testing
+			- Evaluations and regression tests
+			- Observability and monitoring
+			- Dataset management
+			- Prompt chaining
+		- Model agnostic - one prompt template for every model
+		- SOC 2 Type 2 compliant, HIPAA compliant
+