SmartParser

SmartParser is an open-source utility library designed to transform unstructured data — such as raw text or images — into structured, strongly-typed objects. By leveraging the power of Large Language Models (LLMs) like those from OpenAI, Smart-Parser helps you integrate AI-driven content parsing directly into your workflows or applications.

Setup & Installation

Prerequisites:

.NET 9.0 or later
Azure OpenAI endpoint URL and API key

Installation:

Add the Smart-Parser package via NuGet:

dotnet add package AppStream.SmartParser

Configure Services and Settings

Code-Based Configuration

services.AddSmartParser(
    options =>
    {
        options.DeploymentName = "MyDeployment";
        options.OpenAiEndpoint = "https://your-openai-endpoint.azure.com/";
        options.OpenAiCredentialKey = "my-key";
        options.HttpClientNetworkTimeoutSeconds = 30; // required
    },
    // Optional: customize chat completion behavior (defaults to Temperature = 0 for determinism)
    chatOptions =>
    {
        chatOptions.Temperature = 0.2;
        // configure other options if desired
    });

Environment Variables

services.AddSmartParser(options =>
{
    options.DeploymentName = Environment.GetEnvironmentVariable("DEPLOYMENT_NAME") ?? "DefaultDeployment";
    options.OpenAiEndpoint = Environment.GetEnvironmentVariable("OPENAI_ENDPOINT") ?? "https://your-openai-endpoint.azure.com/";
    options.OpenAiCredentialKey = Environment.GetEnvironmentVariable("OPENAI_CREDENTIAL_KEY") ?? "default-key";
    options.HttpClientNetworkTimeoutSeconds = int.TryParse(Environment.GetEnvironmentVariable("HTTP_CLIENT_NETWORK_TIMEOUT_SECONDS"), out var s) ? s : 30;
});

AppSettings File

appsettings.json sample:

{
  "SmartParser": {
    "DeploymentName": "MyDeployment",
    "OpenAiEndpoint": "https://your-openai-endpoint.azure.com/",
    "OpenAiCredentialKey": "my-key",
    "HttpClientNetworkTimeoutSeconds": 30
  }
}

var configuration = new ConfigurationBuilder()
    .AddJsonFile("appsettings.json")
    .Build();

services.AddSmartParser(configuration.GetSection("SmartParser").Bind);

Note: If you do not pass a custom chat options configuration, the library defaults to Temperature = 0 for deterministic outputs.

Important: Some models do not support setting temperature. To avoid setting it, pass an empty completion options configuration delegate so no temperature is applied:

services.AddSmartParser(
    options =>
    {
        options.DeploymentName = "MyDeployment";
        options.OpenAiEndpoint = "https://your-openai-endpoint.azure.com/";
        options.OpenAiCredentialKey = "my-key";
        options.HttpClientNetworkTimeoutSeconds = 30;
    },
    _ => { } // do not set Temperature to support models without it
);

Usage

Once you've configured SmartParser in your project, parsing is straightforward. Whether you're working with unstructured text or scanned images, SmartParser can help produce typed objects tailored to your needs.

Text

Simply pass your raw text content along with any guiding considerations, and SmartParser will return a typed object reflecting the extracted information.

public class SimpleResult
{
    public string Name { get; set; } = "";
    public int Age { get; set; }
    public string? Title { get; set; }
    public string Summary { get; set; } = "";
}

public class Example
{
    private readonly ISmartParser _smartParser;

    public Example(ISmartParser smartParser)
    {
        _smartParser = smartParser;
    }

    public async Task RunAsync(CancellationToken cancellationToken)
    {
        // Input text might be unstructured content describing a person
        var inputText = """
            John Doe is a 29-year-old software engineer who specializes in building 
            cross-platform mobile applications. He has worked at several leading 
            tech companies and contributed to a wide range of open-source projects.
            
            Recently, John has focused on building secure, user-friendly interfaces 
            that help everyday users understand complex data. He occasionally speaks 
            at industry conferences, discussing user experience and modern development 
            practices.
            """;

        var result = await _smartParser.ParseAsync<SimpleResult>(
            inputText,
            // Considerations (optional hints) for the parser:
            """
            - Keep the output factual.
            - If a title isn't clearly stated, set it to null.
            - Keep the summary as concise as possible.
            """,
            cancellationToken);

        if (result == null)
        {
            throw new InvalidOperationException("Parsing failed.");
        }

        Console.WriteLine($"Name: {result.Name}");
        Console.WriteLine($"Age: {result.Age}");
        Console.WriteLine($"Title: {(result.Title ?? "none")}");
        Console.WriteLine($"Summary: {result.Summary}");
    }
}

Image

Just like with text, you can supply an image URL and parsing considerations, and the parser will produce a typed result based on the visual content.

// Suppose this URL points to an image of a CV/resume
var cvImageUrl = "https://example.com/cv.jpg";

var result = await _smartParser.ParseImageAsync<SimpleResult>(
    cvImageUrl,
    """
    Extract the individual's name, approximate age (if stated), any professional title, and a concise summary of their experience.
    If a professional title is not clearly stated, set it to null.
    """,
    cancellationToken);

if (result == null)
{
    Console.WriteLine("CV parsing from image failed.");
    return;
}

Console.WriteLine($"Name: {result.Name}");
Console.WriteLine($"Age: {result.Age}");
Console.WriteLine($"Title: {(result.Title ?? "none")}");
Console.WriteLine($"Summary: {result.Summary}");

Image (binary data)

If you already have the image bytes (e.g., uploaded file), you can call the binary overload and optionally specify an image detail level:

var bytes = await File.ReadAllBytesAsync("cv.png", cancellationToken);
var imageData = BinaryData.FromBytes(bytes);

var result2 = await _smartParser.ParseImageAsync<SimpleResult>(
    imageData,
    mimeType: "image/png",
    imageDetailLevel: OpenAI.Chat.ChatImageDetailLevel.High,
    considerations: "Extract key profile details.",
    cancellationToken);

if (result2 == null)
{
    Console.WriteLine("CV parsing from binary image failed.");
    return;
}

Console.WriteLine($"Name: {result2.Name}");

Error handling

The parser throws exceptions when the model response is unusable or cannot be deserialized:

UnexpectedCompletionsResponseException: content filtered, max tokens reached, or empty content
ResponseDeserializationException: completion content could not be deserialized into the requested type

Contributing

Contributions to this open source library are highly appreciated! If you're interested in helping out, please feel free to submit a pull request with your changes. We welcome contributions of all kinds, whether it's bug fixes, new features, or just improving the documentation. Please ensure that your code is well-documented, tested, and adheres to the coding conventions used in the project. Don't hesitate to reach out if you have any questions or need help getting started. You can open an issue on GitHub or email us at [email protected] - we're happy to assist you in any way we can.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
assets		assets
samples/SimplestUseCase		samples/SimplestUseCase
src		src
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
smart-parser.sln		smart-parser.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SmartParser

Setup & Installation

Prerequisites:

Installation:

Configure Services and Settings

Code-Based Configuration

Environment Variables

AppSettings File

Usage

Text

Image

Image (binary data)

Error handling

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

Appstream-Studio/smart-parser

Folders and files

Latest commit

History

Repository files navigation

SmartParser

Setup & Installation

Prerequisites:

Installation:

Configure Services and Settings

Code-Based Configuration

Environment Variables

AppSettings File

Usage

Text

Image

Image (binary data)

Error handling

Contributing

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages