Skip to content

Commit 04e1753

Browse files
Release v0.4.0 (#52)
1 parent d698b6d commit 04e1753

File tree

3 files changed

+27
-5
lines changed

3 files changed

+27
-5
lines changed

CHANGELOG.md

Lines changed: 24 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,29 @@ All notable changes to this project will be documented in this file.
55
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
66
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
77

8-
## [Unreleased]
8+
## [v0.4.0] - 2023-08-09
9+
10+
### Added
11+
12+
- Support for training a tokenizer from scratch. See `Tokenizers.Tokenizer.train_from_files/3`
13+
and `Tokenizers.Model` for available models.
14+
15+
- Support for changing tokenizer configuration, such as `Tokenizers.Tokenizer.set_padding/2`
16+
and `Tokenizers.Tokenizer.set_truncation/2`. See the "Configuration" functions group in
17+
`Tokenizers.Tokenizer`.
18+
19+
- Support for apply multiple encoding transformations without additional data copies,
20+
see `Tokenizers.Encoding.Transformation`. Transformations can be passed to
21+
`Tokenizers.Tokenizer.encode/3` via `:encoding_transformations` or applied via
22+
`Tokenizers.Encoding.transform/2`.
23+
24+
### Changed
25+
26+
- **(Breaking)** `Tokenizers.Tokenizer.encode/3` no longer accepts a batch of inputs,
27+
to encode a batch use `Tokenizers.Tokenizer.encode_batch/3` instead
28+
29+
- **(Breaking)** `Tokenizers.Tokenizer.decode/3` no longer accepts a batch of inputs,
30+
to encode a batch use `Tokenizers.Tokenizer.decode_batch/3` instead
931

1032
## [v0.3.2] - 2023-04-19
1133

@@ -61,7 +83,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
6183

6284
First release.
6385

64-
[Unreleased]: https://github.com/elixir-nx/tokenizers/compare/v0.3.2...HEAD
86+
[v0.4.0]: https://github.com/elixir-nx/tokenizers/compare/v0.3.2...v0.4.0
6587
[v0.3.2]: https://github.com/elixir-nx/tokenizers/compare/v0.3.1...v0.3.2
6688
[v0.3.1]: https://github.com/elixir-nx/tokenizers/compare/v0.3.0...v0.3.1
6789
[v0.3.0]: https://github.com/elixir-nx/tokenizers/compare/v0.2.0...v0.3.0

mix.exs

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ defmodule Tokenizers.MixProject do
22
use Mix.Project
33

44
@source_url "https://github.com/elixir-nx/tokenizers"
5-
@version "0.4.0-dev"
5+
@version "0.4.0"
66

77
def project do
88
[

notebooks/pretrained.livemd

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,10 @@
22

33
```elixir
44
Mix.install([
5-
{:kino, "~> 0.5.2"},
5+
{:kino, "~> 0.10.0"},
66
{:scidata, "~> 0.1.5"},
77
{:tokenizers, "~> 0.4.0"},
8-
{:nx, "~> 0.3"}
8+
{:nx, "~> 0.5"}
99
])
1010
```
1111

0 commit comments

Comments
 (0)