Skip to content

Commit 07d18ae

Browse files
1 parent 81dfcf4 commit 07d18ae

File tree

109 files changed

+272
-217
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

109 files changed

+272
-217
lines changed

.buildinfo

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
11
# Sphinx build info version 1
22
# This file records the configuration used when building these files. When it is not found, a full rebuild will be done.
3-
config: 6fc6aeae794a640ba42173c25ef22770
3+
config: 0f9f2ae3adb0b7bbe8e8519024c84050
44
tags: 645f666f9bcd5a90fca523b33c5a78b7

_modules/index.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>Overview: module code &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>Overview: module code &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../_static/js/theme.js"></script>

_modules/pythainlp/ancient/aksonhan.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.ancient.aksonhan &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.ancient.aksonhan &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

_modules/pythainlp/ancient/currency.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.ancient.currency &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.ancient.currency &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

_modules/pythainlp/coref/core.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.coref.core &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.coref.core &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

_modules/pythainlp/corpus/common.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.corpus.common &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.corpus.common &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

_modules/pythainlp/corpus/core.html

Lines changed: 60 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.corpus.core &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.corpus.core &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>
@@ -116,6 +116,7 @@ <h1>Source code for pythainlp.corpus.core</h1><div class="highlight"><pre>
116116

117117
<span class="kn">import</span><span class="w"> </span><span class="nn">json</span>
118118
<span class="kn">import</span><span class="w"> </span><span class="nn">os</span>
119+
<span class="kn">import</span><span class="w"> </span><span class="nn">re</span>
119120
<span class="kn">from</span><span class="w"> </span><span class="nn">typing</span><span class="w"> </span><span class="kn">import</span> <span class="n">Union</span>
120121

121122
<span class="kn">from</span><span class="w"> </span><span class="nn">pythainlp</span><span class="w"> </span><span class="kn">import</span> <span class="n">__version__</span>
@@ -715,9 +716,6 @@ <h1>Source code for pythainlp.corpus.core</h1><div class="highlight"><pre>
715716
<span class="sd"> # FileNotFoundError: [Errno 2] No such file or directory:</span>
716717
<span class="sd"> # &#39;/usr/local/lib/python3.6/dist-packages/pythainlp/corpus/ttc&#39;</span>
717718
<span class="sd"> &quot;&quot;&quot;</span>
718-
<span class="k">if</span> <span class="n">_CHECK_MODE</span> <span class="o">==</span> <span class="s2">&quot;1&quot;</span><span class="p">:</span>
719-
<span class="nb">print</span><span class="p">(</span><span class="s2">&quot;PyThaiNLP is read-only mode. It can&#39;t remove corpus.&quot;</span><span class="p">)</span>
720-
<span class="k">return</span> <span class="kc">False</span>
721719
<span class="k">with</span> <span class="nb">open</span><span class="p">(</span><span class="n">corpus_db_path</span><span class="p">(),</span> <span class="s2">&quot;r&quot;</span><span class="p">,</span> <span class="n">encoding</span><span class="o">=</span><span class="s2">&quot;utf-8-sig&quot;</span><span class="p">)</span> <span class="k">as</span> <span class="n">f</span><span class="p">:</span>
722720
<span class="n">db</span> <span class="o">=</span> <span class="n">json</span><span class="o">.</span><span class="n">load</span><span class="p">(</span><span class="n">f</span><span class="p">)</span>
723721
<span class="n">data</span> <span class="o">=</span> <span class="p">[</span>
@@ -746,6 +744,63 @@ <h1>Source code for pythainlp.corpus.core</h1><div class="highlight"><pre>
746744

747745
<span class="k">def</span><span class="w"> </span><span class="nf">get_path_folder_corpus</span><span class="p">(</span><span class="n">name</span><span class="p">,</span> <span class="n">version</span><span class="p">,</span> <span class="o">*</span><span class="n">path</span><span class="p">):</span>
748746
<span class="k">return</span> <span class="n">os</span><span class="o">.</span><span class="n">path</span><span class="o">.</span><span class="n">join</span><span class="p">(</span><span class="n">get_corpus_path</span><span class="p">(</span><span class="n">name</span><span class="p">,</span> <span class="n">version</span><span class="p">),</span> <span class="o">*</span><span class="n">path</span><span class="p">)</span>
747+
748+
749+
<span class="k">def</span><span class="w"> </span><span class="nf">make_safe_directory_name</span><span class="p">(</span><span class="n">name</span><span class="p">:</span><span class="nb">str</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="nb">str</span><span class="p">:</span>
750+
<span class="w"> </span><span class="sd">&quot;&quot;&quot;</span>
751+
<span class="sd"> Make safe directory name</span>
752+
753+
<span class="sd"> :param str name: directory name</span>
754+
<span class="sd"> :return: safe directory name</span>
755+
<span class="sd"> :rtype: str</span>
756+
<span class="sd"> &quot;&quot;&quot;</span>
757+
<span class="c1"># Replace invalid characters with an underscore</span>
758+
<span class="n">safe_name</span> <span class="o">=</span> <span class="n">re</span><span class="o">.</span><span class="n">sub</span><span class="p">(</span><span class="sa">r</span><span class="s1">&#39;[&lt;&gt;:&quot;/</span><span class="se">\\</span><span class="s1">|?*]&#39;</span><span class="p">,</span> <span class="s1">&#39;_&#39;</span><span class="p">,</span> <span class="n">name</span><span class="p">)</span>
759+
<span class="c1"># Remove leading/trailing spaces or periods (especially important for Windows)</span>
760+
<span class="n">safe_name</span> <span class="o">=</span> <span class="n">safe_name</span><span class="o">.</span><span class="n">strip</span><span class="p">(</span><span class="s1">&#39; .&#39;</span><span class="p">)</span>
761+
<span class="c1"># Prevent names that are reserved on Windows</span>
762+
<span class="n">reserved_names</span> <span class="o">=</span> <span class="p">[</span><span class="s1">&#39;CON&#39;</span><span class="p">,</span> <span class="s1">&#39;PRN&#39;</span><span class="p">,</span> <span class="s1">&#39;AUX&#39;</span><span class="p">,</span> <span class="s1">&#39;NUL&#39;</span><span class="p">,</span> <span class="s1">&#39;COM1&#39;</span><span class="p">,</span> <span class="s1">&#39;COM2&#39;</span><span class="p">,</span> <span class="s1">&#39;COM3&#39;</span><span class="p">,</span> <span class="s1">&#39;COM4&#39;</span><span class="p">,</span> <span class="s1">&#39;COM5&#39;</span><span class="p">,</span> <span class="s1">&#39;COM6&#39;</span><span class="p">,</span> <span class="s1">&#39;COM7&#39;</span><span class="p">,</span> <span class="s1">&#39;COM8&#39;</span><span class="p">,</span> <span class="s1">&#39;COM9&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT1&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT2&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT3&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT4&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT5&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT6&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT7&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT8&#39;</span><span class="p">,</span> <span class="s1">&#39;LPT9&#39;</span><span class="p">]</span>
763+
<span class="k">if</span> <span class="n">safe_name</span><span class="o">.</span><span class="n">upper</span><span class="p">()</span> <span class="ow">in</span> <span class="n">reserved_names</span><span class="p">:</span>
764+
<span class="n">safe_name</span> <span class="o">=</span> <span class="sa">f</span><span class="s2">&quot;_</span><span class="si">{</span><span class="n">safe_name</span><span class="si">}</span><span class="s2">&quot;</span> <span class="c1"># Prepend underscore to avoid conflict</span>
765+
<span class="k">return</span> <span class="n">safe_name</span>
766+
767+
768+
<span class="k">def</span><span class="w"> </span><span class="nf">get_hf_hub</span><span class="p">(</span><span class="n">repo_id</span><span class="p">:</span><span class="nb">str</span><span class="p">,</span> <span class="n">filename</span><span class="p">:</span> <span class="nb">str</span><span class="o">=</span><span class="kc">None</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="nb">str</span><span class="p">:</span>
769+
<span class="w"> </span><span class="sd">&quot;&quot;&quot;</span>
770+
<span class="sd"> HuggingFace Hub in :mod:`pythainlp` data directory.</span>
771+
772+
<span class="sd"> :param str repo_id: repo_id</span>
773+
<span class="sd"> :param str filename: filename</span>
774+
<span class="sd"> :return: path</span>
775+
<span class="sd"> :rtype: str</span>
776+
<span class="sd"> &quot;&quot;&quot;</span>
777+
<span class="k">if</span> <span class="n">_CHECK_MODE</span> <span class="o">==</span> <span class="s2">&quot;1&quot;</span><span class="p">:</span>
778+
<span class="nb">print</span><span class="p">(</span><span class="s2">&quot;PyThaiNLP is read-only mode. It can&#39;t download.&quot;</span><span class="p">)</span>
779+
<span class="k">return</span> <span class="kc">False</span>
780+
<span class="k">try</span><span class="p">:</span>
781+
<span class="kn">from</span><span class="w"> </span><span class="nn">huggingface_hub</span><span class="w"> </span><span class="kn">import</span> <span class="n">hf_hub_download</span><span class="p">,</span> <span class="n">snapshot_download</span>
782+
<span class="k">except</span> <span class="ne">ModuleNotFoundError</span><span class="p">:</span>
783+
<span class="k">raise</span> <span class="ne">ModuleNotFoundError</span><span class="p">(</span><span class="s2">&quot;&quot;&quot;</span>
784+
<span class="s2"> huggingface-hub isn&#39;t found!</span>
785+
<span class="s2"> Please installing the package via &#39;pip install huggingface-hub&#39;.</span>
786+
<span class="s2"> &quot;&quot;&quot;</span><span class="p">)</span>
787+
<span class="k">except</span> <span class="ne">Exception</span> <span class="k">as</span> <span class="n">e</span><span class="p">:</span>
788+
<span class="k">raise</span> <span class="ne">Exception</span><span class="p">(</span><span class="sa">f</span><span class="s2">&quot;An unexpected error occurred: </span><span class="si">{</span><span class="n">e</span><span class="si">}</span><span class="s2">&quot;</span><span class="p">)</span>
789+
<span class="n">hf_root</span> <span class="o">=</span> <span class="n">get_full_data_path</span><span class="p">(</span><span class="s2">&quot;hf_models&quot;</span><span class="p">)</span>
790+
<span class="n">name_dir</span> <span class="o">=</span> <span class="n">make_safe_directory_name</span><span class="p">(</span><span class="n">repo_id</span><span class="p">)</span>
791+
<span class="n">root_project</span> <span class="o">=</span> <span class="n">os</span><span class="o">.</span><span class="n">path</span><span class="o">.</span><span class="n">join</span><span class="p">(</span><span class="n">hf_root</span><span class="p">,</span> <span class="n">name_dir</span><span class="p">)</span>
792+
<span class="k">if</span> <span class="n">filename</span><span class="o">!=</span><span class="kc">None</span><span class="p">:</span>
793+
<span class="n">output_path</span> <span class="o">=</span> <span class="n">hf_hub_download</span><span class="p">(</span>
794+
<span class="n">repo_id</span><span class="o">=</span><span class="n">repo_id</span><span class="p">,</span>
795+
<span class="n">filename</span><span class="o">=</span><span class="n">filename</span><span class="p">,</span>
796+
<span class="n">local_dir</span><span class="o">=</span><span class="n">root_project</span>
797+
<span class="p">)</span>
798+
<span class="k">else</span><span class="p">:</span>
799+
<span class="n">output_path</span> <span class="o">=</span> <span class="n">snapshot_download</span><span class="p">(</span>
800+
<span class="n">repo_id</span><span class="o">=</span><span class="n">repo_id</span><span class="p">,</span>
801+
<span class="n">local_dir</span><span class="o">=</span><span class="n">root_project</span>
802+
<span class="p">)</span>
803+
<span class="k">return</span> <span class="n">output_path</span>
749804
</pre></div>
750805

751806
</div>

_modules/pythainlp/corpus/oscar.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.corpus.oscar &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.corpus.oscar &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

_modules/pythainlp/corpus/th_en_translit.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@
55
<head>
66
<meta charset="utf-8" />
77
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
8-
<title>pythainlp.corpus.th_en_translit &mdash; PyThaiNLP 3ce57f7 documentation</title>
8+
<title>pythainlp.corpus.th_en_translit &mdash; PyThaiNLP f7df55d documentation</title>
99
<link rel="stylesheet" type="text/css" href="../../../_static/pygments.css?v=03e43079" />
1010
<link rel="stylesheet" type="text/css" href="../../../_static/css/theme.css?v=e59714d7" />
1111
<link rel="stylesheet" type="text/css" href="../../../_static/style.css?v=eea1f72d" />
1212

1313

1414
<script src="../../../_static/jquery.js?v=5d32c60e"></script>
1515
<script src="../../../_static/_sphinx_javascript_frameworks_compat.js?v=2cd50e6c"></script>
16-
<script src="../../../_static/documentation_options.js?v=2a7bd371"></script>
16+
<script src="../../../_static/documentation_options.js?v=e5bf50f9"></script>
1717
<script src="../../../_static/doctools.js?v=9bcbadda"></script>
1818
<script src="../../../_static/sphinx_highlight.js?v=dc90522c"></script>
1919
<script src="../../../_static/js/theme.js"></script>

0 commit comments

Comments
 (0)