-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathChangeLog.old
More file actions
3128 lines (3128 loc) · 238 KB
/
ChangeLog.old
File metadata and controls
3128 lines (3128 loc) · 238 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
77c845d (HEAD -> master, origin/master, origin/HEAD) Fix flag elimination bug (reported in issue #342).
c827bb6 Do not call fsm_destroy until issue with 'double free or corruption' has been fixed.
3b0e58a Provide a get_current_dir_name() for platforms without one
b0fb7a6 Merge branch 'master' of https://github.com/hfst/hfst
65a4d79 Search for included files under scriptdir, not working dir. If an included filename is non-absolute, prefix it with the directory the script was in when trying to read it. This only happens in non-Windows environments for now.
f5e94d7 Free allocated memory.
6144e75 tokenise -g: check for eof
cf46e69 Apply more changes from latest release of foma.
a9f8397 Apply some changes from newest release of foma.
cc8c96b Make a separate directory python/pypi for creating pypi packages.
479789e Rename the pypi package to 'hfst' and use a four-number versioning scheme. Also define '__version__'. Add an empty config file as hfst c++ headers include it.
eb304f0 tests: trying sed instead of awk for exit on <STREAMCMD:FLUSH>
798383d tokenize: handle Apertium-style superblanks in --giella-cg format
7922399 Provide a way to scale the vector similarity calculations via variable
166d418 Merge branch 'master' of https://github.com/hfst/hfst
9bc5315 parse word2vec -style text format and assume first line is info
5d3f6b8 Do not allow unescaped dots in regular expressions.
ce5e2a5 Merge branch 'master' of https://github.com/hfst/hfst
a8f21f0 Observe HAVE_SFST and HAVE_FOMA when including back-end header files.
8a9c0de nwords catenate-style argument for Like() function
1433e45 Merge branch 'master' of https://github.com/hfst/hfst
1ec8e03 Add experimental two-vector model for word sense
72d4f72 (tag: v3.12.0) Ready for release 3.12.0.
da86647 Read vector models and do simple ranking
42df833 Small fixes to memory leaks and efficiency.
e2abd89 Fix some memory leaks.
da54acc Fix leaks reported by valgrind.
92cf4dc Fix more memory leaks.
9bd9749 Fix some lost memory issues reported by valgrind.
6e129a4 Remove unnecessary variable. Delete transducer definitions and latest regexp when destructor is called.
d700309 flush given outstream, not necessarily cout
8d4f824 Add a destructor to XfstCompiler.
6b10aab Update README.
863e94a Add missing files to dist and remove generated ones.
e4c1304 Rename hfst.rules into hfst.sfst_rules. Fix hfst.types into hfst.ImplementationType in tests.
80f18a8 Use single quotes instead of double quotes in docstrings.
52a9d75 Add test for hfst.xerox_rules.
00b020c Rename xerox_replace_left and xerox_restriction back to replace_left and restriction in package hfst.xerox_rules.
d1556dd Fix documentation of parameter 'output' of function 'extract_paths'.
3df4a48 Sentence separation for finnpos-mode
d3058c7 Issue a warning if automake is too old, which can make build fail if pre-generated yacc files are modified.
9166cdc Move ImplementationTypes and ReplaceTypes to their own classes and get rid of package hfst.types.
6d6c631 Fix datatypes used in for loops.
c6aa5e1 Fix some typos in documentation.
8073675 Use renamed versions of xerox rule functions.
ae70056 Tentatively add HfstXeroxRules to python API in package hfst.xerox_rules.
dee0176 Fix some more warnings.
37edbbb Fix minor warnings generated by clang.
faa7647 Fix typos in ifdef guards.
41c5a32 Add a simple script for fetching statistics about hfst downloads.
d9d8077 Update pypi setup.
822f4bb doh, forgot to add test expected output
cccba22 hfst-tokenise giella-cg: fix indentation issue with backtracking
5a26269 Fix still some warnings.
32c82b3 giella-cg backtracking: ensure ignored spaces are still output
03d6a23 Do not warn about pragma push/pop issues or unsafe cstring functions on windows
9c12094 Use hfst::hfst_fopen instead of fopen. Fix some warnings related to datatype conversions.
d64cb49 Add function hfst::hfst_fopen that uses fopen_s when compiling with msvc.
5ed27ea Fix some warnings caused by conversions (possible loss of data).
b93a144 Do not use extern templates when compiling with msvc.
73f7e18 Remove files generated by the tests.
61ac4f8 Add a batch file version of test.sh for testing the python bindings on windows.
b53a128 Fix warnings.
7cb8740 Update pypi setup script.
cc9cca4 Remove unnecessary back-end include directories as well as commented code from Makefiles.
fd5f007 Fix some minor warnings.
ec114ea --giella-cg support for @PMATCH_BACKTRACK@ symbol (#337)
15fbb1f Update windows script.
097826c Remove extra comma from initialization.
d6b001d Add missing header file.
9b3c122 Fix warnings generated when compiling with -Wall.
a0dee64 Fix most of the warnings generated when compiling with -Wall.
89b2061 rename --gtd to --giella-cg, and more descriptive help text
335df14 Add support for compiling on windows.
85585c4 Fix a typo.
e628f55 Add a manifest file for pypi packaging.
48c34ef Tentatively add a setup file for creating pypi packages.
6abde49 update to proc to make it possible to weight surface forms
5e379fc Update windows compilation script.
1f55a51 Define int64 for msvc.
21f52d0 Implement snprintf for msvc compilation.
c05469e Replace definition _Bool with Boolean, as _Bool conflicts with typedefs in msvc header files.
00bbb1d Get rid of unnecessary typenames and typos noticed during compiling HFST on windows.
b160c07 Forward declare back-end implementation datatypes and wrap their destructors.
067c3c9 Add missing <iterator> header.
2f0c8df Add missing std:: to vector.
34e99ad Issue an error message if glib unicode handler is requested but not found.
d28075c Declare all functions that are needed in start_xfst().
ec34dea Add math.h
4a3be6b Make preparations for supporting OpenFst version 1.5.4.
c9940f6 Include HfstBasicTransducer.h instead of HfstTransitionGraph.h.
8560638 Add extern templates.
dc96953 Remove unnecessary OpenFst headers and use forward declarations instead.
699c098 Wrap StdVectorFst and LogFst destructors as both classes are forward declared.
7f81f41 Forward declare as much as possible when using OpenFst library. Also remove unnecessary include directives and add missing std namespaces.
a0b2fb3 Prepend missing 'std::'.
c8137fa Make HfstBasicTransducer and HfstBasicTransition fully implemented classes instead of generated template specializations.
eb5b877 Improve the way error messages are handled.
b8278ec Change 'import libhfst' to 'import hfst'.
2b9e44b Move implementations of HfstTropicalTransducerTransitionData away from header file.
ce05c67 Merge branch 'master' of https://github.com/hfst/hfst
3ec0d09 Alter expected result to conform with newly expected behaviour
759605c Do not allow newline characters inside quoted literals.
f296b6b Merge branch 'master' of https://github.com/hfst/hfst
63d7544 Allocate enough memory for XRE error message.
f968ebf Trying out facility for communicating sentence boundaries to plain tokenization mode
6a78c4c Merge branch 'master' of https://github.com/hfst/hfst
bf2ded3 Don't print size info when unverbose
5a33841 Don't separate tokens with empty lines in plain tokenization mode
d0a1b55 Warn about undefined multichar symbols when compiling lexc in verbose mode.
5d2c67d Fix a typo in option handling.
e2fee81 Tentatively add an option --do-not-minimize to hfst-regexp2fst which performs determinization instead of minimization. Also make hfst-xfst use determinization if variable 'minimal' is set to 'OFF'. Implement this by adding function hfst::set_minimization(bool) to HFST API and a function HfstTransducer::optimize() which performs minimization, or determinization if hfst::set_minimization(false) has been called. Replace calls to minimize() with optimize() in parts of HFST code relevant to the tools mentioned.
82aa26e Get rid of CR-LF newlines.
44f2e21 Remove commented code.
94b7607 Print an error message if output type is defined several times.
966a895 Fix mismatched free/delete reported by valgrind.
6dc1d8f Do not use strdup with -std=c99.
407a517 Use delete_foma instead of delete/free when deallocating memory used by a foma transducer.
07508b5 Limit to C++11 for now
d15b4be Use free with malloc, not delete.
b693b3a Add a function 'void LexcCompiler::reset()' and use it in XfstCompiler when reading lexc input again.
bc5ac27 Remove revision info from output of hfst-info as it doesn't work correctly in git.
ace4801 Pass cascade argument by reference in lookup_cascading functions.
e2ea4cb Fix a typo in function 'strip_percents' which resulted in incorrectly escaped zeros.
92a7152 delete more lines
9f940a3 remove osx lines from yml
f7bd085 Unobservable change - nothing happened here
d1bf79b close #336: off-by-one segfaults in OlLetterTrie
2574e27 Avoid using projections in casing functions (which cause minimization issues with large weighted transducers).
73a3ff0 Parametrize optionality in casing functions
0adc1f9 Make sure there's no case fallthrough
58eb35d Fix other casing functions too
c366c4c Another fix to Cap()
704c1c7 Merge branch 'master' of https://github.com/hfst/hfst
5a5df2d Change operation of Cap() and make timing per-object
44f0ae7 Fix missing empty lines in output when cascading composition is used.
284f9d0 Add an option --max-number=N for OL transducers which limits the number of results. If not defined, defaults to 5 for infinitely ambiguous input and to infinity for finitely ambiguous input.
5686864 There are some problems in using the bindings with python version 2, so refer only to python3 in README.
5a17fb0 Add a missing 'break' to switch block, making variable infinite_cutoff work correctly.
4c2f26f If no results are found, print it only once.
d703776 Tentatively add support for --xfst=print-pairs for OL transducers.
b7fcab9 Remember to flush the streams also when no results are found.
7ba32cf Merge branch 'master' of https://github.com/hfst/hfst
bc7a80a Implement a pair-lookup for optimized-lookup transducers.
ce3133f Add a fourth argument of type hfst::ImplementationType to the constructor and a function get_type() to ImplementationTypeNotAvailableException.
4ffe105 Print to console on windows if print-pairs is specified.
587f3c0 Do not print extra newlines in cascading composition.
4353894 Fix a bug/typo in string comparison.
72de782 Improve documentation in HfstTransducer.lookup.
0c888e9 Rename HfstBasicTransducer.lookup_fd to HfstBasicTransducer.lookup and add an argument 'obey_flags'.
d5493ac Rename HfstTransitionGraph's lookup_fd to lookup and change some parameter names to be more descriptive.
56a8d2f Add instructions for linking hfst c++ library statically in comments.
67e25ff Add missing file to windows scripts.
8db14ce (tag: v3.11.0) Ready for release 3.11.0.
fe37aa0 Update morphology checks.
8d3f162 Update EXTRA_DIST.
6dac246 Do not use binary transducers in tests.
ee62c44 Add missing doc file.
a367645 Mention that python tests are no longer available.
da2678d Skip Python tests as they can now be performed in the folder python/test.
ea47f0b One more stray memory leaking cache
95e3480 Memory conservation: only cache named objects and always minimize them.
6b113b2 Merge branch 'master' of https://github.com/hfst/hfst
974d240 Turn off automatic delimiter-contexts ("set need-separators off") in the presence of user-defined contexts
01fccfd Use docstrings.i instead of swig's autodoc. Document the contents of hfst and hfst.exceptions modules.
4322ca7 Add a docstrings file created semiautomatically from doxygen documentation.
816ce6c Add at least a simple docstring for all functions.
82c8f6a Merge branch 'master' of https://github.com/hfst/hfst
1b11fbf Obey -i option for giving the pmatch archive filename
4620b1b std=c++ works on OS X but not Linux, so disable Linux+Clang
d245a74 Test if gnu++ appeases Travis' clang
3d9c66b Include needed m4
5e73db3 Copy hfst-ospell's C++11+ detection
9af66d0 short int -> short; long int -> long
daa457e Yet more documentation..
e1b01ff More docstrings.
711a089 Add docstrings.
acf5696 tokenise: don't skip output-empties
b5b4434 tokenise: don't let empty analyses affect weight classes
18f5a2b travis: fix indentation
62d216a getopt_long() returns an int, not char; this is important because char is unsigned on many platforms which means char(-1) is 255
39d28d1 Do not check if lookup is infinitely ambiguous if --time-cutoff is defined.
c42af37 Remove redundant flag diacritic check. It was once removed, but got pushed there again.
b7b5882 Add support for --cascade also for optimized-lookup transducers.
9169e38 Support combination --cascade=composition --xfst=print-pairs.
51a0b61 Merge branch 'master' of https://github.com/hfst/hfst
ef6ed81 Fix the way cycle limit argument is passed.
9ad0950 Merge branch 'master' of https://github.com/hfst/hfst
6d94c90 Add a preliminary omorfi-friendly conll-u mode
45a0bf2 Remove redundant filtering of paths containing flag diacritics. Flags are handled already in function HfstBasicTransducer::lookup_fd.
5c7135a Implement option --cascade.
18d5e66 Tentatively add option --cascade to hfst-lookup.
1dcad09 Print error message if --xfst=obey-flags is used with optimized-lookup transducers. They always obey flag diacritics.
670ee39 Merge branch 'master' of https://github.com/hfst/hfst
f903d9c When doing apply up or down in hfst-xfst's, check if 'obey-flags' is ON or OFF.
8e9c6e6 travis: --enable-silent-rules
a253b8d Add an argument obey_flags=false to is_lookup_infinitely_ambiguous functions.
14bdd9c Make is_lookup_infinitely_ambiguous aware of flags. Use typedef hfst::StringVector in HfstLookupFlagDiacritics to prevent conflicting definitions in hfst-twolc which defines a class StringVector.
349fb73 Add option obey_flags=false to lookup_fd.
fdf912a Add tentative script for converting from doxygen documentation to docstrings.
7618907 Modify function visibility.
bc01c0b Control visibility of functions in HfstTransducer.
b535aa6 Follow foma and xfst convention for operator ignore '/', i.e. expand any identities and unknowns before inserting the strings to be ignored.
c1b4341 Add an argument 'force' to HfstTransducer::harmonize which makes the function harmonize also FOMA_TYPE transducers.
724b4e9 Obey variable 'print-sigma' when print_net is called.
6bec507 Add brief descriptions for hfst packages.
ca892d1 Document the implementation types.
98764da Fix bugs in options --xfst=show-flags and --xfst=obey-flags when non-optimized lookup is performed.
93b9fe2 Close the outstream when done. Not doing this was causing a bug where some buffered data never got written to the output file.
7d97106 Obey variable 'print-pairs' when performing apply down in non-optimized format.
ac2fc3a Add parentheses in conditionals of function 'flag_build' to ensure correct interpretation. Also remove redundant code.
28867c1 Allow punctuation as context in the "need-separators on" mode
0cd05ee Add "need-separators no" to test script to reflect changed default in pmatch
2597799 exclude gcc from osx builds
7dfa8ec Track used symbol names
81128c2 Allow lowercase define()
52a4ca4 Merge branch 'master' of https://github.com/hfst/hfst
af71662 Add side arguments to casing functions
7c1b388 Revert back to the original extra_link_args in python bindings.
2d5c670 Rename examples.py to test_examples.py. Perform tests in a more logical order.
4c8a1e3 Update documentation about linking to the HFST C++ library.
9f8ca0f If --enable-no-tools is used, skip tests in test/tools/.
41dc9a2 Mention module hfst.types.
a8c3e56 Remember to apply modifications to weight pushing to tests and hfst package.
835f0ef Get rid of TO_FINAL_STATE and TO_INITIAL_STATE and replace function 'push_weights' with 'push_weights_to_start' and 'push_weights_to_end'.
d05b9db Wrap HfstTransducer implementation types in package 'hfst.types'.
5e1a245 Add missing files to dist.
58d4602 Update README. Handle linking to extension library better.
de888b6 Update README. Handle cases where an implementation type is not available. Add missing files to dist.
1a68126 Fix a typo in URL.
7a2296b Update link in README.
861baf6 tokeniser: --weight-classes=N opt, as in hfst-proc
151f1fa Add functions lookup_optimize, remove_optimization and copy to HfstTransducer.
4efc9af Make HfstTransducer() create the empty transducer.
0f237c1 Comment out hfst_ol::set_extract_tags_mode(bool). Improve error messages in att and prolog readers.
57426ca Remove unnecessary commented file.
b7139cd Fix HfstException documentation.
3b13bf5 Merge branch 'master' of https://github.com/hfst/hfst
7e6d27c Fix a typo.
9f4ceb7 Merge branch 'master' of https://github.com/hfst/hfst
829c9e2 Implement remaining variables in the library and support them in hfst-pmatch, with the exception of need-separators which has to be specified during rule compilation, at least for now.
457c5de Document AttReader and PrologReader.
8695c47 Add HfstException::what() to python api.
2cffffa Add an alias what() for operator() to HfstException.
5fdf61a Add class PrologReader and tests for it.
77eda55 Add a new class AttReader and tests for it.
18edac5 Add package hfst to setup. The package is essentially a wrapper around libhfst module which is still available as before.
e41f131 travis: re-encrypt channel to #hfst
6329bb8 Use all implementation formats in tests.
5a5018c Do not use default_fst_type as pmatch transducers are always in ol format.
b38f87b Add stream tests.
ccaea0e Represent empty transducers as empty lines so that read_att_transducer will work correctly.
fe66b18 Remove unnecessary files.
bf5511d Forgot to do git add before committing...
3710f91 Update test examples.py and add it to test.sh. Remove extra files.
7d5064d Add directory tree for package 'hfst' and its subpackages 'exceptions' and 'rules'. This package encapsulates module libhfst.
7818c1e Delete unnecessary file.
91b86a0 Rewrite part of tests. Document the interface file.
657013d Add new tests.
3b4c186 Merge pull request #323 from unhammer/tokenise--gtd-tags-are-multichars
67125ab rm unneeded import
d927d79 pmatch: store input/output_symbol_strings, _parts refer to them
c665f8a Add missing endverbatim tag.
884d1dc Add missing file.
bc87fc5 Get rid of HfstFile wrapper.
30cde02 Try to preserve the names of transducers when converting.
90b3328 Expand prolog tests. Fix a small bug in naming transducers.
8e61537 Forgot to update testfile.
63a0456 Add a new function for reading prolog input and improve the way att input is handled.
9ef38f7 Split part of code related to reading att input into a separate function. Change the way transducer name is set when reading prolog input.
6d12e0d erge branch 'master' of https://github.com/hfst/hfst
22f1c47 Modify documentation, update tests, add new functions.
d14bc2f link to travis; bug tracker link sf→github
e8c7fd6 README is now symlink to README.rst; for more readable github
2258354 travis: no irc notes on success
1d8a1f4 travis: encrypt IRC channel for hfst/hfst
38c9544 Add HfstTokenizer examples as tests.
c635cd4 More documentation to HfstTokenizer.
c1ca3b6 Add more examples for HfstTokenizer.
0dc0cf4 Reorganize doxygen documentation to represent the new module structure of hfst python api.
95d658c Remove hfst.py, a directory structure is needed instead for doxygen.
5025aab Add new documentation for hfst python package.
4749745 Add more tests.
8f8b345 Split python tests to several files.
45aeb38 Skip one more failing pmatch test.
83dee76 Add missing files to dist.
a44b99e erge branch 'master' of https://github.com/hfst/hfst
b5ca1ec Add more documentation and modify functions.
579ee71 Merge branch 'master' of https://github.com/hfst/hfst
8050575 A PmatchContainer constructor taking a std::vector<HfstTransducer>. This should leave the passed vector untouched.
88aa682 Add more documentation.
ab82047 Add a simple test for XreCompiler.
a25eb2b A new test.
caf1ca8 XreCompiler::setOutputToConsole is needed, uncomment it.
f6acf2a Split c++ code to multiple files from libhfst.i. Update URL in setup.py.
780374a Merge branch 'master' of https://github.com/hfst/hfst
4c9ca19 Settle on a way to handle variable names with dashes (which look like minus signs)
967cc78 actually run the tokenize tests
5df87b6 Add more failing tests.
775a499 Fix some typos.
2e77e2d Add missing files to dist.
11ebca0 Merge branch 'master' of https://github.com/hfst/hfst
067e637 Merging master
4c9d43a Variable setting and reading
881b81c Use -Wno-deprecated also for tests.
85cb277 Modify messages that get printed during make check.
0761730 Skip tests that do not exist. Make prints more informative.
573324e Skip failing pmatch tests.
164e621 Add missing tests for ConvertTransducerFormat. Compile them with -Wno-deprecated.
f945e17 travis: irc notifications on build failure
0229baa Merge branch 'master' of https://github.com/hfst/hfst
93ed0db Remove const declarations from openfst back-end that cause compiler warnings.
5df2a2b Run g++ with -j2 (not -j) on travis; o/w runs out of memory
54b8f45 Comment out a function that is not defined.
48c91b8 Specify -Wno-deprecated also for tests in folders hfst-tagger and hfst-twolc.
810c330 https://travis-ci.org test rules
708171e doh, define multichar-symbols (for 2eb6888)
2eb6888 test for #320 / bf26db9
3aecdbc Merge branch 'master' of github.com:hfst/hfst
bf26db9 only change case if sf and lm case differ (#320)
b3782de Remove a rule that cannot be matched since a rule has already been defined for "DownCase(" earlier.
5ad3487 Add the keyword const to arrays used to store xfst commands.
79c7dfe Make compiler happy by using strdup instead of plain const char *.
f6d9953 Handle return values of fread in foma and sfst back-ends to prevent warnings during compilation.
214b7b6 Fix a typo.
862e858 Replace foma binary files that are used for testing results from lexc with equivalent files in prolog format. Then the tests will not depend on the foma back-end.
680389b tokenize tests
2b7cf7e tokenize: new --gtd Giellatekno/Divvun mode
adb37d4 tokenise: fix #318, header parsing for TOP
76e9069 Fix a bug in flag elimination involving multiple flags. Fixes bug #315.
f653780 Evaluate transducers (as opposed to argumentless functions) where declared
e6c6e82 (tag: v3.10.0) Ready for release 3.10.0.
97c1285 Swap implementations of commands 'apply up' and 'apply down' in hfst-xfst, making them work as they do in foma and xfst.
85ef34c Forgot to add before latest commit...
13280ea Edit warning about converting native foma transducers.
b31a6c1 Fix typo: swap left and right.
808d984 Add hfst-flookup to windows scripts.
66db285 Update bug tracker url and copyright year.
b127ab3 Git ignore hfst-flookup.
1b40de6 Add tests for hfst-flookup.
12e4cca Trandsducer operations no longer return a reference to the transducer, fix this in tests and interface operations.
a128c7f Add a tool hfst-flookup which does lookup from right to left, i.e. in the same way as foma's flookup and xfst's lookup. hfst-lookup and hfst-optimized-lookup stay as they are, but mention in their help message that they do lookup in the opposite direction.
cea21b4 Add a comment about the way gzipped input is handled.
da7ebcc Make HfstInputStream constructor throw a FileIsInGZFormatException if input is in gzipped format. In hfst-fst2fst, issue an error message that asks user to unzip the transducer.
5c2755b Merge commit frankier:add-gitignore manually and also add more files to be ignored by git.
99b470d Merge pull request #313 from frankier/escape-dot-labels
f6905a1 Improve handling of argumentless functions and funcalls
d804d3d Remove some memory leaks in compilation
ef740e0 Fix nested funcall issues
61e1e0a Add optional versions of cap functions and AnyCase().
68e078b Corrections and additions to casing functions, and add Implode() and Explode()
2484633 Add a facility for detecting when hfst-tokenize is given a "bare" dictionary and generating a somewhat intelligent tokenizer when that happens. Due to awkward architectural reasons this involves changes and additions in many parts of the library, but they are not major.
089f3ca Add .gitignore
d1d81db Merge branch 'master' of https://github.com/hfst/hfst
485f575 Fix bug that caused exclusionary lists to be missed much of the time.
03ff992 Escape " in dot labels.
9984096 Factor out replace_all into string-utils.cc.
ca10d77 Revert swapping of hfst-xfst's 'apply up' and 'apply down' commands.
7352cf7 Swap implementations of 'apply up' and 'apply down' in hfst-xfst, so that these commands work in the same way as in foma and xfst.
db2d4dc Add an example for HfstTransducer constructor.
f37b8d1 Tentatively implement lookup also for transducers that are not in optimized-lookup format using composition and path extraction.
1cb2992 Add an HfstTransducer constructor that takes a StringVector as argument.
f32c049 Support python 2 unicode strings in lookup.
8d2c6b8 Fix a typo
f442b91 Make HfstInputStream iterable by implementing functions __iter__, __next__ (for Python 3) and next (for Python 2).
2c660ff Add functions in namespace hfst::rules to python API.
947bb62 Add some information about Python interface in README and advise the users to see file python/README.
067b0c2 Remove slashes from headers used when processing the file.
c4b5ce8 Update documentation.
0e4e8a7 Redefine functions which return a reference to themselves as void to avoid problems on the python side.
5f97eac Tentatively add a C++ function 'std::vector<hfst::HfstTransducer> compile_pmatch_expression(const std::string & pmatch)', mostly copied from hfst-pmatch2fst.cc.
21da500 Add a const declaration to string& argument of PmatchContainer::process and PmatchContainer::match.
cd7f2f8 Tentatively add some pmatch functions. Add a missing typedef for HfstTransducerVector.
a9fbd44 Add a function to HfstInputStream for finding out whether hfst headers are included in transducers. When reading native foma transducers in hfst-fst2fst, warn that inversion may be needed for lookup to work as expected.
fa4565e (tag: v3.9.2) Release 3.9.2.
3f7a57e Update hfst binary file used for tests.
5cd8a71 Add option --pythonpath to python tests. Document the test tools better.
927da9c Make fst2txt functionality test silent.
ad8c82c Update installation check scripts.
ddf749d Remove test.py from EXTRA_DIST, the tests have been rewritten.
dd7d0be Merge pull request #309 from unhammer/pmatch-input-marks
97b6e3f Merge pull request #308 from unhammer/tokenise-minor-cg-fix
3781806 Make small modifications to documentation and typedefs used in function declarations/definitions, so that doxygen can generate documentation right.
00d89ce Update Python and C++ API documentation files.
4319b94 Move api documentation file under folder 'doc' too.
67e1e8c Move C++ documentation file under folder 'doc'.
8e2e651 Do not catch exceptions derived from HfstException inside the constructor of HfstInputStream and throw them as HfstException. Instead, throw them as such.
56ffca8 pmatch: special symbol @PMATCH_INPUT_MARK@ for input marks
90aa695 pmatch: on seeing a special symbol, mark the offset in the input string
bc345c6 CG forms always go in "<>", not before the tab
8dceb24 Merge pull request #307 from unhammer/tokenise-short-options-missing
2c207de short option -m missing from getopt_long
a9314ad Add m4 for older distros
6c944ba Bump patch version since revision has decreased
859f8da Add builting function Interpolate()
c6f9908 Fixed condition for including symbols in exclusionary lists Rather than the printability vector, which may not be completely set yet, check the symbol itself for printability.
780b9af Fix bug involving stale caches of function call results
26d3d8f Change python3 into python in README as version 2 works now too.
7af6f3f Move tools/src/hfst-string-conversions.[cc|h] under libhfst/src/.
a1b5bb2 Update version number to 3.9.1.
d23eb93 Do something useful in the case of a missing root rule
9e4e663 Make Makefiles in openfst back-end licensed as LGPL.
03e1949 Candidate for release 3.9.1.
27f80ee Fix names of files whose names were changed.
8e4f324 Set server based search to NO.
5b2c698 Add word boundaries in xerox mode.
6e78e1f Fix copypasted typo
b6970ab Fix ToLower and ToUpper, which had transposed names and flawed implementations
c939413 Use iterator instead of range-based loop on windows.
e5897fb Fix a couple of typos inside 'ifdef _MSC_VER' detected when compiling on windows.
7694ee0 License for the library code is now lgpl 3.
b8a5b98 LGPL license
a031526 Compatibility with python2.
1baee13 Add exclusionary lists (eg. Exc(Whitespace) matches symbols not in Whitespace)
a92e1d0 -X option fixed in hfst-pair-test. Additionally, the user gets an error message if unescaped :-symbols were found.
8d0a028 Fix a bug in HfstTransitionGraph::find_replacements. Implement compile-replace without composition in XfstCompiler to avoid alignment issues in result.
887381e Fix epsilon handling in compile-replace expressions.
2891a27 Allow epsilons inside compile-replace expressions as xfst does.
bcfea4a Warn about '@_EPSILON_SYMBOL_@' not being an ordinary symbol in hfst only in verbose mode.
5fc456b Allow space after operator '^' in regular expressions.
0758730 Fix binding precedence of :
9f4628e Add throw to appease rpmlint no-return-in-nonvoid-function
78efc5d Fix a bug in function 'fst'.
23a5285 Print a blank line even if there were no results or if tokenization failed
73d317b One && too many
5f4a8fc Rename hfst-proc2 to hfst-tokenize, with hfst-proc2 and hfst-tokenise as aliases
732e37e Add tests for python bindings.
51cbc5e Complete rewrite of pmatch compilation This is a cleanup and rewrite of pmatch compilation, hopefully allowing easier extensibility and maintainability in the future. While this has been reasonably well tested, it is likely that there are some regressions; please report any. Various additional features are pending.
31de91e Move scripts under subdirectories and remove scripts that are not needed.
6156215 Move windows scripts under scripts/windows. Get rid of libhfst_win.i.
0bf33d7 Remove HfstFastTransition.h also from dist.
02fa326 Remove HfstException.cc and HfstException.h files, HfstExceptionDefs.cc and HfstExceptionDefs.h are used instead.
9356517 Remove class HfstFastTransitionData and the template HfstFastTransducer that uses it. HfstBasicTransducer is the template of HfstTransitionGraph that is used everywhere and it is fast enough.
dd6873e Remove tentative reentrant version of xre compiler. The normal xre compiler can handle expressions that the reentrant one was originally intended for.
05c78d8 Update python API documentation.
f933525 Remove old commented code. Add a warning about python bindings not being under autotools; advise users to see python/README.
b127d48 Set -std=c++0x unless a higher standard is required with the -std option.
f2738be Do not generate hfst-scanner header file. Make hfst-scanner depend on hfst-compiler header file to make sure they are generated in the right order.
a44eb61 Check automake version in configure to determine whether header files generated by yacc/bison should use extension .h (automake < 1.12) or .hh (automake >= 1.12).
7f206c0 call_counter is a member of the container, not the transducer
2aeda9f Reweight arcs iff all conditions given with --input-symbol, --output-symbol and --symbol are met.
fddc9ea Forgot to increment call counter
a56089b Only call clock() every millionth time
1c4c871 Add missing declarations
a710ddd Only check clock() every million calls to get_analyses() to save time
8cffb74 Move start_clock setting to where the line is read to make the time limit per-analysis
6488877 Add short option for time cutoff (-t)
b6fdeb6 In pmatch, make the time cutoff "soft" to ensure we at least return something (this was an issue for rulesets with RTNs only - all the time could be spent in one leaving nothing for anything else, causing needless failure)
70e025f Ignore previous commit, I had some old library code confusing me
8dfcd66 Add time cutoff options
8a9f0c3 Fix what seems like a typo in hfst-lookup, &infinite_cutoff for infinite_cutoff It seems like the size_t and size_t* can substitute for each other, but the prototype is for size_t.
2134ef4 Fix time cutoff check in ol library code
8ff4aa3 Omit infinitely ambiguous check when we have a time cutoff
969d479 Fix time cutoff handling
dd30c33 Switch from !€ / !$ to !!€ / !!$ as the test case prefixes, to bring it in line with the conventions in the rest of the Giella infrastructure (where this testing setup is used).
fd175f1 Fix option --to-transducer of hfst-substitute.
d213839 Update HfstTransducer::priority_union declaration. It takes only one argument instead of three.
0dec506 Add new test files to dist.
7915ca0 Priority union bug solved.
daee8f7 Update output of morphology tests.
1148413 Add a check for hfst-ospell.
a179a32 Add new replace markup tests to dist.
c56d1f0 Markup rules completely changed
cb9f08a Update installation check.
5a174af Finally ready for release 3.9.0.
0ec3e97 Add missing files to dist.
61d1d14 fix for --xerox bug#328 introduced by weight fix
b6e49fd Revert -r4579 "Revert changes to hfst-proc made in revision 4427. This will (temporarily) fix bug #328."
f0e85c8 Remember to update the man pages, too.
e31c5d9 Ready for release 3.9.0.
64c985c Revert changes to hfst-proc made in revision 4427. This will (temporarily) fix bug #328.
9ca8ab0 Change all pythoncode delimiters from '{' and '}' to '%{' and '%}'.
128331b Add two missing %
7b961c5 Add tests for hfst-substitute.
67f3dce Add tests where one or both inputs are archives.
26aa331 Set name of composition transducer after composition is called.
1e2ed00 Add tests for binary tools that use input streams whose types differ.
4c1e655 Make HfstTransducer::compose throw and exception if transducers have mismatching types.
e21378d Implement conversion of mismatching transducers also in hfst-substitute.
43e107d Make binary command line tools convert mismatching input streams into common format.
573609f Make 'harmonize_flag_diacritics' throw an exception if transducer types differ. Make 'is_safe_conversion' public.
d750847 Fix a bug in hfst-regexp2fst related to comments and remove support for legacy way of defining weights from regexp parser.
ee3b02c Xre parser - added check if contexts in replace rules are automata
d5258af Implement commands 'view net' and 'write dot' as system calls for non-windows platforms.
7e041e8 Update windows scripts so that they will work with the latest changes in dot and pckimmo printing functions.
8cb95f8 Move dot and pckimmo related functions in their own files.
70199c8 Throw an exception in HfstTransducer::eliminate_flag if flag feature does not occur in the transducer or the flag includes a value or operator.
6d723ca Add missing file
aa6325f Move python tests to their own directory.
bf0db23 Rename directory 'swig' to 'old_python' and directory 'new_python_api' to 'python'.
da9b806 Add information about the possible values of option --xerox-composition to the help message.
f068284 Modify infinite_cutoff and time_cutoff arguments in lookup functions.
8645f60 Lexc, removed option -M + cosmetic changes to the alignment
95e18d3 Handle standard and console streams when libhfst is run from IDLE.
1275140 Throw a FunctionNotImplementedException in HfstTransducer::lookup_fd instead of converting HfstTransducer automatically into HfstBasicTransducer.
7d11276 Update python interface.
dc5a1f4 Added option -A to lexc, it aligns same input and output transitions
1d3e8af Fix a minor bug in python interface on windows.
897a7dd Fix LexcCompiler::printConnectedness const declaration.
de59f0e Replace rules support flag diacritics (fixed lexc with -FM flags)
6817b35 Use separate ostringstreams for standard output and standard error on Windows.
f30e6e9 Improvements to error and output stream handling.
1178005 Support printing to console on Windows.
0f0dd44 Update foma back-end composition algorithm to fix a bug noticed in lexc tests.
6e93227 Replace system exit calls with HfstFatalExceptions.
c73aa33 Check for epsilon cycles with negative weights.
4679a3a Optimize function 'has_negative_epsilon_cycles'.
3bd5ce0 Add a test for checking that unknowns are not expanded into flag diacritics.
6d803df Add option 'check-negative-epsilon-cycles' to hfst-txt2fst and tests for it.
a678e81 Add function HfstTransitionGraph::has_negative_epsilon_cycles.
4405b78 Improve performance of transducer harmonization.
12d3d8e Time cutoff for the bundled standalone optimized-lookup tool too
05e7b6b Add parameter to control output in 'compile_lexc_file'.
879d01e Add function unsigned int LexcCompiler::getVerbosity().
acd442e Allow user to control output in xre and xfst compilation.
cf2a1d7 Improve error handling in lexc compiler.
688823f Improve error handling mechanism in xre and xfst parsers.
083f848 Add a missing period to the end of symbol lines when printing in prolog format.
50f7ebf Added warning about very slow lookup when using HfstTransducer::lookup_fd.
0cf9553 Added warning about very slow lookup when using HfstTransducer::lookup_fd.
c8458dc Added possibility to transform individual transition weights in HfstTransitionGraph.
fb50256 Add option to suppress multichar tokenization and default to on in proc2
8e370a6 Replace part of XfstCompiler's FILE pointers with equivalent calls to ostream.
5a773d1 Tentatively move towards o(string)streams instead of FILE pointers when printing output.
b792ca8 Remove debug printing
42585f5 Implement a time-based search cutoff for hfst-lookup in optimized-lookup mode only
2aafec3 Make 'define_function_args' return a bool instead of calling exit on error in xre parser.
18d6b12 Fix error handling in xre and xfst compilers.
d1f2b2a Rewrite regular expression examples to be more informative.
52dc82e Scale weights to be non-negative in function n_best.
21d8577 Update documentation.
5c71e4b Small fixes to read_att_string.
727e74f Update documentation about special symbols. Add a function for reading AT&T strings.
7b2e11d Add function start_xfst() that starts an interactive XFST compiler.
ca3143c Update documentation between HFST and backend conversions and add the examples to tests.
9ae4311 Fix issues related to level of verbosity when compiling xfst and regexps. Add missing documentation.
94e7535 Merging of modified or added functionality from the xre parser
116a076 Remove obsolete comment
9130ba1 Revert HfstXeroxRules files to version 4484.
04d3a64 Rewrite 'compile_lexc_file'.
4988bb6 Improve handling of error and exit conditions in xfst.
4ba4909 Add missing namespace: 'string' -> 'std::string'.
b3c446d Greatly reduce memory leaks when using hfst-ol format
f9b44a4 Update new python API.
3239aa9 Update windows scripts after moving XfstCompiler.
ba65f77 Move XfstCompiler from tools/src/parsers to libhfst/src/parsers.
330447d Add three missing files to dist.
2570d1f Add flushing to allow instrumentation of hfst-lookup
5a1307d Replace rules now should support Flag diacritics.
571878b Fixed lenient composition (it used to be unimplemented).
209e969 Move HfstStrings2FstTokenizer from tools to libhfst directory.
9105ea6 Use the actual transition data object when copying transitions in conversion.
7b06fa5 Fix a warning about comparison of signed and unsigned value.
4d67d87 Throw StateIndexOutOfBounds exception in get_final_weight, if state does not exist.
4c4f56c Add examples used in documentation to the tests.
8a37298 Test examples and fix them, if needed.
12825fe Add missing documentation and functions.
b2aa4b1 Add a function that takes tokenized input and returns a transducer.
b3da97a Define new functions for creating transducers from string, string list and dictionary.
c6ebab5 Fix some issues in the new python interface.
4be6a9d Add more documentation and tests to the new python interface.
3b312a3 Add aliases for different substitution functions.
ca7834b Modify substitution functions to be more pythonic.
21c7d01 Add more stuff to the new python interface.
5c94995 Fix output of hfst-xfst's command 'help'.
fb4d2ef Fix small bugs in hfst-xfst.
21f9fbb Modify also determinization function so that negative weights are handled correctly.
9b42c70 Added more documentation.
833f6c6 Added more documentation.
ceb8de6 fix NUL flush, bug #240
5ed671d Add function __iter__() to libhfst.HfstBasicTransducer.
323778a Modified libhfst files.
6bc7b3f Create directory for new python API.
d929a12 Update Doxygen HFST version number.
39b7bd7 Update README and Doxygen documentation.
a7efde8 Add some tests with negative weights for contain, merge and parallel rule operators.
8bda879 Modify minimization of weighted transducers so that negative weights are handled correctly.
e7f3b91 Always include 'generate-cc-files.sh' script in dist package.
be9d1ad Do not allow omitting the question mark on either or both sides of colon in regular expressions. Xerox has also removed this feature from their regexp syntax. Interpret expressions such as [foo : bar], [foo: bar] and [foo :bar] as [foo:bar]. Expressions of type [foo:], [:bar] and [:] will yield an error.
98c8423 Test if pmatch-tests.sh.* files exist before removing them to avoid 'rm: cannot remove:' messages.
b8b1039 Remove foma license warning. The current license of foma has no compatibility issues.
13553bd Make some changes for the python interface. Move HfstFile wrapper class to the swig interface file. Make HfstStates and HfstTransitions typedefs public. Add a destructor for HfstTransition. Update version number to 3.8.3.
c64206b Handle a frequently encountered exception
c48b828 Reverted changes done for class HfstFile in revision 4431.
a8cf4ea Removed HfstFile wrapper class and functions using it. A better place for them is in the swig directory.
65b418c Added destructor for class HfstException. It needs to be defined for SWIG.
9965217 tests for --weight-classes 1 and ordering by weights in proc
3e74d14 make tests pass again (only analysis order changed here)
a882fef use std::vector as ProcResult to get output sorted by weight-class
a0bcb70 fix --weight-classes, bug #308
bd9ac6b Now the script generate-cc-files.sh creates also lexer files.
c37b24c Add $(EXEEXT) for Windows
2fa67a4 More Foma fixes for cross-building and clang
aa58fe3 Small modifications to foma code: struct and variable had the same name.
f0b6c85 Updated to foma version 0.9.18alpha.
d566985 GNU address removal
638efc5 Fixed some memory leaks in hfst-compare reported in bug #167.
4a7ea0c Adding the renamed files.
28182f1 Trying to rename README files...
cdd1ad3 Updated scripts.
6906e45 Now all files in this directory are listed in README
ab78a6b Added a README file for the scripts directory.
2374665 Fix help message
445eeb1 Fix containment, implement some additional wanted operations
892fdd8 Ready for release 3.8.3.
4eab0f4 Updated windows scripts.
9d0aba0 Now a warning message gets printed if stack contains transducers whose types differ (can happen if lookup-optimize is called before pushing another transducer onto stack).
f7a8ce8 Added a windows README file for the package that contains eight command line tools.
e4977d4 Added a Windows README file for hfst-xfst.
9e2ef69 Added tests for windows. Made small fixes to hfst-proc and hfst-twolc for windows.
3b3a397 Replaced alternative tokens for logical operators with the standard ones.
65aa949 Checking for existence of config.h and getopt.h.
f2f7fc3 Updated scripts.
aa1f4df Added make scripts for twolc.
e33968b Added twolc to windows compilation scripts.
c9c02b9 Updated windows scripts.
91a1945 Fixing again reading and writing to/from console on windows.
3fd62f3 Using hex values for unicode characters when compiling with msvc to get rid of warnings (and possible segfaults).
023e533 Terminator needs reading, and yes there are partial reads
8575c6d std::string all the way
4ba02e9 Added missing HAVE_CONFIG_H ifdef. Added globals-common.h header as some compilers complain about missing extern variables. Now creating variable size array 'next_u8' dynamically with operator 'new'.
0a36470 Renamed help_message.cc into xfst_help_message.cc and added a header file xfst_help_message.h. Now XfstCompiler includes only the header file.
e25e08d Fixed a typo and added explicit braces to an else statement.
0061f6c Now option --pipe-mode takes an optional argument { input, output, both }, 'both' being the default.
62f26f0 Install hfstdll.h
d09ea9c Now hfst-lookup and hfst-optimized-lookup read from console and print to console by default on windows.
09eaafb Commented out one debug print...
cb24cf2 More improvements to utf-8 handling on windows.
e29bdce Small modifications for better utf-8 support on windows.
75a25d4 Small modifications for some command line tools for windows.
b141560 Added tool compilation scripts for windows. Argument handling on windows is also supported for some tools needed for hfst-xfst testing.
8b73295 Fixed a bug in argument handling.
e516f16 Edited help messages and warnings of hfst-xfst and hfst-lexc on windows about character encodings.
9d31b76 Moved getopt implementation to a separate file.
57f9a81 Added an implementation for long options for windows.
bb564a4 Fixed some issues noticed during msvc compilation. Renamed function 'min' to 'min_' to avoid collision with a macro with the same name. Changed arguments of comparison function given to qsort to (const void *, const void *). Added support for commandline arguments of type '--option=argument' to hfst-xfst for windows. Also changed hfst-xfst's option --print-to-console (default false) into --no-console (default false) on windows.
34d1c54 Updated windows script files.
b46445d Deleted getopt_long cc and h files. Processing command line arguments manually in hfst-xfst.cc on windows.
88c44eb Added some ifdef _MSC_VER directives.
eed630e Applied patch for musl libc compilation.
f3ffa44 Fixed some bugs noticed when compiling on windows.
0029101 Blank line between outputs in --finnpos mode
5c3f801 Updated windows scripts. Added ifdefs for msvc compilation.
1734803 Updated swig scripts for windows.
1169774 Add FinnPos mode and reorganise
d9c6038 Updated and added scripts for compiling python bindings on windows.
5a00166 Fix harmonization issue and speed up the common case of only on top-level matcher
4d8a4a6 Documented the tsv file format, added some linebreaks to make the helpt text easier to read.
2871135 Always skip comment lines. Added verbose output for TSV file reweighting.
3d2e4db Fixed bug #293 by initialising the line variable before using it.
bfd227b Whitespace only.
d850180 Added missing file 'hfstdll.h'.
ed20de2 Edited and added scripts for windows compilation.
25974b1 Added HFSTDLL's for some functions for windows compilation.
9fd87e0 Added option --arcs-only to hfst-reweight. Now weights of all arcs and end states are modified unless --arcs-only or --end-states-only is used. Also fixed '--end-state-only' as '--end-states-only'.
e8c1be4 Added some explicit type conversions.
2a77392 Changed the short form of the xerox mode from -x to -X (upper case), to be more consistent with the other options, and to match the test on line 195.
3ba83d9 Now it is checked if foma back-end and zlib are available before hfst-lexc-wrapper is generated. This also affects the way HfstTransducer::read_lexc_ptr is implemented for FOMA_TYPE: native HFST lexc compiler is used if lexc wrapper is not generated.
8557d61 Updated hfst version number in the python setup file. Fixed the argument type of LexcCompiler::setVerbosity from bool to unsigned int.
4a31565 Changed again some 'not' operators into \!.
e1c0779 Added a missing // in the beginning of a comment line noticed during windows compilation.
f4ecd94 Edited windows batch scripts and added a new one for creating python bindings.
590d56c Changed declaration 'class yy_buffer_state' into 'struct yy_buffer_state'.
1f83336 Fixed declared return type from 'void' into 'int' for functions hlexclex_destroy and pmatcherror.
cfe7852 Xerox twolc test support in hfst-pair-test
48dcdd7 Tentatively fixed missing paths problem in compose-intersect
2f75bbc Moved definition of member 'static const HfstState START' of classes ComposeIntersectFst and ComposeIntersectRulePair from header to cc file.
d4bdfb3 Add [].t() syntax for delimiting tagged (and context-checked!) regions
5eb0069 Added new scripts for testing native windows compilation.
982836a Added missing return values for some functions. Replaced ciso646 keywords for logical operators with standard ones.
ad4273d Added make scripts for windows.
e94301e Updated script.
7fccd3a Removed unnecessary HfstUtf8.h and HfstStrings2FstTokenizer.h inclusions from libhfst/src/ files. Also moved HfstStrings2FstTokenizer into hfst namespace.
d726d57 Now handling all stdbool definitions in foma headers.
7137e4d Now HAVE_CONFIG_H is checked before including config.h. Removed inclusion of zlib.h that had no effect.
b1ba306 Changed ifdef WINDOWS into _MSC_VER. Now zlib.h is included only when requested with -DZLIB.
a16942d Using dynamic arrays instead of static ones to keep cl.exe happy. Also added definition for 'ssize_t' on windows.
7203413 Replaced 'and' with '&&'.
beb57e3 Replaced 'and' with '&&'. Also made a variable defining the size of an array const.
9b26e06 Replaced ciso646 aliases and,or,not with &&,||,\!. Also changed sizeof(variable) into sizeof(variable_type) in ol transducer header because cl.exe complained about that for some reason.
237842a Fixed a bug where operator '==' was used instead of '=' when setting the value of last element in a char array.
dc51b2e Added missing files.
83d7d32 Added scripts and headers for windows compilation.
33409ad Changed operators {not,and,or} into {\!,&&,||} because cl.exe complains about them on windows.
d3779cd Added a script for windows compilation
1d327ca Added support for compiling openfst and foma back-ends with cl.exe on windows.
27b6994 Include zero in signed and unsigned integers
06517c7 Correction to handling trailing nonmatching material in locate_mode
f03b162 Forgot to commit these files too...
da83b10 Added xfsm support for some command line tools.
3097ddf Now HfstTransducer functions throw a FunctionNotImplementedExeption if xfsm implementation is not available.
4008054 Added more xfsm implementations.
818ad51 Added more implementations for XFSM_TYPE for HfstTransducer functions.
664573a Add ignoring
7b7d13f Correct parsing order of PARALLEL_RULES
40066a1 Added support for composition for xfsm transducers.
3c11736 Added a variable to control if (xfsm) transducers that are already minimal are still minimized for profiling purposes.
877c46f Fixed some bugs in escaping special symbols in prolog format. Added support for xfsm format in some command line tools.
3cf0029 Fixed an error noticed when -Werror=format-security was enabled during compilation.
d0ed632 Now xfsm conversion works in hfst-fst2fst and between prolog and binary format in hfst-fst2txt and hfst-txt2fst.
f688567 Added XFSM initializer class.
c803964 Added function minimize to xfsm interface
4c4ca0b Now hfst-compose-intersect does NOT harmonize transducers by default, as harmonizing produces unexpected results with hyperminimized transducers. Option --harmonize turns on harmonization. This is a partial fix to bug #288.
2e99ef0 Added a cross-build Windows patch given in bug report #289.
4cd63b0 Added a missing #if HAVE_XFSM statement.
e000b0c Fixed some bugs in xfsm stream handling.
a478fb2 Added more functions for writing and reading xfsm transducers.
fcc4c5d Tentatively added input and output stream functions for xfsm transducer.
091ca1d Modified and added scripts for creating static binaries.
8d6dd33 Fast compose intersect available using option -f
1915dd1 Reorganized xfsm interface.
ed7f448 Hopefully speed up compose_intersect by doing l .o. (l.2 .o. r) instead of l .o. r directly.
4b965b7 Updated script.
cc3632a Disable hfst-xfst's auto-complet with rl_insert instead of rl_abort, which is not found on Mac.
c41e726 Added a script to generate static binaries.
3a95772 Now xfsm conversion swaps the state numbers to make a more readable state numbering.
b2348d7 Now special symbols should also work in xfsm conversion.
b3af023 Xfsm conversion functions almost work, escaped symbols still need some attention.
b15f184 Conversion functions between xfsm and HfstBasicTransducer formats implemented, alphabets and special symbols still need some work.
5f79106 Autoindent entire file after removing tab characters to appease commit script
68ffd8e Handle single-char ascii symbols shadowing multichar ones
b1bdd27 Modified hfst to support xfsm format.
05da3ff Fix cg quote interpolation
38273d0 Clarify help string
b3450b4 Add segmenting and cg mode; improve handling of special cases and weights
5766bbf Tentatively modified hfst to handle xfsm implementation type.
db747e7 Preserve nonmatching sequances in locate_mode and parse the appropriately downstream
b7156e2 Tentatively added a skeleton implementation for xfsm library that can be included with configure option --with-xfsm (default is no). No linking to the library is performed, this is just a test version.
7427081 Added a preprocessing directive that defines hfst-twols's Alphabet as TwolCAlphabet if HAVE_XFSM is on. This will prevent variable collisions.
f579458 Changed hfst-txt2fst's read_prolog into read_prolog_format to avoid conflict with xfsm library function.
424e9f2 Named tokens 'UPPER' and 'LOWER' in pmatch and xre parsers into 'XRE_UPPER', 'XRE_LOWER', 'PMATCH_UPPER' and 'PMATCH_LOWER' because the xfsm library (that might be added to hfst at some point) has an enumerator with the same name in C namespace.
cc5e2fb Named foma's write_prolog into foma_write_prolog because the xfsm library (that might be added to hfst at some point) has a function with the same name in C namespace.
b7b9a5b Took away unused token 'LOWER' from xfst parser.
7701e12 Added an option --do-not-harmonize to hfst-compose-intersect.
ae06cea Now hfst-compose-intersect harmonizes the rule transducers with the lexicon.
3bfd0e8 Added option --beam to hfst-strings2fst.
b96dd3b More accurately name set_locate_mode as set_extract_tags mode (locate vs. match is determined by function call)
47ef7f0 Fix bug in compiling UNKNOWNs on the left side of pair separators
b97a28c Fix bug affecting locate-mode where ?-matches were being seen as IDENTITIES in input.
ec66bdd Added option --beam also to hfst-lookup. It is mostly untested for this tool.
0057847 Tentatively added option --beam to hfst-optimized-lookup.
e236dd2 Fixed a typo in warning message about missing symbols in input tapes of rule transducers ('output' changed to 'input').
43d9948 In locate mode, zero-length hits could cause infinite loops. Commit fixes it.
f5c1467 Now hfst-lookup checks it it is possible for an input to go through a transducer before calling is_infinitely_ambiguous. Should fix bug #278.
b4e4655 Now reserved symbols are detected in composition if xerox-composition is ON. This will prevent symbol collisions by throwing an error message, giving at least a temporary solution to issues with reserved symbols.
d2ef472 Fixed substitution operator in regexp, now flag diacritics are allowed both as substituting and subtituted symbols. Should fix bug #284.
92d2bc0 Added better support for special symbols in hfst-xfst's substitute command.
7cc650e Added two tests for hfst-xfst's substitute command. They are skipped until substitute can also handle them.
1ff2f40 Fixed a bug in composition when xerox-composition is ON. Flag diacritics @SOMEFLAG@ were earlier transformed into $ and back into @SOMEFLAG@, but symbols of form $...$ are already reserved for lexc. Now diacritics are escaped as %SOMEFLAG% during composition. This creates a new set of reserved symbols which maybe needs more consideration though... Also added debug prints for LexcCompiler.
ef9407e Removed xml checks from configure, since they cause unlinked xml references in hfst-edit-metadata on some platforms.
791aeb4 Fixed flag diacritic recognition, now also flags of form e.g. '@D.FOO@' are accepted and an empty string is returned as their value. Also added full support for flag handling in hfst-xfst.
94b723e Added some conversion functions between StringPairVectors and StringVectors. Added checks for flag diacritics in hfst-xfst non-optimized lookup.
6df280c A small fix to hfst-summarize option handling.
479630d Ready for release 3.8.2.
d5c8545 Added option --print-symbol-pair-statistics(=N) to hfst-summarize.
adff4fc Small fixes to compile-replace. Also restarting the char counter every time a new xre parsing is started. Removed the xml2 dependecy in configure.
b8fd981 Omit unnecessary step in stringification
adb2d91 Another slight speed improvement
b8564f4 Some double free insurance related to previous commit
786a502 Some more pmatch runtime speedup (around 5-10% in most cases)
6c383b4 Remove commented-out lines (same ones I was intending in the previous commit)
8c8be0b Remove extraenous index table fitting test
c150b35 Further improvements to conversion to optimized lookup format
ebb5d63 Large speedup in conversion to optimized-lookup format
5eaf679 Added test cases for compile-replace.
80059f5 Added test cases for merge operation.
e8c709f Fixed an error in hfst-fst2strings --print-separator where two consecutive lines of -- were printed between non-empty transducers in some cases.
9a30760 Now minimizing the merging automaton before merge operation so that epsilons do not cut a succesfull merge path. Also allowing epsilon-to-regexp-marker transitions in the merge filter.
5d89aba Improvements to compile-replace function, now it should work for input and output sides of a transducer.
372d184 Revert bungled change to precedence order
06fa073 Now xre compiler of function merge does not increment the char counter, making it possible to have many merge operators inside one regex.
bc32286 Added a constructor XreCompiler(XreConstructorArguments & args) to facilitate passing xre variables to merge function which needs them in its internal xre compiler.
de47594 Now using internal starptr variables in functions hfst::xre::compile and hfst::xre::compile_first instead of global hfst::xre::startptr. This should fix the strange memory errors which occurred when calling merge operation inside a regular expression.
e7c15a3 Now merge operation filters out non-optimal paths.
e153281 Tentatively added a function 'tokenize_and_align_flag_diacritics' to HfstTokenizer.
fe5cc6d Runtime speed improvements Prereserve table vectors, eliminate special_symbols map
a62fe68 Profiling support with Counter() and --profile & a bunch of smaller changes
09703ef Allowing 1-to-n composition of automata in archives. Fixes bug (or feature request) #277.
035edcd Now passing verbosity to LexcCompiler as an unsigned integer via setVerbosity(uint). Also made small fixes to warning prints in lexc compiler.
5fb6ba7 Added tests for one-sided flag diacritics for hfst-lexc.
602ec80 Now lexc compiler warns about one-sided flag diacritics in verbose mode.
fa95239 Added a tokenizing function that warns about symbol pairs, if needed.
4ad8eb4 Added test case for previous lexc commit (sublexicon defined more than once treated as an error).
4582622 Now multiple definitions of the same lexicon in lexc are treated as an error unless LexcCompiler::setAllowMultipleLexiconDefinitions(true) is called first.
95f58f4 Now lexc parser updates the error status hlexcnerrs when hlexcerror is called. In case of warnings, the error status is nor updated.
946bd68 Speed up list arc processing by replacing some maps and sets with vectors
373721d Changed list to set in merge operation. Removed commented code.
52fe564 Added option --encode-weights to hfst-lexc.
038ebd2 Reinstate undefined symbols as valid tokens as per documentation
212f177 Add defined lists
25e7a02 Add Sigma()
491d771 Added Lst() and support for list arcs in runtime
fe95b0a Sync precedence rules
482bad0 Switch order of precedence of concatenation and other binary operations
93d2887 Added Lit()
a5114b5 Modified the function merge, it now takes as an argument a map of list symbols.
7ef0509 Fixed a bug in xre parser, now definitions and unknowns can be used together in expressions such as 'regex [def:?] ;'.
ed742e9 Now the epsilon symbol is not added as a multichar symbol to hfst-lookup tokenizer if it is the empty string. Should fix bug #275.
0fb0fdb Allow \U00NNNNNN syntax for code points in utf-8 but above U+FFFF
2db8062 Allow \UNNNN as well as \uNNNN
4954a40 Allow \uNNNN in range notation and make some fixes to utf-8 handling
7784091 Added utf-8 character range expressions
f2674b2 An untested implementation of the merge operation added to hfst-xfst.
565d9bb Syntax-level completion of functions (arg placement still not completely free)
9bb6843 Fixed a small bug in hfst tool tester.
83312f7 Ready for release 3.8.1.
bcae2c3 Fixed std::cout into &std::cout in stream pointer comparison.
246258f Now using definitions USE_TR1_UNORDERED_(MAP|SET) when defining what unordered maps and sets to use.
c7c5ee8 Fixed a typo tr2 -> tr1.
97a3e59 Unordered maps and sets are used from std namespace if -std=gnu++11 is requested.
a5a62cb Forgot to update version number in swig bindings.
a578323 Various improvements and additions to function syntax, particularly empty args and string-args
5fb9c8d Added functions for merge operation in HfstTransitionGraph.
1df7d58 Modified weight handling in HfstTransitionGraph::intersect.
1e5bbfc When the alphabet is constructed from a symbol table, set identity to NO_SYM this was supposed to always happen anyway but didn't matter until recently
417bc81 Modified intersection algorithms in HfstTransitionGraph.
67f0a92 Added functions to be used in xerox's merge operation.
05651a4 Small fixes to list definitions in hfst-xfst.
2e1b79d Added an implementation for compile-replace in hfst-xfst, it still needs lot of testing.
244d6bf Yet some more functions added to compile-replace.
e18a645 Added more functions for compile-replace.
39800a1 Tentatively added functions in HfstTransitionGraph to be used in compile-replace.
7b5981f Added function is_well_formed_for_compile_replace to be used in compile-replace command.
c099596 Forgot to comment out debugging prints in tests.
f695bb4 Added brackets around member calls 'Interval.end' and 'Interval.begin' to avoid them getting confused with std::end() and std::begin() templates in C++11.
b0e0fe5 Rolled back earlier revision in interval-set.h
d223f72 Made small modifications for better c++11/c++0x support.
51ff0da Fix bug where identity wasn't being set to NO_SYMBOL when absent from alphabet
c2ee587 Refuse to enter flag loops more than once fixes bug #250 bug - arguably this could be applied to epsilon loops too, pending discussion
15f25c6 Fixed a typo in Makefile.
1faecef Now hfst-xfst gives a warning (or exits) if a binary command tries to access a stack with less than 2 transducers.
a7fe5d6 Now hfst-xfst exits if a command tries to access an empty stack if quit-on-fail is ON and hfst-xfst is not in interactive mode.
25255de Comment explaining usage of hfst-fst2tesseract.xfst.
ed8f040 Added script for converting morphological analyzers to Tesseract word models.
61b43f5 Improvements to loop finding
4d3bab0 Further corrections to loop detection slowdown back to ~10x but may be improved from here
38bb11f Fix some cases of overdetecting infinite ambiguity, there's still some left
f99de3f Forgot to keep adding the repeated states in the loop detection phase
f327e83 Only try to catch infinite ambiguity at epsilon arcs This is the big speed win and presumably correct.
aa0928d This order of comparison is a bit faster since sizes never differ
90b4387 Speed up is_lookup_infinitely_ambiguous() somewhat
3edea32 Try to avoid using negative indexes for arrays
1eedad1 Optimised lookup tests
a1eb3e1 Runtime handling of identity and unknown
dd8a983 Forgot to remove one thing in the last commit
e1db16f Take out our own harmonization hacks now that they're unneeded
6b0a888 Don't use delimiters when they're not necessary, also don't insert everything to RTNs anymore and provide the is_special() function the previous commit required
e6a35c3 Treat special pmatch symbols like flag diacritics for harmonization, also after harmonization add all symbols, including flags, to the alphabets
40feabb Fix bug where delimiters were shadowing the named transducers' names also remove extraenous parsing path
7432ed4 Updated pmatch functionality tests.
6a89ad8 Add string literal syntax for standalone %-escaped chars (they used to be considered symbols which now have to be defined or cause an error)
18b6039 Use minimization guards to keep multiple negative contexts separate in disjunctions
817ec29 Fixed some more spelling errors noticed by lintian.
dcb9e44 Fixed spelling errors found by lintian.
727436d Added again hfst-train-tagger man page which is no more a symlink.
ee79595 Added missing man pages.
3e650c8 Now hfst-reweight-tagger --help returns EXIT_SUCCESS before trying to access uninitialized values.
9665cdc Warn about shadowing definitions
8608975 Alternate syntaxes regex for Define TOP and .#. for #
1874455 Revamp LABEL parsing and introduce curly literal pairs
7b2c44e Updated and added man pages.
cbb4a40 Moved option checking after possible returning from program so that option --help will not generate error messages.
7f69948 Require backslash character to be escaped as \\ in curly literals
832d7b3 Fix bug in unescaping function
6918fd0 Ready for release 3.8.0.
9336e7a Added variable 'lexc-rename-flags' to hfst-xfst.
73c2a33 Desperately fiddle with the way the minus operation expands things
ee6d6fa Add lambda-like anonymous definitions for controlling subexpression boundaries
9ef3676 Changed the flag handling behavior of hfst-xfst and hfst-lexc. Now both tools by default use Xerox's way when composing, i.e. flag diacritics match unknown and identity symbols. This can be controlled with variable 'xerox-composition' (the default is ON) in hfst-xfst and with option '--xerox-composition={ON,OFF}' (the default is also ON) in hfst-lexc. hfst-regexp2fst also has the option '--xerox-composition' which by default is OFF, as it was earlier.
6a57210 Fixed a typo in hfst-regexp2fst option handling.
a64b247 Now one-sided flag diacritics are allowed in composition when flag-is-epsilon is used.
1dfbb23 Added option --log10 for 10-based logarithmic weights in hfst-strings2fst.
0bcf18f Now an error is thrown if flags are not twosided in composition when xerox composition is used.
a959c4b Added exception class FlagDiacriticsAreNotIdentitesException.
6d8c1ab Add pmatch functionality test suite
d5d441e Added option -X flag-is-epsilon to hfst-regexp2fst.
33b17d7 Fix another symbol-leaking issue
91d5ce0 Allow nested logical operations on contexts
d4b01d2 We need to avoid symbol pollution for more than just special symbols (this doesn't completely resolve pollution issues, just some urgent ones)
89149b0 Don't forget to pop the rtn stack when there's nothing matched
6a3552f Minimize after adding delimiters, not before
fdf46b1 Revert bracket-bounding behaviour
fecbf54 Use brackets for extra delimiters to control tag and context boundaries more
51382e2 add_delimiters was happening in the wrong place since recent syntax changes
8467812 Fixed a too strict assertion.
7ac7e49 Added a return value for a case that should never happen to make scan-build happy.
09b1260 Reverted back to the buggy behaviour of function 'getMarkerNumber', since some HfstXeroxRules tests will fail if it works correctly...
c7d12d4 Fixed an error in function 'getMarkerNumber' where istringstream was not properly initialized and returned random values. Also added a print method for class Rule.
7b7f0fb Fixed a typo in filename in EXTRA_DIST.
caad8cb Now alphabets are copied when encoding and decoding flags in composition. Fixes bug #267.
ad47d32 Oops, forgot about restoring scope state afer entry arcs
f325949 Add a scope facility for local context boundaries
591c5b5 Fixed most scan-build issues other than dead store, memory leak and errors from foma back-end.
d207568 Added implementation for twosided flag-diacritics in hfst-xfst.
d3c0506 Now harmonize-flags and flag-is-epsilon are by default OFF in hfst-xfst. Also added a new, mostly untested variable xerox-composition that matches flags with unknowns and identities in composition (default is OFF). All flag-is-epsilon functionalitites are moved under HfstTransducer.
ce348d2 Purge commented-out lines
3752461 Major syntactic changes and additions (AND & OR) to bring in line with Karttunen's documentation - amazingly doesn't break backwards compatibility (brobably / mostly)! Function syntax still lags and is incomplete.
79f7b42 Added missing return value to a function in openfst back-end.
636451e Reverted back to version 3992.
e0442ee Instead of NULL, return a new HfstTransducer in a condition that should never occur. This is for some compilers.
0d7cdee Fixed errors reported in bug #265. Also tentatively added a switch --xfst-harmonization to hfst-xfst that treats flags as ordinary symbols in composition.
08462a7 Resolved another rpmlint issue, hopefully the right way (A HfstTransitionGraph method returning nothing when it seemed to be meant to be returning *this)
68ed253 Resolved some issues and nonissues revealeled by rpmlint
0e2f676 Now xre parser compiles expressions of type foo:bar^{0,N} correctly.
be1e33d A better way see if we have a better location than before
ba31a12 It seems to be generally faster to defer minimization at [] boundaries
17b833b Made 'harmonize-flags == ON' the default for hfst-xfst. Also hfst-xfst's xre parser now harmonizes flags according to variable 'harmonize-flags'.
90cff75 Fixed some issues noticed on c+11 and reported in bug #258: a space between literal and identifier, missing cstdlib header and ostringstream conversions.
9da1a0f hfst-fst2strings now gives an error message if option --nbest or --random is used with transducers in optimized lookup format.
ddea524 Apparently older gcc's libstdc++ require operator< to be const
f6df4d2 Added locate mode on the library side; support for multimatch, weight access and various internal changes
084d1a6 Now all command line tools should give an error message and exit with >0 if they cannot process input in hfst optimized lookup format.
1959537 Now hfst-invert prints an error message if given a transducer in optimized lookup format. This should be fixed in all command line tools.
5a5a3e4 Function set_expand_definitions added to the python interface.
d89d63b Updated HFST version number.
26bad71 Updated HFST version number.
78a7e30 Added option --renameFlags to hfst-lexc for testing purposes.
9e7a3ff Added option --encode-weights to hfst-determinize.
c9ae91f Now hfst-lexc has options --withFlags and --minimizeFlags and hfst-xfst variables lexc-with-flags and lexc-minimize-flags to control if hyperminimization is used when parsing lexc files. Flag minimization can be the default behaviour when it has been properly tested.
57cc2df The Xerox output should have an empty line between each cohort.
01928da Now flag minimization in should work in LexcCompiler. Commented the changes out until all expected results from lexc tests are changed accordingly.
38b3724 Apparently failing to read a stream now throws HfstException so catch that instead
c29a94e Tentatively added a piece of code that filters out multiple flags in lexc result. Currently commented out.
8b0216c A hopefully temporary tool called proc2 for simple tokenization, to become part of something else or be renamed in the future
ae42592 Some fixes to python wrapping of pmatch and showing original input in pmatch
cee590b Make locate() return a simple data structure rather than a string
3ad4e8e Convenience loader function for pmatch
0cc41ab Fix outdated prototype of printConnectedness() and add pmatch prototypes
125ed8a Added option --Werror to hfst-lexc that treats warnings as errors.
2e28445 Added tests for the fixed bug #243 that will pass now.
24e7b46 Now keyword LEXICON is allowed without a preceding newline in lexc parser. Should fix bug #243.
a50dd1a Added a variable encode_epsilons to function priority_union whose default value (true) should fix bug #254. Function lenient composition still calls priority_union with a false value of encode_epsilons.
94dc75b Added support for cross-products such as {foo}:[bar] and foo:[bar] in regexp parser. Tests still needed.
d42799e .NOTPARALLEL for now
85a69f9 Fix serious bug with epsilons in contexts - they were incorrectly moving the input tape.
b04de37 with pipes
941baa9 A super cool progress bar functionality \o/
30cf7bd Fix context-related bug
13c1904 Readability refactoring
7f22c97 Comment fixes and some more deadwood elimination
cf24a3b Remove more unneeded stuff
8d58ce1 Remove some obsolete & commented out bits
dd779dd Locatefy option
93d00f8 Don't make noise about almost always harmless result truncation
9abaa72 Large refactor of almost everything (mainly to better serve tokenization apps w/tape synchronisation)
8f8d846 Undo previous commit, was confused by polluted automake cache
df23ced It seems that newer toolchains don't like c++ classes being defined over multiple .cc files, or something..
1958ec8 Don't count flag diacritics when locatefying
516bf5c Remove obsolete bogus -> rule from when @-> had problems
e5d943a Handle missing @bin files more nicely
04cf7dd Autoconvert @bin arguments if necessary
9ce5490 Remove obsolete dependency readme that hogs the svn trunk page
c427c3d Add constructor option to make diacritic strings blank or not; make pmatch consider them blank
2c2e664 Added file 'test' to EXTRA_DIST..
b1f01a2 Fixed hfst-twolc test Makefile.am
211935e Disabled outdated unit tests.
6b74d9f Tentatively implemented variable 'flag-is-epsilon' in hfst-xfst.
531acff Fixed issue with left arrow conflicts for contexts with impossible word boundaries.
2881c9e Also added tests for regexps with different continuation lexicons in hfst-lexc.
489474a Now regexps with different continuation lexicons are allowed in hfst-lexc. Should fix bug #247
21db219 Fixed list center rules and conflict resolutions.
6942188 Tests for list centers.
cd1193b Tests for list centers.
6e909fd Exit on parse error Once we've printed a parsing error, exit rather that print intimidating hfst exception messages
efdd01b Pattern-locating mode
9061298 Some changes to approach to avoiding stack overflow should fix problems with very long input lines
2cbb071 Add harmonization
1c1007c Add extract tags -option (this commit also includes a secret commented-out facility for profiling activity on the input tape by drawing a funky ascii histogram)
79c629f Fix incorrect check for whether tokenization needs to respect a long symbol
7633686 Allow epsilon-symbol pairs
01eb211 Don't print huge data dumps when parsing fails
2273237 Revert back to the original TropicalWeightTransducer.cc and forget splitting it..
d7755c3 Moved minimize and push functionalities from TropicalWeightTransducer.cc to separate files. This will make compiling TropicalWeightTransducer faster and prevent 'File too big' errors on Windows.
1c74930 While scanning for possible first symbols, don't accidentally go beyond the end of the input tape
c50a51f Ready for release 3.7.1.
f17327e Now removing CR characters from output of hfst-xfst so that tests will pass on windows too.
f374022 Now an error message is thrown in HFST when reading a native SFST transducer that uses the empty symbol.
6596539 Added option --encode-weights (-E) to hfst-minimize.
e7d0c0e Added documentation to parallel replace and removed unnecessary marker insertions.
1d1251a Modified conpilation of weighted parallel rules by inserting markers to keep same mappings with different weights separate. Also added a StringPair variant of insert_symbols_to_alphabet to HfstTransducer and HfstTransitionGraph. Added a test case for hfst-xfst. Updated NSIS installer.
60b45cb Tentatively added a function for substituting weights with markers.
fd29751 Remove spurious obsolete comment
2925fb7 Bounds check accesses to ascii_symbols when reading input too (ascii_symbols used to have a NO_SYMBOL entry for any char, but now without a check some accesses were resulting in bogus keys)
d23d42f Add -D_XOPEN_SOURCE=500 in order to prevent -std=c99 from preventing strdup() prototype being visible
9f7311e Correct bounds check for ascii_symbols alterations
988e259 Now checking that ascii_symbols vector index is not out of bounds when removing shadowing symbols. Should fix bug #235.
7b89f3f Replace rules bug fix
6bbf8bd Parallel rules bug solved
4ad528d Ready for release 3.7.0.
6fd6272 Added missing files to Makefile.
180ebd2 Added option --encode-weights also to hfst-regexp2fst.
04837d3 Added option --encode-weights (default false) to hfst-compose-intersect.
ec55eab replace rules - weights are removed from calculating contexts (they were unnecessary there)
34fcf5f Removed commented code.
d469a2b Modified TropicalWeightTransducer::are_equivalent so that determinization is carried out in a way similar to 'intersect' and 'subtract'.
aee1515 Updated configure.ac
b4fbd3f Reverted hfst-ospell changes.
edb752e Tentatively added hfst-ospell under tools/src, it is only enabled if --enable-ospell is requested.
911b8a0 Use precision 1 in weighted tests.
e45e2fc Reinstate special ascii tokenization while avoiding shadowing longer symbols