KAISER: KAIst Semantic parsER

***** [Update] November, 2019 *****

Kaiser is available for both of English FrameNet and Korean FrameNet

About

Kaiser is a semantic parser to understand the meaning of texts in terms of FrameNet.

frame (frame semantics) is a schematic representation of a situation or an event. For an example sentence, '헤밍웨이는 1899년 7월 21일 일리노이에서 태어났고, 62세에 자살로 사망했다.', KAIST-frame-parser identifies several frames such as Being_born and Death for Korean lexical units (e.g. 태어나다.v and 사망하다.v)

Our model is based on the BERT with fine-tuning. The model predict Frames and their arguments jointly.

prerequisite

python 3
pytorch-pretrained-BERT (Link)
Korean FrameNet (Link)

How to use

Install

Install transformers, kaiser, and Korean FrameNet

pip install transformers
git clone https://github.com/machinereading/kaiser.git
cd ./kaiser
git clone https://github.com/machinereading/koreanframenet.git

How to use a single-language frame-semantic parser

Download the pretrained model

Download two pretrained model files to {your_model_dir} (e.g. /home/model/bert_ko_srl_model.pt).

Korean Model: (download)
English Model: (download)
Multilingual Model (En+Ko): (download)

Import model (in your python code) (make sure that your code is in a parent folder of kaiser)

from kaiser import kaiser

model_path = {your_model_dir} # absolute_path (e.g. /home/model/bert_ko_frame_model.pt)
parser = parser.ShallowSemanticParser(model_path=model_path, masking=True)

optional: If you want to DO NOT USE LU DICTIONARY, set argument masking=False)

Parse the input text

text = '헤밍웨이는 1899년 7월 21일 미국 일리노이에서 태어났고 62세에 자살로 사망했다.'
parsed = parser.parser(text, sent_id='1', result_format='all')

optional: sent_id and result_format is not mandatory argument. You can get the result in following argument: conll', graph, textae, and all. The result consits of following three parts:

(1) triple format (result_format='graph') (2) conll format (result_format='conll') (3) pubannotation format (result_format='textae')

Or, you can get all result in json by result_format='all'

result

conll format The result is a list, which consists of multiple Frame-Semantic structures. Each SRL structure is in a list, which consists of four lists: (1) tokens, (2) lexical units, (3) its frames, and (4) its arguments. For example, for the given input text, the output is in the following format:

[
    [
        ['헤밍웨이는', '1899년', '7월', '21일', '미국', '일리노이에서', '태어났고,', '62세에', '자살로', '사망했다.'], 
        ['_', '_', '_', '_', '미국.n', '_', '_', '_', '_', '_'], 
        ['_', '_', '_', '_', 'Origin', '_', '_', '_', '_', '_'], 
        ['O', 'O', 'O', 'O', 'O', 'B-Entity', 'O', 'O', 'O', 'O']
    ], 
    [
        ['헤밍웨이는', '1899년', '7월', '21일', '미국', '일리노이에서', '태어났고,', '62세에', '자살로', '사망했다.'],
        ['_', '_', '_', '_', '_', '_', '태어나다.v', '_', '_', '_'], 
        ['_', '_', '_', '_', '_', '_', 'Being_born', '_', '_', '_'], 
        ['B-Child', 'B-Time', 'I-Time', 'I-Time', 'B-Place', 'I-Place', 'O', 'O', 'O', 'O']
    ], 
    [
        ['헤밍웨이는', '1899년', '7월', '21일', '미국', '일리노이에서', '태어났고,', '62세에', '자살로', '사망했다.'], 
        ['_', '_', '_', '_', '_', '_', '_', '_', '자살.n', '_'], 
        ['_', '_', '_', '_', '_', '_', '_', '_', 'Killing', '_'], 
        ['B-Victim', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'O', 'O']
    ],
    [
        ['헤밍웨이는', '1899년', '7월', '21일', '미국', '일리노이에서', '태어났고,', '62세에', '자살로', '사망했다.'], 
        ['_', '_', '_', '_', '_', '_', '_', '_', '_', '사망.n'], 
        ['_', '_', '_', '_', '_', '_', '_', '_', '_', 'Death'], 
        ['B-Protagonist', 'O', 'O', 'O', 'O', 'O', 'O', 'B-Time', 'B-Manner', 'O']
    ]
]

Another example sentence is '그는 그녀와 사랑에 빠졌다.'.

[
    [
        ['그는', '그녀와', '사랑에', '빠졌다.'], 
        ['_', '_', '사랑.n', '_'], 
        ['_', '_', 'Personal_relationship', '_'], 
        ['B-Partner_1', 'B-Partner_2', 'O', 'O']
    ],
    [
        ['그는', '그녀와', '사랑에', '빠졌다.'], 
        ['_', '_', '_', '빠지다.v'], 
        ['_', '_', '_', 'Experiencer_focus'], 
        ['B-Experiencer', 'B-Topic', 'I-Topic', 'O']
    ]
]

The word '빠지다' would be have different meaning in its usage in the context.

An example is '검은 얼룩이 흰 옷에서 빠졌다.'.

[
    [
        ['검은', '얼룩이', '흰', '옷에서', '빠졌다.'], 
        ['_', '_', '_', '옷.n', '_'], 
        ['_', '_', '_', 'Clothing', '_'], 
        ['O', 'O', 'B-Descriptor', 'O', 'O']
    ],
    [
        ['검은', '얼룩이', '흰', '옷에서', '빠졌다.'], 
        ['_', '_', '_', '_', '빠지다.v'], 
        ['_', '_', '_', '_', 'Emptying'], 
        ['B-Theme', 'I-Theme', 'B-Source', 'I-Source', 'O']
    ]
]

Licenses

CC BY-NC-SA Attribution-NonCommercial-ShareAlike
If you want to commercialize this resource, please contact to us

Publisher

Machine Reading Lab @ KAIST

Contact

Younggyun Hahm. hahmyg@kaist.ac.kr, hahmyg@gmail.com

Acknowledgement

This work was supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(MSIT) (2013-0-00109, WiseKB: Big data based self-evolving knowledge base and reasoning platform)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
data		data
koreanframenet		koreanframenet
src		src
0108-eval-mul.out		0108-eval-mul.out
0108-eval-proto.out		0108-eval-proto.out
0109-eval-mul.out		0109-eval-mul.out
README.md		README.md
Untitled.ipynb		Untitled.ipynb
Untitled1.ipynb		Untitled1.ipynb
c.json		c.json
check00110.ipynb		check00110.ipynb
distilling.out		distilling.out
dummy0104.ipynb		dummy0104.ipynb
en-for-en-1231.out		en-for-en-1231.out
en-for-ko.err		en-for-ko.err
en-for-ko.out		en-for-ko.out
eval-10-0107.out		eval-10-0107.out
eval-25-0107.out		eval-25-0107.out
eval-distilling-1231.out		eval-distilling-1231.out
eval-proto-distill-0105.out		eval-proto-distill-0105.out
eval_graph.ipynb		eval_graph.ipynb
eval_mulModel_1229.out		eval_mulModel_1229.out
evaluate.ipynb		evaluate.ipynb
evaluate.py		evaluate.py
evaluate_multilingual.ipynb		evaluate_multilingual.ipynb
evaluate_multilingual.py		evaluate_multilingual.py
inference.ipynb		inference.ipynb
inference.py		inference.py
ko-for-ko-nomaksing.out		ko-for-ko-nomaksing.out
ko-for-ko-nomasking.err		ko-for-ko-nomasking.err
mul-for-en.err		mul-for-en.err
mul-for-en.out		mul-for-en.out
multi-for-en.err		multi-for-en.err
multi-for-en.out		multi-for-en.out
multi-for-ko.err		multi-for-ko.err
multi-for-ko.out		multi-for-ko.out
nohup.out		nohup.out
nohup.out.bak1210		nohup.out.bak1210
nomasking.err		nomasking.err
nomasking.out		nomasking.out
parser.ipynb		parser.ipynb
parser.ipynb.bak1223		parser.ipynb.bak1223
parser.py		parser.py
parser_bak1223.ipynb		parser_bak1223.ipynb
restApp.py		restApp.py
run_rest_service.py		run_rest_service.py
target_identifier.ipynb		target_identifier.ipynb
target_identifier.py		target_identifier.py
train-proto-0102.out		train-proto-0102.out
train_distill_proto_0104.out		train_distill_proto_0104.out
train_prototype.ipynb		train_prototype.ipynb
train_prototype.py		train_prototype.py
training.ipynb		training.ipynb
training.py		training.py
training_25.out		training_25.out
training_25_0105.ipynb		training_25_0105.ipynb
training_25_0105.py		training_25_0105.py
training_distil_prototype.ipynb		training_distil_prototype.ipynb
training_distil_prototype.ipynb.bak0104		training_distil_prototype.ipynb.bak0104
training_distil_prototype.py		training_distil_prototype.py
training_distilling.ipynb		training_distilling.ipynb
training_distilling.py		training_distilling.py
training_finetuning.ipynb		training_finetuning.ipynb
training_finetuning.py		training_finetuning.py
training_multilingual.ipynb		training_multilingual.ipynb
training_multilingual.py		training_multilingual.py
trining_with_prototype.ipynb		trining_with_prototype.ipynb
trining_with_prototype.py		trining_with_prototype.py
trn_10per.json		trn_10per.json
trn_25per.json		trn_25per.json
withProto0110.out		withProto0110.out

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KAISER: KAIst Semantic parsER

About

prerequisite

How to use

How to use a single-language frame-semantic parser

Licenses

Publisher

Contact

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

KAISER: KAIst Semantic parsER

About

prerequisite

How to use

How to use a single-language frame-semantic parser

Licenses

Publisher

Contact

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages