Spacy Debugger 's log gives different results from Assemble in spacy-llm #12812
-
Hello ! i need assistance on spacy-llm. when I use it with GPT-3.5, if the same entity.text is already extracted it won't shown or extracted again. Here is example of results from logging : There above there is entity like Course Name_Two. But when I use: from spacy_llm.util import assemble
nlp = assemble("config.cfg")
doc = nlp("Magdalena Gerson Mtera ACCADEMIC BACK GROUND PROFESSIONAL DEVELOPMENT I have an ability to interact with different kind of people easily Hard working and committed to achieve expected results on time I can work under non or minimum level of supervision Also I am cooperative and a good team player AWARDS. Trophy of the Best Netball player of the Year (2017) from the Institute of Social Work Dar es Salaam. A certificate of Restless Development Program from the Institute of Social Work Dar es Salaam (2014). A Certificate of Computer Applications in Microsoft Office (Word Excel and Publisher) Internet and E-mails from South and East Africa Minerals and Geosciences Centre (SEAMGC) Dar es Salaam Tanzania (2012). A Certificate of Recognition Youth of United Nations of Tanzania (YUNA). From the UN Club at Morogoro Secondary School (2011). A Certificate of Nursing from Morogoro Secondary School (2011). Bachelor of Human Resource Management Institute of Social Work 2015 to 2017 Diploma of Human Resource Management Institute of Social Work 2013 to 2015 Certificate of Social Work Practice Institute of Social Work 2012 to 2013 Certificate of Secondary Education Morogoro Secondary School 2008 to 2011 Magdalena Gerson Mtera Address South and Eastern Mineral Centre P.O. Box 9573 Dar es Salaam Tanzania Mobile +255 716 665 162 +255 752 720 741 E-mail [email protected] INTERESTS Sports and Volunteering Art and Culture Travelling REFERIES. Ricky Emmanuel Loan Manager Moraiah Financial Services Ltd [email protected] Tel +(255) 758 811217 Salome Doreen Magoha Human Resources Officer Murphy Ben International Tanzania Tel + (255) 687 246425 Mr. Dickson Ndege Lecturer at the Institute of Social Work Dar es Salaam Department of Human Resources Management Tel + (255)715 011234") for x in doc.ents:
print(x.label_,x.text) I got this : There above the entity Course Name_Two is missing , why ? ... and this happens for all entity that appear to carry the same value. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Thanks for the report! I've been able to reproduce this with
and then
which gives as logging output:
and as stdout output reflecting the (wrong) data in
This must be due to the relatively simple parsing/alignment procedure that |
Beta Was this translation helpful? Give feedback.
Hi @innocent-charles,
With
spacy-llm
, it should be relatively straightforward to implement your own task definition according to your needs, cf here.We're also working on improving the built-in NER task in the meantime, and will take this feedback on board.