From 4f40bd0517a906cc3441e6e1854c76a37b1e9976 Mon Sep 17 00:00:00 2001 From: Nico Matentzoglu Date: Wed, 8 Nov 2023 13:01:00 +0200 Subject: [PATCH 1/2] Add ML-based techniques to semapv --- semapv-terms.owl | 282 +++++++++++++++++------------- semapv-terms.tsv | 11 +- semapv.owl | 435 ++++++++++++++++++++++++++--------------------- 3 files changed, 421 insertions(+), 307 deletions(-) diff --git a/semapv-terms.owl b/semapv-terms.owl index 44417db..1406780 100644 --- a/semapv-terms.owl +++ b/semapv-terms.owl @@ -9,7 +9,7 @@ xmlns:skos="http://www.w3.org/2004/02/skos/core#" xmlns:terms="http://purl.org/dc/terms/"> - + @@ -71,9 +71,9 @@ - A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching. - A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge. - background knowledge-based matching process + A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching. + A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge. + background knowledge-based matching process @@ -83,9 +83,19 @@ - A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. - The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). - blank normalisation + A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. + The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). + blank normalisation + + + + + + + + + A matching process based on the comparison of matched super and subclasses (paths) of two entities. + bounded path matching @@ -94,7 +104,7 @@ - cardinality filtering + cardinality filtering @@ -104,9 +114,9 @@ - A normalization technique converting each alphabetic character in a string to their lower case counterpart. - The string 'Ontology' is normalised to 'ontology' (lower case). - case normalization + A normalization technique converting each alphabetic character in a string to their lower case counterpart. + The string 'Ontology' is normalised to 'ontology' (lower case). + case normalization @@ -115,9 +125,9 @@ - A matching process based on multiple, possibly intertwined, matching approaches. - An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural. - composite matching process + A matching process based on multiple, possibly intertwined, matching approaches. + An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural. + composite matching process @@ -127,9 +137,9 @@ - A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements. - The string 'Protégé' is normalised to 'Protege'. - diacritics suppression + A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements. + The string 'Protégé' is normalised to 'Protege'. + diacritics suppression @@ -139,9 +149,9 @@ - A normalization technique removing all numbers in a string. - The string 'ontology1' becomes 'ontology'. - digit suppression + A normalization technique removing all numbers in a string. + The string 'ontology1' becomes 'ontology'. + digit suppression @@ -151,7 +161,18 @@ - edit distance + edit distance + + + + + + + + + A matching process that uses graph representation learning which is a method in machine learning that focuses on learning a compact representation for graphs, capturing their structural information. + Matches between entities are identified by analyzing the learned representations that encode the structural features and relationships within graph data. + graph representation learning-based matching process @@ -161,7 +182,7 @@ - hamming distance + hamming distance @@ -171,9 +192,20 @@ - A matching process based on individual representations (or instances). - A match between a subject A and an object B was established because they share the same instances. - instance-based matching process + A matching process based on individual representations (or instances). + A match between a subject A and an object B was established because they share the same instances. + instance-based matching process + + + + + + + + + A matching process that employs large language models (LLMs) which are pre-trained on vast amounts of text data and can understand and generate human-like text, making them suitable for tasks requiring a deep understanding of language. + Matches between entities are determined through the language understanding capabilities of LLMs, such as semantic context and language inference. + LLM-based matching process @@ -183,7 +215,7 @@ - language-based similarity measure + language-based similarity measure @@ -193,8 +225,8 @@ - The process of reducing the different forms of a word to one single form. - lemmatization + The process of reducing the different forms of a word to one single form. + lemmatization @@ -204,7 +236,7 @@ - levenshtein distance + levenshtein distance @@ -213,9 +245,9 @@ - A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object. - The label of a subject entity matches to an exact synonym of an object entity. - lexical matching process + A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object. + The label of a subject entity matches to an exact synonym of an object entity. + lexical matching process @@ -224,9 +256,9 @@ - A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm. - A match between a subject and an object was established because they had a Levenshtein score higher than 0.8. - lexical similarity threshold-based matching process + A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm. + A match between a subject and an object was established because they had a Levenshtein score higher than 0.8. + lexical similarity threshold-based matching process @@ -236,9 +268,9 @@ - A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks. - The string 'alzheimer's' becomes 'alzheimers'. - link stripping + A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks. + The string 'alzheimer's' becomes 'alzheimers'. + link stripping @@ -247,7 +279,7 @@ - logical consistency filtering + logical consistency filtering @@ -256,9 +288,20 @@ - A matching process based on the inferences made by a logical reasoner. - Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK. - logical reasoning process + A matching process based on the inferences made by a logical reasoner. + Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK. + logical reasoning process + + + + + + + + + A matching process that involves machine learning algorithms which learn from data to find patterns or make decisions with minimal human intervention. + Matches between entities are made by applying learned models to data points to predict similarities or relationships. + machine learning-based matching process @@ -267,9 +310,9 @@ - An matching process that is performed by a human agent and is based on human judgement and domain knowledge. - A human curator determines that a subject should be mapped to an object by virtue of their domain expertise. - manual mapping curation + An matching process that is performed by a human agent and is based on human judgement and domain knowledge. + A human curator determines that a subject should be mapped to an object by virtue of their domain expertise. + manual mapping curation @@ -277,9 +320,9 @@ - A triple <s,p,o> comprising a subject entity s, an object entity o and a mapping predicate p. - The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate. - mapping + A triple <s,p,o> comprising a subject entity s, an object entity o and a mapping predicate p. + The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate. + mapping @@ -287,9 +330,9 @@ - A process that relates to the creation, confirmation, rejection or curation of a mapping. - Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping. - mapping activity + A process that relates to the creation, confirmation, rejection or curation of a mapping. + Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping. + mapping activity @@ -298,9 +341,9 @@ - A matching process based on the traversing of multiple mappings. - A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B. - mapping chaining-based matching process + A matching process based on the traversing of multiple mappings. + A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B. + mapping chaining-based matching process @@ -310,13 +353,13 @@ - mapping flipping - mapping inversion - mapping inverting - semantic walk-backs - A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate. - A broad match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation. - mapping inversion-based matching process + mapping flipping + mapping inversion + mapping inverting + semantic walk-backs + A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate. + A broad match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation. + mapping inversion-based matching process @@ -325,9 +368,9 @@ - A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct. - A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation. - mapping review + A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct. + A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation. + mapping review @@ -336,11 +379,11 @@ - matching operation - matching task - An process that results in a mapping between a subject and an object entity. - The label of a subject entity matches to an exact synonym of an object entity. - matching process + matching operation + matching task + An process that results in a mapping between a subject and an object entity. + The label of a subject entity matches to an exact synonym of an object entity. + matching process @@ -349,8 +392,8 @@ - A process applied to a set of mappings, usually with the intention of changing it. - matching process post-processing + A process applied to a set of mappings, usually with the intention of changing it. + matching process post-processing @@ -359,8 +402,8 @@ - A preliminary processing of inputs prior to performing matching. - matching process pre-processing + A preliminary processing of inputs prior to performing matching. + matching process pre-processing @@ -370,8 +413,8 @@ - A family of preprocessing techniques for reducing strings to be compared to a common format. - normalization + A family of preprocessing techniques for reducing strings to be compared to a common format. + normalization @@ -381,9 +424,9 @@ - A normalization technique removing all punctation characters from a string. - The string 'e.g.' becomes 'eg'. - punctuation elimination + A normalization technique removing all punctation characters from a string. + The string 'e.g.' becomes 'eg'. + punctuation elimination @@ -392,9 +435,9 @@ - A preprocessing method transforming a string by matching a regular expression and then removing that match. - The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping. - regex removal + A preprocessing method transforming a string by matching a regular expression and then removing that match. + The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping. + regex removal @@ -403,9 +446,9 @@ - A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string. - The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping. - regex replacement + A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string. + The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping. + regex replacement @@ -414,9 +457,9 @@ - A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm. - A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses. - semantic similarity threshold-based matching process + A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm. + A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses. + semantic similarity threshold-based matching process @@ -425,8 +468,8 @@ - A technique for determining a score that characterises the similarity between two entities. - similarity measure + A technique for determining a score that characterises the similarity between two entities. + similarity measure @@ -435,7 +478,7 @@ - stable marriage filtering + stable marriage filtering @@ -444,8 +487,8 @@ - The process of reducing a word to its word stem. - stemming + The process of reducing a word to its word stem. + stemming @@ -455,9 +498,9 @@ - A preprocessing method transforming a string by removing a list of stop words. - The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping. - stop-word removal + A preprocessing method transforming a string by removing a list of stop words. + The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping. + stop-word removal @@ -467,8 +510,8 @@ - A similarity measure based on the comparison of strings. - string-based similarity measure + A similarity measure based on the comparison of strings. + string-based similarity measure @@ -478,8 +521,8 @@ - A string-based similarity measure which involves determining if two strings associated with mapping entities are equal. - string equality + A string-based similarity measure which involves determining if two strings associated with mapping entities are equal. + string equality @@ -489,9 +532,9 @@ - Structural matching does not involve looking at "values" of properties. - A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct property of a class. - structural matching process + Structural matching does not involve looking at "values" of properties. + A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct property of a class. + structural matching process @@ -501,7 +544,7 @@ - substring similarity + substring similarity @@ -511,7 +554,7 @@ - substring test + substring test @@ -521,8 +564,8 @@ - named entity recognition - term extraction + named entity recognition + term extraction @@ -531,7 +574,7 @@ - threshold filtering + threshold filtering @@ -541,7 +584,7 @@ - token-based distance + token-based distance @@ -551,7 +594,18 @@ - tokenization + tokenization + + + + + + + + + A matching process that utilizes transformer models, which are a type of deep learning model architecture designed to handle sequential data, particularly for natural language processing tasks. + Matches between entities are established based on the contextual relationships learned by the transformer from large datasets. + transformer-based matching process @@ -560,9 +614,9 @@ - A matching process based on an unspecified comparison. - A mapping between a subject and an object was established, but it is no longer clear how or why. - unspecified matching process + A matching process based on an unspecified comparison. + A mapping between a subject and an object was established, but it is no longer clear how or why. + unspecified matching process @@ -572,11 +626,11 @@ - ngram similarity + ngram similarity - + diff --git a/semapv-terms.tsv b/semapv-terms.tsv index af278c7..de027c4 100644 --- a/semapv-terms.tsv +++ b/semapv-terms.tsv @@ -17,6 +17,7 @@ semapv:MappingInversion mapping inversion-based matching process A matching proc semapv:MappingReview mapping review A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct. A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation. semapv:MappingActivity semapv:ManualMappingCuration manual mapping curation An matching process that is performed by a human agent and is based on human judgement and domain knowledge. A human curator determines that a subject should be mapped to an object by virtue of their domain expertise. semapv:Matching semapv:MatchingPreprocessing matching process pre-processing A preliminary processing of inputs prior to performing matching. semapv:MappingActivity +semapv:Normalization normalization A family of preprocessing techniques for reducing strings to be compared to a common format. https://doi.org/10.1007/978-3-642-38721-0 semapv:MatchingPreprocessing semapv:Stemming stemming The process of reducing a word to its word stem. semapv:Normalization semapv:Lemmatization lemmatization The process of reducing the different forms of a word to one single form. https://doi.org/10.1007/978-3-642-38721-0 semapv:Normalization semapv:StopWordRemoval stop-word removal A preprocessing method transforming a string by removing a list of stop words. https://doi.org/10.1007/978-3-642-38721-0 The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping. semapv:Normalization @@ -27,10 +28,9 @@ semapv:ThresholdFiltering threshold filtering semapv:MatchingPostProcessing semapv:StableMarriageFiltering stable marriage filtering semapv:MatchingPostProcessing semapv:CardinalityFiltering cardinality filtering semapv:MatchingPostProcessing semapv:LogicalConsistencyFiltering logical consistency filtering semapv:MatchingPostProcessing -semapv:Normalization normalization A family of preprocessing techniques for reducing strings to be compared to a common format. https://doi.org/10.1007/978-3-642-38721-0 semapv:MatchingPreprocessing semapv:CaseNormalization case normalization A normalization technique converting each alphabetic character in a string to their lower case counterpart. https://doi.org/10.1007/978-3-642-38721-0 The string 'Ontology' is normalised to 'ontology' (lower case). semapv:Normalization semapv:DiacriticsSuppression diacritics suppression A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements. https://doi.org/10.1007/978-3-642-38721-0 The string 'Protégé' is normalised to 'Protege'. semapv:Normalization -semapv:BlankNormalisation blank normalisation A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. https://doi.org/10.1007/978-3-642-38721-0 The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). semapv:Normalization +semapv:BlankNormalisation blank normalisation A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. https://doi.org/10.1007/978-3-642-38721-0 The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). semapv:Normalization semapv:LinkStripping link stripping A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks. https://doi.org/10.1007/978-3-642-38721-0 The string 'alzheimer's' becomes 'alzheimers'. semapv:Normalization semapv:DigitSuppression digit suppression A normalization technique removing all numbers in a string. https://doi.org/10.1007/978-3-642-38721-0 The string 'ontology1' becomes 'ontology'. semapv:Normalization semapv:PunctuationElemination punctuation elimination A normalization technique removing all punctation characters from a string. https://doi.org/10.1007/978-3-642-38721-0 The string 'e.g.' becomes 'eg'. semapv:Normalization @@ -46,4 +46,9 @@ semapv:LevenshteinEditDistance levenshtein distance https://doi.org/10.1007/978 semapv:TokenBasedDistance token-based distance https://doi.org/10.1007/978-3-642-38721-0 semapv:StringBasedSimilarityMeasure semapv:LanguageBasedSimilarityMeasure language-based similarity measure https://doi.org/10.1007/978-3-642-38721-0 semapv:SimilarityMeasure semapv:Tokenization tokenization https://doi.org/10.1007/978-3-642-38721-0 semapv:Normalization -semapv:TermExtraction term extraction https://doi.org/10.1007/978-3-642-38721-0 named entity recognition semapv:Normalization \ No newline at end of file +semapv:TermExtraction term extraction https://doi.org/10.1007/978-3-642-38721-0 named entity recognition semapv:Normalization +semapv:BoundedPathMatching bounded path matching A matching process based on the comparison of matched super and subclasses (paths) of two entities. semapv:StructuralMatching +semapv:TransformerBasedMatching transformer-based matching process A matching process that utilizes transformer models, which are a type of deep learning model architecture designed to handle sequential data, particularly for natural language processing tasks. Matches between entities are established based on the contextual relationships learned by the transformer from large datasets. semapv:Matching +semapv:LLMBasedMatching LLM-based matching process A matching process that employs large language models (LLMs) which are pre-trained on vast amounts of text data and can understand and generate human-like text, making them suitable for tasks requiring a deep understanding of language. Matches between entities are determined through the language understanding capabilities of LLMs, such as semantic context and language inference. semapv:Matching +semapv:MachineLearningBasedMatching machine learning-based matching process A matching process that involves machine learning algorithms which learn from data to find patterns or make decisions with minimal human intervention. Matches between entities are made by applying learned models to data points to predict similarities or relationships. semapv:Matching +semapv:GraphRepresentationLearningBasedMatching graph representation learning-based matching process A matching process that uses graph representation learning which is a method in machine learning that focuses on learning a compact representation for graphs, capturing their structural information. Matches between entities are identified by analyzing the learned representations that encode the structural features and relationships within graph data. semapv:Matching \ No newline at end of file diff --git a/semapv.owl b/semapv.owl index eb1639c..b215ea4 100644 --- a/semapv.owl +++ b/semapv.owl @@ -2,6 +2,7 @@ + xmlns:semapv_voc="https://w3id.org/semapv/vocab/"> - - The Semantic Mapping Vocabulary provides and defines terms used for creating and maintaining semantic mappings, in particular mapping metadata. - - Semantic Mapping Vocabulary + + The Semantic Mapping Vocabulary provides and defines terms used for creating and maintaining semantic mappings, in particular mapping metadata. + + Semantic Mapping Vocabulary @@ -69,10 +70,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A match where the object is a broader concept than the subject. - broad match + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A match where the object is a broader concept than the subject. + broad match @@ -81,10 +82,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A match where the subject and object are sufficiently similar that they can be used interchangeably in some information retrieval applications. - close match + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A match where the subject and object are sufficiently similar that they can be used interchangeably in some information retrieval applications. + close match @@ -99,10 +100,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A match where the subject and object are sufficiently similar, with a high degree of confidence, that they can be used interchangeably across a wide range of information retrieval applications. - exact match + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A match where the subject and object are sufficiently similar, with a high degree of confidence, that they can be used interchangeably across a wide range of information retrieval applications. + exact match @@ -118,10 +119,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A mapping (alignment) link between two concepts. - mapping relation + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A mapping (alignment) link between two concepts. + mapping relation @@ -129,10 +130,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A match where the object is a narrower concept that the subject. - narrow match + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A match where the object is a narrower concept that the subject. + narrow match @@ -147,10 +148,10 @@ - https://www.w3.org/TR/skos-reference/ - http://www.w3.org/2004/02/skos/core# - A match where the subject and the object are inherently related, but where none is in any way more general than the other. - related match + https://www.w3.org/TR/skos-reference/ + http://www.w3.org/2004/02/skos/core# + A match where the subject and the object are inherently related, but where none is in any way more general than the other. + related match @@ -159,10 +160,10 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a broader concept. - FBbt:00007091 (D. melanogaster “subperineurial glial sheath”) and UBERON:0000202 (taxon-neutral “glial blood brain barrier”) are a cross-species broad match. - cross-species broad match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a broader concept. + FBbt:00007091 (D. melanogaster “subperineurial glial sheath”) and UBERON:0000202 (taxon-neutral “glial blood brain barrier”) are a cross-species broad match. + cross-species broad match @@ -171,9 +172,9 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject and the object belong in different taxonomic groupings, but refer to concepts similar enough that they can be used interchangeably. - cross-species close match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject and the object belong in different taxonomic groupings, but refer to concepts similar enough that they can be used interchangeably. + cross-species close match @@ -183,10 +184,10 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject is considered analogous to the object in a different taxonomic grouping. - FBbt:00005074 (D. melanogaster “muscle cell”) and CL:0000187 (taxon-neutral “muscle cell”) are a cross-species exact match. - cross-species exact match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject is considered analogous to the object in a different taxonomic grouping. + FBbt:00005074 (D. melanogaster “muscle cell”) and CL:0000187 (taxon-neutral “muscle cell”) are a cross-species exact match. + cross-species exact match @@ -196,9 +197,9 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a narrower concept. - cross-species narrow match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a narrower concept. + cross-species narrow match @@ -207,9 +208,9 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject is isomorphic to the object, i.e. considered of identical or similar form, shape, or structure. - isomorphic match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject is isomorphic to the object, i.e. considered of identical or similar form, shape, or structure. + isomorphic match @@ -218,9 +219,9 @@ - https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 - A match where the subject cannot be considered isomorphic to the object, i.e. considered of identical or similar form, shape, or structure. The object corresponds to exactly one subject in the subject_source. - non-isomorphic match + https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3 + A match where the subject cannot be considered isomorphic to the object, i.e. considered of identical or similar form, shape, or structure. The object corresponds to exactly one subject in the subject_source. + non-isomorphic match @@ -241,10 +242,10 @@ - - A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching. - A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge. - background knowledge-based matching process + + A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching. + A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge. + background knowledge-based matching process @@ -253,10 +254,20 @@ - - A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. - The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). - blank normalisation + + A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character. + The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space). + blank normalisation + + + + + + + + + A matching process based on the comparison of matched super and subclasses (paths) of two entities. + bounded path matching @@ -265,7 +276,7 @@ - cardinality filtering + cardinality filtering @@ -274,10 +285,10 @@ - - A normalization technique converting each alphabetic character in a string to their lower case counterpart. - The string 'Ontology' is normalised to 'ontology' (lower case). - case normalization + + A normalization technique converting each alphabetic character in a string to their lower case counterpart. + The string 'Ontology' is normalised to 'ontology' (lower case). + case normalization @@ -286,9 +297,9 @@ - A matching process based on multiple, possibly intertwined, matching approaches. - An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural. - composite matching process + A matching process based on multiple, possibly intertwined, matching approaches. + An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural. + composite matching process @@ -297,10 +308,10 @@ - - A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements. - The string 'Protégé' is normalised to 'Protege'. - diacritics suppression + + A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements. + The string 'Protégé' is normalised to 'Protege'. + diacritics suppression @@ -309,10 +320,10 @@ - - A normalization technique removing all numbers in a string. - The string 'ontology1' becomes 'ontology'. - digit suppression + + A normalization technique removing all numbers in a string. + The string 'ontology1' becomes 'ontology'. + digit suppression @@ -321,8 +332,19 @@ - - edit distance + + edit distance + + + + + + + + + A matching process that uses graph representation learning which is a method in machine learning that focuses on learning a compact representation for graphs, capturing their structural information. + Matches between entities are identified by analyzing the learned representations that encode the structural features and relationships within graph data. + graph representation learning-based matching process @@ -331,8 +353,8 @@ - - hamming distance + + hamming distance @@ -341,10 +363,21 @@ - - A matching process based on individual representations (or instances). - A match between a subject A and an object B was established because they share the same instances. - instance-based matching process + + A matching process based on individual representations (or instances). + A match between a subject A and an object B was established because they share the same instances. + instance-based matching process + + + + + + + + + A matching process that employs large language models (LLMs) which are pre-trained on vast amounts of text data and can understand and generate human-like text, making them suitable for tasks requiring a deep understanding of language. + Matches between entities are determined through the language understanding capabilities of LLMs, such as semantic context and language inference. + LLM-based matching process @@ -353,8 +386,8 @@ - - language-based similarity measure + + language-based similarity measure @@ -363,9 +396,9 @@ - - The process of reducing the different forms of a word to one single form. - lemmatization + + The process of reducing the different forms of a word to one single form. + lemmatization @@ -374,8 +407,8 @@ - - levenshtein distance + + levenshtein distance @@ -384,9 +417,9 @@ - A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object. - The label of a subject entity matches to an exact synonym of an object entity. - lexical matching process + A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object. + The label of a subject entity matches to an exact synonym of an object entity. + lexical matching process @@ -395,9 +428,9 @@ - A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm. - A match between a subject and an object was established because they had a Levenshtein score higher than 0.8. - lexical similarity threshold-based matching process + A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm. + A match between a subject and an object was established because they had a Levenshtein score higher than 0.8. + lexical similarity threshold-based matching process @@ -406,10 +439,10 @@ - - A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks. - The string 'alzheimer's' becomes 'alzheimers'. - link stripping + + A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks. + The string 'alzheimer's' becomes 'alzheimers'. + link stripping @@ -418,7 +451,7 @@ - logical consistency filtering + logical consistency filtering @@ -427,9 +460,20 @@ - A matching process based on the inferences made by a logical reasoner. - Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK. - logical reasoning process + A matching process based on the inferences made by a logical reasoner. + Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK. + logical reasoning process + + + + + + + + + A matching process that involves machine learning algorithms which learn from data to find patterns or make decisions with minimal human intervention. + Matches between entities are made by applying learned models to data points to predict similarities or relationships. + machine learning-based matching process @@ -438,9 +482,9 @@ - An matching process that is performed by a human agent and is based on human judgement and domain knowledge. - A human curator determines that a subject should be mapped to an object by virtue of their domain expertise. - manual mapping curation + An matching process that is performed by a human agent and is based on human judgement and domain knowledge. + A human curator determines that a subject should be mapped to an object by virtue of their domain expertise. + manual mapping curation @@ -448,9 +492,9 @@ - A triple <s,p,o> comprising a subject entity s, an object entity o and a mapping predicate p. - The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate. - mapping + A triple <s,p,o> comprising a subject entity s, an object entity o and a mapping predicate p. + The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate. + mapping @@ -458,9 +502,9 @@ - A process that relates to the creation, confirmation, rejection or curation of a mapping. - Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping. - mapping activity + A process that relates to the creation, confirmation, rejection or curation of a mapping. + Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping. + mapping activity @@ -469,9 +513,9 @@ - A matching process based on the traversing of multiple mappings. - A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B. - mapping chaining-based matching process + A matching process based on the traversing of multiple mappings. + A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B. + mapping chaining-based matching process @@ -480,14 +524,14 @@ - - mapping flipping - mapping inversion - mapping inverting - semantic walk-backs - A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate. - A broad match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation. - mapping inversion-based matching process + + mapping flipping + mapping inversion + mapping inverting + semantic walk-backs + A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate. + A broad match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation. + mapping inversion-based matching process @@ -496,9 +540,9 @@ - A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct. - A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation. - mapping review + A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct. + A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation. + mapping review @@ -507,11 +551,11 @@ - matching operation - matching task - An process that results in a mapping between a subject and an object entity. - The label of a subject entity matches to an exact synonym of an object entity. - matching process + matching operation + matching task + An process that results in a mapping between a subject and an object entity. + The label of a subject entity matches to an exact synonym of an object entity. + matching process @@ -520,8 +564,8 @@ - A process applied to a set of mappings, usually with the intention of changing it. - matching process post-processing + A process applied to a set of mappings, usually with the intention of changing it. + matching process post-processing @@ -530,8 +574,8 @@ - A preliminary processing of inputs prior to performing matching. - matching process pre-processing + A preliminary processing of inputs prior to performing matching. + matching process pre-processing @@ -540,9 +584,9 @@ - - A family of preprocessing techniques for reducing strings to be compared to a common format. - normalization + + A family of preprocessing techniques for reducing strings to be compared to a common format. + normalization @@ -551,10 +595,10 @@ - - A normalization technique removing all punctation characters from a string. - The string 'e.g.' becomes 'eg'. - punctuation elimination + + A normalization technique removing all punctation characters from a string. + The string 'e.g.' becomes 'eg'. + punctuation elimination @@ -563,9 +607,9 @@ - A preprocessing method transforming a string by matching a regular expression and then removing that match. - The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping. - regex removal + A preprocessing method transforming a string by matching a regular expression and then removing that match. + The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping. + regex removal @@ -574,9 +618,9 @@ - A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string. - The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping. - regex replacement + A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string. + The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping. + regex replacement @@ -585,9 +629,9 @@ - A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm. - A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses. - semantic similarity threshold-based matching process + A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm. + A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses. + semantic similarity threshold-based matching process @@ -595,9 +639,9 @@ - - A technique for determining a score that characterises the similarity between two entities. - similarity measure + + A technique for determining a score that characterises the similarity between two entities. + similarity measure @@ -606,7 +650,7 @@ - stable marriage filtering + stable marriage filtering @@ -615,8 +659,8 @@ - The process of reducing a word to its word stem. - stemming + The process of reducing a word to its word stem. + stemming @@ -625,10 +669,10 @@ - - A preprocessing method transforming a string by removing a list of stop words. - The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping. - stop-word removal + + A preprocessing method transforming a string by removing a list of stop words. + The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping. + stop-word removal @@ -637,9 +681,9 @@ - - A similarity measure based on the comparison of strings. - string-based similarity measure + + A similarity measure based on the comparison of strings. + string-based similarity measure @@ -648,9 +692,9 @@ - - A string-based similarity measure which involves determining if two strings associated with mapping entities are equal. - string equality + + A string-based similarity measure which involves determining if two strings associated with mapping entities are equal. + string equality @@ -659,10 +703,10 @@ - - Structural matching does not involve looking at "values" of properties. - A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct property of a class. - structural matching process + + Structural matching does not involve looking at "values" of properties. + A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct property of a class. + structural matching process @@ -671,8 +715,8 @@ - - substring similarity + + substring similarity @@ -681,8 +725,8 @@ - - substring test + + substring test @@ -691,9 +735,9 @@ - - named entity recognition - term extraction + + named entity recognition + term extraction @@ -702,7 +746,7 @@ - threshold filtering + threshold filtering @@ -711,8 +755,8 @@ - - token-based distance + + token-based distance @@ -721,8 +765,19 @@ - - tokenization + + tokenization + + + + + + + + + A matching process that utilizes transformer models, which are a type of deep learning model architecture designed to handle sequential data, particularly for natural language processing tasks. + Matches between entities are established based on the contextual relationships learned by the transformer from large datasets. + transformer-based matching process @@ -731,9 +786,9 @@ - A matching process based on an unspecified comparison. - A mapping between a subject and an object was established, but it is no longer clear how or why. - unspecified matching process + A matching process based on an unspecified comparison. + A mapping between a subject and an object was established, but it is no longer clear how or why. + unspecified matching process @@ -742,12 +797,12 @@ - - ngram similarity + + ngram similarity - + From 49d9ecdf8d797326bb715655d93faedd6306d03c Mon Sep 17 00:00:00 2001 From: Nico Matentzoglu Date: Wed, 8 Nov 2023 13:01:08 +0200 Subject: [PATCH 2/2] Update documentation --- docs/index.html | 949 +++++++++++++++++++++++++++++++----------------- 1 file changed, 623 insertions(+), 326 deletions(-) diff --git a/docs/index.html b/docs/index.html index d92cff4..ed607b6 100644 --- a/docs/index.html +++ b/docs/index.html @@ -256,6 +256,17 @@ .setclass { list-style-type: none; } + +code{ + word-wrap: break-word; + } + table { + table-layout: fixed; + width: 100%; + } + td { + word-wrap: break-word; + } @@ -302,13 +313,13 @@

Metadata

- Title + Title

Semantic Mapping Vocabulary

- License + License
https://creativecommons.org/publicdomain/zero/1.0/ @@ -316,15 +327,15 @@

Metadata

- Version Iri + Version Iri
- ns2:semapv.owl + ns2:semapv.owl
- Description + Description

The Semantic Mapping Vocabulary provides and defines terms used for creating and maintaining semantic mappings, in particular mapping metadata.

@@ -345,19 +356,21 @@

background knowledge-based matching process - Description + Description

A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching.

- Example + Example -

A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge.

+ +
A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge.
+ - Source + Source ns3:j.websem.2018.04.001 @@ -365,11 +378,11 @@

background knowledge-based matching process - Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c @@ -389,19 +402,21 @@

blank normalisation - Description + Description

A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character.

- Example + Example -

The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space).

+ +
The string 'ontology matching' (two spaces between 'ontology' and 'matching) becomes 'ontology matching' (one space).
+ - Source + Source ns4:978-3-642-38721-0 @@ -409,11 +424,41 @@

blank normalisation - Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization + c + + + + + +
+

bounded path matching + c +

+ + + + + + + + + + + + @@ -433,11 +478,11 @@

cardinality filtering

@@ -457,19 +502,21 @@

case normalization

- + @@ -501,23 +548,25 @@

composite matching process

- + @@ -537,19 +586,21 @@

diacritics suppression

- + @@ -581,19 +632,21 @@

digit suppression

- + @@ -625,7 +678,7 @@

edit distance

+ + +
IRI + https://w3id.org/semapv/vocab/BoundedPathMatching +
+ Description +

A matching process based on the comparison of matched super and subclasses (paths) of two entities.

+ Sub Class Of + + + semapv_voc:StructuralMatching c
- Sub Class Of + Sub Class Of - sempav_voc:MatchingPostProcessing + semapv_voc:MatchingPostProcessing c
- Description + Description

A normalization technique converting each alphabetic character in a string to their lower case counterpart.

- Example + Example

The string 'Ontology' is normalised to 'ontology' (lower case).

+
The string 'Ontology' is normalised to 'ontology' (lower case).
+
- Source + Source ns4:978-3-642-38721-0 @@ -477,11 +524,11 @@

case normalization

- Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization c
- Description + Description

A matching process based on multiple, possibly intertwined, matching approaches.

- Example + Example

An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural.

+
An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural.
+
- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c
- Description + Description

A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements.

- Example + Example

The string 'Protégé' is normalised to 'Protege'.

+
The string 'Protégé' is normalised to 'Protege'.
+
- Source + Source ns4:978-3-642-38721-0 @@ -557,11 +608,11 @@

diacritics suppression

- Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization c
- Description + Description

A normalization technique removing all numbers in a string.

- Example + Example

The string 'ontology1' becomes 'ontology'.

+
The string 'ontology1' becomes 'ontology'.
+
- Source + Source ns4:978-3-642-38721-0 @@ -601,11 +654,11 @@

digit suppression

- Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization c
- Source + Source ns4:978-3-642-38721-0 @@ -633,22 +686,60 @@

edit distance

- Sub Class Of + Sub Class Of - sempav_voc:StringBasedSimilarityMeasure + semapv_voc:StringBasedSimilarityMeasure c
- Super Class Of + Super Class Of + + + semapv_voc:LevenshteinEditDistance + c + +
+
+
+

graph representation learning-based matching process + c +

+ + + + + + + + + + + + + + + @@ -668,7 +759,7 @@

hamming distance

@@ -700,19 +791,21 @@

instance-based matching process

- + + +
IRI + https://w3id.org/semapv/vocab/GraphRepresentationLearningBasedMatching +
+ Description +

A matching process that uses graph representation learning which is a method in machine learning that focuses on learning a compact representation for graphs, capturing their structural information.

+ Example + +
Matches between entities are identified by analyzing the learned representations that encode the structural features and relationships within graph data.
+
+ Sub Class Of - sempav_voc:LevenshteinEditDistance + semapv_voc:Matching c
- Source + Source ns4:978-3-642-38721-0 @@ -676,11 +767,11 @@

hamming distance

- Sub Class Of + Sub Class Of - sempav_voc:StringBasedSimilarityMeasure + semapv_voc:StringBasedSimilarityMeasure c
- Description + Description

A matching process based on individual representations (or instances).

- Example + Example

A match between a subject A and an object B was established because they share the same instances.

+
A match between a subject A and an object B was established because they share the same instances.
+
- Source + Source ns4:978-3-642-38721-0 @@ -720,11 +813,49 @@

instance-based matching process

- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching + c + +
+
+
+

LLM-based matching process + c +

+ + + + + + + + + + + + + + + + @@ -744,7 +875,7 @@

language-based similarity measure

@@ -776,13 +907,13 @@

lemmatization

@@ -814,7 +945,7 @@

levenshtein distance

@@ -846,23 +977,25 @@

lexical matching process

- + @@ -882,23 +1015,25 @@

lexical similarity threshold-based matching process

- + @@ -918,19 +1053,21 @@

link stripping

- + @@ -962,11 +1099,11 @@

logical consistency filtering

@@ -986,23 +1123,63 @@

logical reasoning process

+ + + + + + +
IRI + https://w3id.org/semapv/vocab/LLMBasedMatching +
+ Description +

A matching process that employs large language models (LLMs) which are pre-trained on vast amounts of text data and can understand and generate human-like text, making them suitable for tasks requiring a deep understanding of language.

+ Example + +
Matches between entities are determined through the language understanding capabilities of LLMs, such as semantic context and language inference.
+
+ Sub Class Of + + + semapv_voc:Matching c
- Source + Source ns4:978-3-642-38721-0 @@ -752,11 +883,11 @@

language-based similarity measure

- Sub Class Of + Sub Class Of - sempav_voc:SimilarityMeasure + semapv_voc:SimilarityMeasure c
- Description + Description

The process of reducing the different forms of a word to one single form.

- Source + Source ns4:978-3-642-38721-0 @@ -790,11 +921,11 @@

lemmatization

- Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization c
- Source + Source ns4:978-3-642-38721-0 @@ -822,11 +953,11 @@

levenshtein distance

- Sub Class Of + Sub Class Of - sempav_voc:EditDistance + semapv_voc:EditDistance c
- Description + Description

A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object.

- Example + Example

The label of a subject entity matches to an exact synonym of an object entity.

+
The label of a subject entity matches to an exact synonym of an object entity.
+
- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c
- Description + Description

A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm.

- Example + Example

A match between a subject and an object was established because they had a Levenshtein score higher than 0.8.

+
A match between a subject and an object was established because they had a Levenshtein score higher than 0.8.
+
- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c
- Description + Description

A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks.

- Example + Example

The string 'alzheimer's' becomes 'alzheimers'.

+
The string 'alzheimer's' becomes 'alzheimers'.
+
- Source + Source ns4:978-3-642-38721-0 @@ -938,11 +1075,11 @@

link stripping

- Sub Class Of + Sub Class Of - sempav_voc:Normalization + semapv_voc:Normalization c
- Sub Class Of + Sub Class Of - sempav_voc:MatchingPostProcessing + semapv_voc:MatchingPostProcessing c
- Description + Description

A matching process based on the inferences made by a logical reasoner.

- Example + Example + +
Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK.
+
+ Sub Class Of + + + semapv_voc:Matching + c + +
+
+
+

machine learning-based matching process + c +

+ + + + + + + + + + + - + @@ -1022,23 +1199,25 @@

manual mapping curation

- + @@ -1058,15 +1237,17 @@

mapping

- +
IRI + https://w3id.org/semapv/vocab/MachineLearningBasedMatching +
+ Description +

A matching process that involves machine learning algorithms which learn from data to find patterns or make decisions with minimal human intervention.

+ Example

Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK.

+
Matches between entities are made by applying learned models to data points to predict similarities or relationships.
+
- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c
- Description + Description

An matching process that is performed by a human agent and is based on human judgement and domain knowledge.

- Example + Example

A human curator determines that a subject should be mapped to an object by virtue of their domain expertise.

+
A human curator determines that a subject should be mapped to an object by virtue of their domain expertise.
+
- Sub Class Of + Sub Class Of - sempav_voc:Matching + semapv_voc:Matching c
- Description + Description

A triple comprising a subject entity s, an object entity o and a mapping predicate p.

- Example + Example

The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate.

+
The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate.
+
@@ -1083,43 +1264,45 @@

mapping activity - Description + Description

A process that relates to the creation, confirmation, rejection or curation of a mapping.

- Example + Example -

Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping.

+ +
Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping.
+ - Super Class Of + Super Class Of