Treebank search¶
-
class
chaininglib.search.TreebankQuery.
TreebankQuery
(resource)[source]¶ A query on a treebank.
-
kwic
(align_lemma=None, align_pos=None, align_wordform=None)[source]¶ Get the results (as Pandas DataFrame) of a treebank search, with one keyword in context (KWIC) per row Beware: if no align_lemma etc is given, there will be no alignment :param align_lemma: the lemma to align all results with (it then behaves as a hit) :param align_pos: same with pos :param align_wordform: same with wordform
Returns: Pandas DataFrame >>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search() >>> df = treebank_obj.kwic()
-
search
()[source]¶ Perform a treebank search :returns: TreebankQuery object
>>> # build a treebank search query >>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
-
-
chaininglib.search.TreebankQuery.
create_treebank
(name)[source]¶ API constructor :param name: Name of the treebank (eg. ‘cgn’, ‘lassy’, …)
Returns: TreebankQuery object >>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search() >>> df = treebank_obj.kwic()
-
chaininglib.search.TreebankQuery.
get_available_treebanks
()[source]¶ This function returns the list of the available treebanks
Returns: list of treebanks names strings
-
chaininglib.search.treebankQueries.
treebank_query
(lemma=None, word=None, pos=None)[source]¶ This function builds a query for getting occurances of a given lemma within a treebank
Parameters: - lemma – a lemma to look for
- word – wordform to look for
- pos – POS tag to look for
Returns: a treebank query string
>>> tb = create_treebank().word("kat") >>> df_trees = tb.search().kwic() >>> display(df_trees)