Treebank search

class chaininglib.search.TreebankQuery.TreebankQuery(resource)[source]

A query on a treebank.

kwic(align_lemma=None, align_pos=None, align_wordform=None)[source]

Get the results of a treebank search as a Pandas DataFrame, with one keyword in context (KWIC) per row. Beware: if none of align_lemma, align_pos, or align_wordform is given, the results are not aligned.

Parameters:
  • align_lemma – the lemma to align all results with (it then behaves as a hit)
  • align_pos – the POS tag to align all results with
  • align_wordform – the wordform to align all results with

Returns: Pandas DataFrame
>>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
>>> df = treebank_obj.kwic()
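
To make the effect of alignment concrete, here is a minimal sketch of how one sentence could be split around an aligned lemma. It is not chaininglib's implementation: the token dictionaries, the `kwic_row` helper, and the `left`/`keyword`/`right` keys are all hypothetical and only illustrate the idea that the aligned lemma "behaves as a hit".

```python
from typing import Dict, List, Optional

# Hypothetical token shape, for illustration only.
Token = Dict[str, str]


def kwic_row(tokens: List[Token], align_lemma: Optional[str] = None) -> Dict[str, str]:
    """Split one sentence into left context, keyword, and right context."""
    if align_lemma is not None:
        for i, tok in enumerate(tokens):
            if tok["lemma"] == align_lemma:
                # The first token with the requested lemma becomes the keyword.
                return {
                    "left": " ".join(t["word"] for t in tokens[:i]),
                    "keyword": tok["word"],
                    "right": " ".join(t["word"] for t in tokens[i + 1:]),
                }
    # No alignment requested (or lemma not found): the sentence stays unsplit.
    return {"left": "", "keyword": "", "right": " ".join(t["word"] for t in tokens)}


sentence = [
    {"word": "de", "lemma": "de", "pos": "LID"},
    {"word": "katten", "lemma": "kat", "pos": "N"},
    {"word": "slapen", "lemma": "slapen", "pos": "WW"},
]
row = kwic_row(sentence, align_lemma="kat")
```

Without an alignment argument the sketch returns an empty keyword column, which mirrors the warning above that unaligned results have no hit to centre on.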
search()[source]

Perform a treebank search.

Returns: TreebankQuery object

>>> # build a treebank search query
>>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
trees()[source]

Get the results matching a treebank search query as nested objects.

Returns: list of tree objects

>>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
>>> list_of_trees = treebank_obj.trees()
xml()[source]

Get the unparsed XML response of a treebank search.

Returns: XML string

>>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
>>> xml_response = treebank_obj.xml()
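
Because the response is returned unparsed, the caller has to parse it. The sketch below uses the standard library's `xml.etree.ElementTree`. The sample document and its element and attribute names (`results`, `sentence`, `node`, `lemma`) are assumptions made for illustration; the actual schema returned by a treebank may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical response string; in practice this would come from treebank_obj.xml().
xml_response = """<results>
  <sentence id="s1">
    <node word="de" lemma="de" pos="det"/>
    <node word="kat" lemma="kat" pos="noun"/>
  </sentence>
</results>"""

# Parse the string and collect the lemma attribute of every <node> element.
root = ET.fromstring(xml_response)
lemmas = [node.get("lemma") for node in root.iter("node")]
```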
chaininglib.search.TreebankQuery.create_treebank(name)[source]

API constructor.

Parameters:
  • name – name of the treebank (e.g. ‘cgn’, ‘lassy’, …)

Returns: TreebankQuery object
>>> treebank_obj = create_treebank(some_treebank).pattern(some_pattern).search()
>>> df = treebank_obj.kwic()
chaininglib.search.TreebankQuery.get_available_treebanks()[source]

Return the list of available treebanks.

Returns: list of treebank names (strings)
chaininglib.search.treebankQueries.treebank_query(lemma=None, word=None, pos=None)[source]

Build a query for finding occurrences of a given lemma, wordform, or POS tag within a treebank.

Parameters:
  • lemma – a lemma to look for
  • word – wordform to look for
  • pos – POS tag to look for
Returns: a treebank query string

>>> tb = create_treebank().word("kat")
>>> df_trees = tb.search().kwic()
>>> display(df_trees)
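
The format of the returned query string is not specified here. As a rough sketch, assuming the treebank is queried with XPath over `node` elements that carry `lemma`, `word`, and `pos` attributes, the three optional criteria could be combined as follows. The helper `build_node_query` is hypothetical and is not part of chaininglib.

```python
from typing import Optional


def build_node_query(lemma: Optional[str] = None,
                     word: Optional[str] = None,
                     pos: Optional[str] = None) -> str:
    """Hypothetical helper: combine the given criteria into one XPath string."""
    criteria = [("lemma", lemma), ("word", word), ("pos", pos)]
    # Keep only the criteria that were actually supplied.
    # Note: values containing double quotes are not escaped in this sketch.
    predicates = [f'@{name}="{value}"' for name, value in criteria if value is not None]
    if not predicates:
        raise ValueError("give at least one of lemma, word, or pos")
    return "//node[" + " and ".join(predicates) + "]"


query = build_node_query(lemma="kat", pos="noun")
```

Joining the predicates with `and` means every supplied criterion must hold for the same node; whether the real function combines them this way is an assumption.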