Functions

corpkit contains a small set of standalone functions.

as_regex

corpkit.other.as_regex(lst, boundaries='w', case_sensitive=False, inverse=False, compile=False)[source]

Turns a wordlist into an uncompiled regular expression

Parameters:
  • lst (list) – A wordlist to convert
  • boundaries (str -- 'word'/'line'/'space'; tuple -- (leftboundary, rightboundary)) –
  • case_sensitive (bool) – Make regular expression case sensitive
  • inverse (bool) – Make regular expression inverse matching
Returns:

regular expression as string

load

corpkit.other.load(savename, loaddir='saved_interrogations')[source]

Load saved data into memory:

>>> loaded = load('interro')

will load ./saved_interrogations/interro.p as loaded

Parameters:
  • savename (str) – Filename with or without extension
  • loaddir (str) – Relative path to the directory containg savename
  • only_concs (bool) – Set to True if loading concordance lines
Returns:

loaded data

load_all_results

corpkit.other.load_all_results(data_dir='saved_interrogations', **kwargs)[source]

Load every saved interrogation in data_dir into a dict:

>>> r = load_all_results()
Parameters:data_dir (str) – path to saved data
Returns:dict with filenames as keys

new_project

corpkit.other.new_project(name, loc='.', **kwargs)[source]

Make a new project in loc.

Parameters:
  • name (str) – A name for the project
  • loc (str) – Relative path to directory in which project will be made
Returns:

None