Main entities in Smabbler
MODEL (or TEXT MODEL)
– a configuration of the Smabbler engine consisting of query commands and specifying which information (TEXT FEATURE) to extract from text. Can be one of the predefined models (PUBLIC), or a user-defined model (PRIVATE). Each text model consists of one or more QUERY(ies)
QUERY
– is a command to the Graph Language Model to extract specified TEXT FEATURE. Query building is available via QueryLab.
CONCEPT
– a single word, topic, phrase, or value that a TEXT MODEL can extract. For example, {date value} or {colors} can be a concept, but they can also be more specific e.g. {problem description} or {health conditions}.
CONTEXT
– a single word, topic, phrase, or value that specifies a CONCEPT(s). For example, {climate} can be a concept and {emotions} can be a context.
TEXT FEATURE
– a label, annotation or classification. An output of Smabbler’s text processing, can be used as an input for later stages in the data pipeline, or consumed directly. Note, for each input text multiple text features can be produced.
QUERY LAB (text model builder)
– a configuration mechanism in Smabbler UI allowing construction of new text models. QueryLab provides an easy way to create Graph Language Model queries by choosing nodes from Galaxia graph or providing own concepts until all desired text feature criteria are selected.
GALAXIA
– graph language model (GLM). It is prepared to recognize and extracts information from text. Galaxia is a foundation model, on top of which other text models and applications can be built.
NODES
– data points in the Galaxia graph, that represent words, phrases, definitions.
EDGES
– connection between Galaxia nodes. They can populate / transfer features between nodes, not necessarily linked by direct connection.