Glossary
Aggregate
The task of collecting a set of values to return a single value.
Aggregator node
A node that routes queries to the leaf nodes, aggregates intermediate results, and sends the results back to a client.
Approximate Nearest Neighbor
A technique used in computational geometry and machine learning to find the approximate nearest neighbors in high-dimensional spaces.
Background merger
An algorithm used by SingleStore that allows columnstore segments to maintain sort order (as close as possible), while data is being ingested or updated.
Binary Javascript Object Notation
A binary-encoded serialization of JSON-like documents.
Centroids
A point used in clustering algorithms to represent the center or mean of a cluster, a group of vectors near each other.
Child aggregator
A node that can be promoted to the role of Master Aggregator in the event that the existing Master Aggregator’s host fails.
Cluster
A collection of SingleStore aggregator and leaf nodes.
Code generation
An industrial compiler to produce highly efficient machine code that enables low-level optimizations, which are not possible when executing queries via interpretation alone.
Common Table Expression
A named temporary result set that exists within the scope of a single statement and that can be referred to later within that statement, possibly multiple times.
Database branching
A feature that creates private, independent copies of a database including all of its data.
Database user
A user that resides in the workspace.
Deadlock
A situation when two or more transactions mutually hold and request a table write that the other transaction needs.
Deterministic
An operation or function that will always produce the same result for the same input values.
Full backup
A stored and complete copy of database.
Garbage Collection
The process where unneeded versioned nodes for multi-version concurrency control (MVCC) are eliminated.
Globbing
Globbing is commonly used in data ingest to read or select a subset of files based on a naming pattern.
Hadoop Distributed File System
A distributed file system by Apache Hadoop.
Hash index
A data structure optimized for fast equality lookups by a key.
Hierarchical data
A set of data items that are related to each other by hierarchical relationships.
Hierarchical Navigable Small World
An algorithm used for approximate nearest neighbor (ANN) search, particularly in high-dimensional spaces.
High-dimensional spaces
Datasets with a large number of features or attributes, where each feature represents a separate dimension.
Host
A hardware or virtual machine which holds the aggregator and leaf nodes that comprise a SingleStore cluster.
Incremental backup
A stored copy of data that only the data stores what has been modified since the most recent backup.
Information schema
Holds the information or metadata for all the databases in a cluster.
Inline view
A SELECT
statement embedded in the FROM
or WITH
clause of another SELECT
statement that creates a temporary table that is operated on by the outer query.
In-place change
An in-place change in a database refers to modifying the data or schema of the database without requiring a full data migration or recreation of the database.
Inverted file with product quantization
A method used for approximate nearest neighbor search in large-scale datasets, particularly in high-dimensional spaces.
Javascript Object Notation
An open standard file format and data interchange format that uses human-readable text to store and transmit data objects consisting of attribute–value pairs and arrays.
JSON Web Key Sets
A set of keys containing public keys that can be used to authenticate any JWT.
JSON Web Token
An open, industry standard typically used for authorization and information exchange.
Leaf node
A node that stores a subset of a cluster’s data.
Lock-free backups
Backup operations that do not block INSERT
, UPDATE
, and DELETE
operations during the backup.
Low Level Virtual Machine
It is a collection of modular and reusable compiler and toolchain technologies used for developing compiler frontends and backends.
The project has outgrown the original name, and now LLVM is just its name, not an acronym.
Master aggregator
A specialized node that’s responsible for cluster monitoring and failover.
Metadata
Information about a database's schema, access to the database, storage, built-in programs, and or other information about the data such as date created, file size, and etc.
Multi-Version Concurrency Control
A method used to increase transaction concurrency and reduce response time for read-only transactions by maintaining a history of versions of each row in a table.
Node
A SingleStore server.
Non-Uniform Memory Access
A computer memory design used in multiprocessing.
Normalized
Organizing data to appear similar across all records and fields.
Object store
A data storage architecture that manages data as objects, as opposed to other storage architectures like file systems which manages data as a file hierarchy, and block storage which manages data as blocks within sectors and tracks.
Online Analytical Processing
A data processing type that is designed to analyze data dimensions concurrently.
Online Transaction Processing
A data processing type that executes a number of transactions occurring concurrently.
Organization
Allows shared access to workspaces and related resources within a company or group.
Organization user
An organization user resides within the “control plane” of SingleStore and can perform those actions that are available on the Cloud Portal, such as manage organization users, manage clusters, and run SQL queries against cluster data via the Cloud Portal SQL editor.
Partition
A partition contains a subset (a shard) of a database’s data.
Persisting the name of a file
Persisting the name of a file is storing the file name so it can be retrieved and used again later in a program or system.
Pipelines
A feature that continuously loads data as it arrives from external sources.
Point-in-time recovery
A user-initiated operation that allows a set of data in a database to be recovered to a specific timestamp in the past.
Procedural SQL
A set of programming extensions for SingleStore that allow developers to write code in a procedural format.
Product Quantization
A technique used for vector compression.
An application that allows you to design, manage, and monitor your SingleStore instances.
user
A user that can log into the Cloud Portal and access portal services.
Query shape
Patterns or structures in query.
Random-Access Memory
A computer's short-term memory.
Replication
Ensures redundancy in a cluster.
Segment elimination
A process where metadata stored for columnstore segments is used to determine where a segment can match a filter queried at execution time.
Serializable isolation
An isolation level provides the strictest transaction isolation.
Shard
A subset of a databases's data.
Smart Distater Recovery
A process that handles the continuous asynchronous replication of data between a primary and a secondary region.
Solid-State Drive
A solid-state drive is a storage device.
Single sign-on
Single sign-on, where one can log into SingleStore via Azure AD, Okta, and PingOne.
Skiplist index
A data structure optimized for ordered data that allows for queries to quickly seek data by binary searching.
SQL surface area
The amount of components installed and or configuration options that are enabled.
Sharding
A type of database partitioning that divides a database into smaller more easily manageable parts.
Unlimited storage
An unlimited amount of storage space in the cloud that can be used to move data to seamlessly between memory, persistent cache, and storage.
A database whose size is not limited by the size of the persistent cache, but only by available external object storage.
Unlimited storage database
A database whose size is not limited by the size of the persistent cache, but only by available external object storage.
User-defined aggregate function
A callable routine that accepts input parameters, executes programmatic logic in the function body, and returns a scalar-type value.
User-defined function
A callable routine that accepts input parameters, executes programmatic logic, and returns a value (or values).
User-defined table-valued function
A callable routine that accepts input parameters, executes a single SELECT statement in the function body, and returns a single table-type value (similar to a view).
Workspace
SingleStore Helios’s version of a cluster.
Last modified: February 23, 2024