Signet Forge 0.1.0
C++20 Parquet library with AI-native extensions
DEMO
Loading...
Searching...
No Matches
forge.hpp
Go to the documentation of this file.
1// SPDX-License-Identifier: AGPL-3.0-or-later
2// Copyright 2026 Johnson Ogundeji
3#pragma once
4
28
31#include "signet/error.hpp"
32#include "signet/types.hpp"
33#include "signet/schema.hpp"
34#include "signet/statistics.hpp"
35#include "signet/memory.hpp"
41
49
52#include "signet/z_order.hpp"
54
63
73#if defined(SIGNET_ENABLE_COMMERCIAL) && SIGNET_ENABLE_COMMERCIAL
74#include "signet/crypto/pme.hpp"
76#endif
78
84
88#if !defined(_WIN32) && !defined(__EMSCRIPTEN__)
90#endif
92
99
102#include "signet/ai/wal.hpp"
103#ifndef __EMSCRIPTEN__
105#endif
107
113
120
123#if defined(SIGNET_ENABLE_COMMERCIAL) && SIGNET_ENABLE_COMMERCIAL
133#endif
135
142
145#include "signet/writer.hpp"
146#include "signet/reader.hpp"
AES-256 block cipher implementation (FIPS-197).
AES-256-CTR stream cipher implementation (NIST SP 800-38A).
AES-256-GCM authenticated encryption (NIST SP 800-38D).
Arrow C Data Interface interop for SignetStack Signet Forge.
BYTE_STREAM_SPLIT encoding and decoding (Parquet encoding type 9).
Abstract cipher interface, GCM/CTR adapters, CipherFactory, and platform CSPRNG.
Compression codec interface and registry for Signet Forge.
Column-major batch of feature rows for zero-copy tensor wrapping and WAL serialization in ML inferenc...
ColumnIndex, OffsetIndex, and ColumnIndexBuilder for predicate pushdown.
PLAIN-encoded Parquet column decoder.
PLAIN encoding writer for all Parquet physical types.
Thrift Compact Protocol encoder and decoder for Parquet metadata serialization.
DELTA_BINARY_PACKED encoding and decoding (Parquet encoding type 5).
Dictionary encoding and decoding for Parquet (PLAIN_DICTIONARY / RLE_DICTIONARY).
Multi-tier event bus for routing SharedColumnBatch events through three tiers.
Versioned ML feature store reader with point-in-time queries.
Versioned ML feature store writer for appending typed feature vectors to Parquet.
GZIP compression codec for Signet Forge (wraps zlib).
Key material, encryption configuration, and TLV serialization for Parquet Modular Encryption (PME).
LZ4 raw-block compression codec for Signet Forge (wraps liblz4).
Arena (bump-pointer) allocator for batch Parquet reads.
Memory-mapped I/O reader for Parquet files.
Lock-free bounded MPMC ring buffer based on Dmitry Vyukov's algorithm.
DLPack / NumPy interop for SignetStack Signet Forge tensor bridge.
ONNX Runtime interop for SignetStack Signet Forge tensor bridge.
Parquet Modular Encryption (PME) orchestrator – encrypts and decrypts Parquet file components (footer...
Post-quantum cryptography: Kyber-768 KEM, Dilithium-3 signatures, X25519 ECDH, and HybridKem (Kyber+X...
INT8/INT4 quantized vector storage for AI embeddings with SIMD acceleration.
RLE/Bit-Packing Hybrid encoding and decoding (Parquet spec).
Per-row lineage tracking (Iceberg V3-style) with monotonic row IDs, mutation versioning,...
Schema definition types: Column<T>, SchemaBuilder, and Schema.
SHA-256 hash function (NIST FIPS 180-4).
SHA-512 hash function (NIST FIPS 180-4).
Bundled, zero-dependency, header-only Snappy compression codec.
Split Block Bloom Filter as specified by the Apache Parquet format.
Per-column-chunk statistics tracker and little-endian byte helpers.
Lock-free SPSC/MPSC ring buffers, StreamingSink for background Parquet compaction,...
Zero-copy tensor bridge: maps Parquet column data directly into ML-framework-compatible tensor views ...
Parquet Thrift struct types – C++ structs matching parquet.thrift, with Compact Protocol serialize/de...
Parquet format enumerations, type traits, and statistics structs.
AI-native ML vector column type: VectorWriter/VectorReader for dense float embedding storage as FIXED...
Write-Ahead Log (WAL) with sub-millisecond append, CRC-32 integrity, crash recovery,...
Header-only xxHash64 implementation for Signet Forge.
Z-order curve (Morton code) encoding/decoding for multi-dimensional indexing.
ZSTD compression codec for Signet Forge (wraps libzstd).