2.4 — Silicon blocks (small, composable IP)
- SSM-ALU (lane ops). Three stages: clamp/map (
u := atanh(a_c)), compose (u' = u_f ⊕ u_gvia+,-, scale), inverse/re-clamp (a' := tanh(u')).tanh/atanhvia LUT+poly sized to keep|error_a| <= 1e-6. - SSACC tile (streaming). Registers
{U,W}, guarded divide, publisha_out := tanh( U / max(W, eps_w) ). One input per cycle; flush emits the fused lane. - Quality DMA (optional). Side-channel for A/U-BUS so legacy math can ignore
awhile SSM units consume it. - Determinism. Publish rounding occurs only at I/O boundaries; collapse always holds:
phi((m,a)) = m.
2.5 — ISA-like ops (scalar first, then SIMD)
- Mapping.
SATANH: a -> u;STANH: u -> a - Compose.
SUADD: u1,u2 -> u;SUSUB: u1,u2 -> u;SUSCALE: u,r -> u - Arithmetic (symmetric lane).
SMUL_SYM: (m1,a1),(m2,a2) -> (m1*m2, a_out)witha_out = (a1 + a2) / (1 + a1*a2)SDIV_SYM: (m_f,a_f),(m_g,a_g) -> (m_f/m_g, a_out)witha_out = (a_f - a_g) / (1 - a_f*a_g)(magnitude followsdivision_policy) - Streaming.
SSUM_UW: a,w⇒{U += w*atanh(clamp(a)); W += w};SFLUSH_UW⇒a_out = tanh(U / max(W, eps_w)) - Utility.
SCOLLAPSE: (m,a) -> m;SSCALE: (m,a),k -> (k*m, sign(k)*a)(then clamp) - SIMD. Vector forms of all ops plus fused-accumulate variants for dot/fold patterns.
2.6 — Memory map (knobs and manifests)
- Numerical knobs.
eps_a(e.g.,1e-6),eps_w(e.g.,1e-12),denom_soft_min,gamma. - Policies.
division_policy in {"strict","soft","meadow"},gate_enable, gate gaing_t. - Bands. Threshold table for
A++/A+/A0/A-/A--(overrides must be declared). - Tables. LUT IDs for
tanh/atanhwith max-error bounds. - Manifest (ASCII). Records knob values, build ID, and a conformance vector checksum.
- Determinism rule. Changing any knob or LUT invalidates bit-identical replay claims; emit a new manifest.
2.7 — Timing, latency, throughput (design targets)
u-space ops.u_add/u_sub/u_scalein ~1 cycle at MCU/FPGA DSP-class clocks.- Nonlinear maps.
tanh/atanhlatency set by LUT depth/poly degree (typ. 1–3 cycles). - SSACC. Per-sample accumulate in 1 cycle;
SFLUSH_UWdominated by divider +tanh. - Pipeline sketch.
L_total ≈ L_map + L_compose + L_inverse; sustained streaming rate = 1 sample / cycle once pipeline is full.
Navigation
Back: Shunyaya Symbolic Mathematical Hardware – Ports, Buses & Kernel Pipeline (2.1–2.3)
Next: Shunyaya Symbolic Mathematical Hardware – Numerics, Interop & Stamping (2.8–2.10)
Directory of Pages
SSMH – Table of Contents