Service
SLM infrastructure
Pain
A trained SLM is only useful when inference is reliable: right-sized GPUs, observable latency and errors, safe promotion of new versions, and clarity on who operates the stack when something breaks at 2 a.m.
Outcome
We help you stand up serving on your own metal or cloud accounts (Path A), or source dedicated GPU capacity through SLM-Works where that fits your procurement model (Path B) - with runbooks, monitoring hooks, and explicit responsibility splits in the statement of work.
Differentiator
We document who runs hosts, networks, backups, and escalation paths before go-live. Marketing copy here is descriptive; binding SLAs and resale terms exist only in signed agreements.
Legal and commercial review
Language about GPU resale, partner data centres, capacity commitments, and SLA splits on this page is draft until your counsel and ours approve it for public use. Do not treat this page as a contractual offer. Request the latest commercial pack during sales discussions.
Path A. On-premises or your cloud accounts
You retain ownership of subscriptions, regions, and access policies. We install and configure serving runtimes, wire observability into your toolchain, and hand over artifacts your platform team can reproduce and maintain.
- Install and baseline inference stacks aligned to your security baseline (containers, VM images, or managed services you approve).
- Integrate with identity, logging, and metrics systems you already operate.
- Define promotion paths from staging to production with rollback checkpoints.
- Produce runbooks for scaling events, certificate rotation, and model version swaps.
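The promotion path above reduces to a gate: a candidate model version is measured in staging and either promoted or held back at a rollback checkpoint. A minimal sketch, with illustrative metric names and thresholds - real values live in your SOW and runbooks, not here:

```python
from dataclasses import dataclass

@dataclass
class StagingMetrics:
    p95_latency_ms: float   # 95th-percentile request latency
    error_rate: float       # fraction of 5xx responses
    throughput_rps: float   # sustained requests per second

def promotion_gate(candidate: StagingMetrics,
                   max_p95_ms: float = 500.0,
                   max_error_rate: float = 0.01,
                   min_rps: float = 50.0) -> bool:
    """Return True if the candidate version may be promoted to production.

    Thresholds are placeholders; agreed values come from the runbook.
    """
    return (candidate.p95_latency_ms <= max_p95_ms
            and candidate.error_rate <= max_error_rate
            and candidate.throughput_rps >= min_rps)

# One candidate that clears the gate, one that should trigger a rollback
good = StagingMetrics(p95_latency_ms=320.0, error_rate=0.002, throughput_rps=80.0)
bad = StagingMetrics(p95_latency_ms=900.0, error_rate=0.002, throughput_rps=80.0)
print(promotion_gate(good))  # True
print(promotion_gate(bad))   # False
```

The same checkpoint logic runs in reverse during rollback: if post-cutover metrics breach the thresholds, the previous version is restored.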
Path B. Dedicated GPU capacity via SLM-Works
When you prefer to buy capacity through SLM-Works rather than contracting hyperscaler resources directly, we can arrange dedicated GPU footprints with partner providers. Commercial terms, regions, and uptime targets are specified in contract - not on this website.
- Capacity is dedicated to your workloads - not drawn from shared public API pools.
- Roles are split explicitly: what SLM-Works operates end-to-end, what you operate (applications, data ingress), and what the underlying facility provides (power, physical security, host SLA).
- Network and data-residency choices are agreed before workloads move.
- Billing and change windows follow the ordering document; this page does not quote prices or SLAs.
Architecture at a glance
Illustrative comparison only - your signed architecture diagram supersedes this figure.
On-prem footprint vs dedicated capacity
Support and operations matrix
Typical split of responsibilities; the statement of work and runbooks are authoritative.
| Area | SLM-Works | Customer | Provider (Path B) |
|---|---|---|---|
| Model artifacts & version promotion | Defines promotion playbooks; assists cutover | Approves releases; owns business risk | N/A unless hosted registry is bundled |
| Inference runtime & model serving config | Implements baseline; documents tuning knobs | Owns change control in prod | Host OS / hypervisor only (Path B) |
| GPU / accelerator hosts | Specifies sizing; may procure in Path B | Owns or approves SKUs (Path A) | Physical hardware & facility (Path B) |
| Monitoring & alerting | Integrates dashboards & SLO templates | Owns on-call rosters & escalation | Facility/host metrics per contract |
| Backups & disaster recovery (non-model data) | Documents recommended patterns | Owns policies & execution | May offer snapshots per SKU (Path B) |
| Incident response (P1) | Participates per support tier in SOW | Incident commander for product impact | Per facility runbooks (Path B) |
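The monitoring row in the matrix comes down to a few window computations the dashboards and SLO templates encode - p95 latency and error rate over recent requests. A minimal sketch with made-up sample data; a real pipeline would read these windows from your metrics backend:

```python
import math

def p95_latency_ms(latencies_ms: list[float]) -> float:
    """95th-percentile latency via the nearest-rank method on a window."""
    ordered = sorted(latencies_ms)
    rank = math.ceil(0.95 * len(ordered))  # nearest-rank index (1-based)
    return ordered[rank - 1]

def error_rate(status_codes: list[int]) -> float:
    """Fraction of responses in the window with a 5xx status."""
    errors = sum(1 for code in status_codes if 500 <= code < 600)
    return errors / len(status_codes)

# Hypothetical one-minute window of inference requests
window_latencies = [120.0, 95.0, 210.0, 180.0, 400.0,
                    150.0, 130.0, 160.0, 170.0, 900.0]
window_statuses = [200] * 98 + [503, 500]

print(p95_latency_ms(window_latencies))  # 900.0
print(error_rate(window_statuses))       # 0.02
```

Who computes these numbers (SLM-Works dashboards) versus who pages on them (your on-call roster) is exactly the split the matrix documents.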
Plan capacity, ownership, and observability with us
Start with discovery; we bring reference patterns - not a one-size-fits-all appliance.
Frequently asked questions
Practical answers for technical buyers; validate resale, SLA, and capacity wording with legal and sales before public launch and paid campaigns.
When should we choose Path A versus Path B?
Who owns the GPUs in Path B?
What SLAs apply?
Can you run inside our existing Kubernetes platform?
How do we monitor latency and errors in production?
How does this relate to custom SLM development?
Which regions or countries are supported?
Ready to align infra, procurement, and on-call ownership?
Use the contact page for a PoC or a short discovery call.