Peer-to-peer LLM network

Open-source AI,served by the people.

A network of machines serving open models. Run the app to add yours, or reach the whole network through one OpenAI-compatible API. No third-party AI provider in between.

Have a machine to spare? Run a node and earn credits for tokens you serve.

How it works

Two roles. One mesh.

Anyone can use the network — through an OpenAI-compatible API, or the chat on senda.network — without running anything themselves. The models are served by peers who run the Senda app on their own hardware. Anyone can do one, both, or neither.

Send a request

A call to an OpenAI-compatible endpoint — from your code, an agent, or the chat on senda.network. Streamed response, no account, nothing to install.

Mesh entry

Requests land at the public mesh entry point and are routed to a peer that can serve the requested model — by capability, by load, by latency.

Compute peers

Volunteered nodes running Senda LLM serve each session end-to-end on whichever peer fits the model, and the router auto-routes around offline ones. A model too big for any single peer can be split across several.

Two glowing clusters of connected nodes linked by curved luminous lines passing through a central node

The app

Run open models on your own machine.

The Senda app puts the network on your hardware: serve models to the mesh, or run them privately just for yourself. Manage models, watch the mesh, and chat with what's live — one native app for macOS, Windows, and Linux.

Senda — Chat

Chat

Private LLM, right in the app

Stream answers from the mesh with model selection, thread history, and no account.

Get the app →

Senda — Setup

Setup

One click to join the mesh

Install the runtime, opt into staying live on login, and start contributing capacity.

How to contribute →

Senda — Web

Web

Try it in the browser first

No install required — chat at senda.network, then graduate to the desktop app when you want to run a node.

Try the mesh →

Why a mesh

Capacity is everywhere. Senda just uses it.

No third-party AI provider

Your prompt goes to a peer running an open-weight model on a contributed machine — not OpenAI, Anthropic, or Google. Nothing to revoke, no provider terms to read.

Apple Silicon carries a lot of it

An M3 Max or M4 Max with 64–128 GB of unified memory can serve 30B–70B-class models at full quality — competitive with, and often ahead of, same-price Windows GPU boxes for large models. CUDA / ROCm / Vulkan machines join too; each fits different model sizes.

One peer, end to end

A model that fits on one peer runs there start to finish — full quality, no per-token network hops. Splitting a model across peers exists as a fallback for models no single peer can hold.

Peers are verified

The mesh checks that a peer actually runs the model it advertises: each publishes a deterministic fingerprint and the network re-runs an unpredictable synthetic probe to compare. A peer can't claim a big model while quietly serving a smaller one. Real prompts are never replayed.

OpenAI-compatible

Every peer exposes a standard /v1/chat/completions endpoint. Drop-in for any tool that speaks OpenAI — agents, IDE plugins, internal scripts.

Run your own peer

The peer serving your session reads your prompt. Don't want to trust someone else's box? Run the same open-source runtime yourself — the mesh, on hardware you control. Share it, rent it, or keep it entirely in-house.

A glowing green trail winding across topographic contour lines, dotted with luminous waypoint nodes linked into a mesh

Two sides

Use the mesh. Or become part of it.

Use

Build on the network

Point any OpenAI-compatible tool, agent, or script at the network — or just chat on senda.network. No signup for chat; your request goes to a peer, not a third-party AI provider.

Read the docs →

Contribute

Run a node

Have a capable Mac or GPU box? Download the desktop app or curl the runtime. It autostarts and joins the mesh, adding capacity for everyone.

Contribute →

Earn

Earn credits

Contributors accumulate credits for completion tokens served to the mesh. Tracked now in the dashboard; redeemable when payouts ship. No crypto token.

Credits & rates →

What it's for

Built for the work you keep in-house.

Senda is low-cost inference on open-weight models, served by the mesh instead of a hosted AI API. It's for teams where keeping data off third-party providers and keeping per-token costs flat matter more than shaving a second off every reply.

Great fit

Summarizing documents and codebases
Classifying or labeling data at scale
Long-running background agents and pipelines
Synthetic-data generation
Anything private or high-volume where an instant answer isn't the point

Why it holds up

Off third-party AI — your prompt goes to a peer in the mesh, not OpenAI, Anthropic, or Google
Yours to control — run your own peer on hardware you own, not a rented black-box endpoint
No lock-in — OpenAI-compatible API, fully open-source runtime
Verified peers — each one proves it runs the model it advertises

FAQ

The questions people ask first

What is Senda?

A peer-to-peer mesh that runs open-weight models end-to-end on hardware contributors already own. Reach it through an OpenAI-compatible API — or the chat on senda.network — and a capability-aware router sends each request to a peer that can serve it. No third-party AI provider sits in the middle.

Do I need to sign up or install anything to chat?

No. Open senda.network and start typing — no account, no install. The desktop app is only needed if you want to run a node and contribute compute.

Can a peer read my prompts?

The peer serving your session has to read the prompt to run inference — that's the honest trade versus a hosted API. The runtime is open source so peers can be audited, sessions aren't tied to an identity, and for anything you don't want to trust to others, you can run your own peer with the same runtime.

Which models can I use?

Open-weight models served by live peers — the set changes as peers come and go, which is why the live status above lists what's serving right now. Apple Silicon Macs with enough memory can serve 30B–70B-class models at full quality; smaller machines serve smaller models well.

What hardware can contribute?

Apple Silicon Macs (Metal), and NVIDIA (CUDA), AMD (ROCm) or Intel/other (Vulkan) GPU boxes on macOS, Linux, or Windows. The installer detects your OS, CPU architecture and GPU vendor and pulls the matching build.

Use the network, or help run it.

Build on it through the API, try it in your browser, or add your machine and grow the network for everyone.

Try the mesh Run a node