Socket.IO vs WebSocket: Key Differences Explained

Socket.IO is not a thin WebSocket wrapper. It is a custom protocol — Engine.IO — layered on top of WebSocket with its own framing, packet types, and connection lifecycle.

Every Socket.IO connection starts with HTTP long-polling. The client sends its first payloads over plain HTTP requests. Only after that initial exchange succeeds does Engine.IO attempt a WebSocket upgrade. This means the first messages always travel over HTTP, adding a round trip compared to a direct WebSocket handshake.

This design choice is intentional: Socket.IO optimises for reliability over performance. Long-polling works through corporate proxies, hotel Wi-Fi captive portals, and restrictive firewalls that block WebSocket upgrades. In environments where you control the network, this trade-off costs you latency for no benefit. In environments where you don’t, it keeps your app working.
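
When you do control the network, the client can skip the long-polling phase entirely. A minimal sketch using the documented `transports` option of socket.io-client (the URL is a placeholder):

```javascript
// Sketch: a Socket.IO client configured to skip the long-polling phase.
// Only do this on networks where you know WebSocket upgrades succeed.
import { io } from 'socket.io-client';

const socket = io('https://example.com', {
  transports: ['websocket'], // skip HTTP long-polling entirely
});

socket.on('connect_error', (err) => {
  // With polling disabled there is no fallback: a blocked upgrade
  // surfaces here instead of degrading silently.
  console.error('connection failed:', err.message);
});
```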

The wire protocol is custom. A standard WebSocket client cannot connect to a Socket.IO server. A Socket.IO client cannot connect to a standard WebSocket server. If you adopt Socket.IO, you are locked into its ecosystem for both client and server.

A broadcast echo server — first with raw WebSocket, then with Socket.IO:

// Raw WebSocket (ws library) — manual everything
import { WebSocketServer } from 'ws';

const wss = new WebSocketServer({ port: 8080 });
const clients = new Set();

wss.on('connection', (ws) => {
  clients.add(ws);
  ws.on('message', (data) => {
    for (const client of clients) {
      if (client !== ws) client.send(data);
    }
  });
  ws.on('close', () => clients.delete(ws));
});

// Socket.IO — rooms and broadcast built in
import { Server } from 'socket.io';

const io = new Server(3000);

io.on('connection', (socket) => {
  socket.join('chat');
  socket.on('message', (data) => {
    socket.to('chat').emit('message', data);
  });
});

The Socket.IO version is shorter, but that is not the point. The real difference is what happens when you add a second server, lose a connection, or need to target specific users. Raw WebSocket leaves all of that to you.

Raw WebSockets give you a bidirectional byte pipe and nothing else. No reconnection. No message acknowledgement. No way to group connections. No fallback if WebSocket is blocked. Building production features on raw WebSockets means writing all of this yourself — and getting it wrong in ways you discover at 3am.
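
To make the gap concrete, here is a sketch of the reconnection logic you end up writing yourself on raw WebSockets. The backoff schedule and the injected socket factory are illustrative, not part of any library API:

```javascript
// Sketch of hand-rolled reconnection on a raw WebSocket. The socket
// factory is injected so any implementation (the browser global, the
// 'ws' package) can be plugged in.

// Exponential backoff with a cap: 1s, 2s, 4s, ... up to maxMs.
function backoffDelay(attempt, baseMs = 1000, maxMs = 30000) {
  return Math.min(baseMs * 2 ** attempt, maxMs);
}

function connectWithRetry(createSocket, onMessage, attempt = 0) {
  const ws = createSocket();
  ws.onopen = () => { attempt = 0; };        // reset backoff on success
  ws.onmessage = (event) => onMessage(event.data);
  ws.onclose = () => {
    // schedule a reconnect; a raw WebSocket will never do this for you
    setTimeout(() => connectWithRetry(createSocket, onMessage, attempt + 1),
               backoffDelay(attempt));
  };
  return ws;
}
```

Even this sketch glosses over the hard parts: distinguishing clean closes from failures, resubscribing to server-side state, and replaying messages lost while disconnected.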

Socket.IO fills that gap with features that matter in production:

Automatic reconnection. This is not a convenience feature — it is critical. WebSocket connections are stateful. They break constantly: mobile users switch from Wi-Fi to cellular, laptops close and reopen, load balancers cycle, servers deploy. A WebSocket connection that does not automatically reconnect is a WebSocket connection that silently stops working. Socket.IO handles reconnection with exponential backoff out of the box.
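
The backoff behaviour is configurable on the client. A sketch using the documented socket.io-client options (the values are illustrative):

```javascript
// Sketch: tuning Socket.IO's built-in reconnection. The option names
// come from the socket.io-client API; the values are illustrative.
import { io } from 'socket.io-client';

const socket = io('https://example.com', {
  reconnection: true,           // default: reconnect automatically
  reconnectionAttempts: Infinity,
  reconnectionDelay: 1000,      // first retry after ~1s
  reconnectionDelayMax: 10000,  // backoff caps at 10s
  randomizationFactor: 0.5,     // jitter to avoid thundering herds
});
```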

Rooms and namespaces. Group connections by topic, user, or channel. Broadcast to a room without iterating over every connected socket. This is the single most useful abstraction Socket.IO provides — it is the feature people reimplement badly when using raw WebSockets.
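
A sketch of how the two abstractions compose, with `/admin` and `alerts` as illustrative names: namespaces partition the connection space, and rooms group sockets within a namespace:

```javascript
// Sketch: a namespace ('/admin') with a room ('alerts') inside it.
import { Server } from 'socket.io';

const io = new Server(3000);
const admin = io.of('/admin');          // separate namespace

admin.on('connection', (socket) => {
  socket.join('alerts');                // room inside the namespace
  admin.to('alerts').emit('notice', 'admin connected');
});
```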

Acknowledgements. Send a message and get a callback when the server has processed it. This is request-response semantics on top of a persistent connection — useful for actions like “send message and confirm it was stored.”
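
A sketch of an acknowledgement round trip; the event name and payload shape are illustrative:

```javascript
// Sketch of request-response via acknowledgements. Server side: the
// trailing argument is a callback supplied by the client.
import { Server } from 'socket.io';

const io = new Server(3000);

io.on('connection', (socket) => {
  socket.on('save-message', (msg, ack) => {
    // ...persist msg somewhere...
    ack({ status: 'stored' });          // fires the callback on the client
  });
});

// Client side (socket.io-client): the trailing callback is the ack handler,
// and socket.timeout(ms) turns a missing ack into an error instead of
// waiting forever.
//
// const socket = io('http://localhost:3000');
// socket.timeout(5000).emit('save-message', { text: 'hi' }, (err, res) => {
//   if (err) { /* no ack within 5s */ } else { /* res.status === 'stored' */ }
// });
```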

HTTP fallback. If WebSocket is blocked, the connection stays on long-polling. The API is the same either way. Your application code does not need to know which transport is active.

Socket.IO works well for a single server handling hundreds to low thousands of connections. Problems start when you need more.

Scaling requires three systems to work together

Scaling Socket.IO horizontally requires:

  1. Sticky sessions at the load balancer — because Engine.IO’s HTTP long-polling phase requires multiple HTTP requests to hit the same server
  2. A pub/sub adapter (Redis, NATS, or similar) — because a room broadcast on server A needs to reach clients on server B
  3. The Socket.IO nodes themselves — each maintaining local connection state

That is three independent failure points. If the load balancer loses its session affinity table, long-polling clients get 400 errors. If Redis goes down, cross-node broadcasts stop. If a Socket.IO node crashes, all its client state is gone. Each layer has its own failure mode, its own monitoring, and its own scaling limits.
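
The adapter layer (step 2) is typically wired up like this; a sketch assuming the official `@socket.io/redis-adapter` and a local Redis instance:

```javascript
// Sketch: cross-node broadcasts via the Redis adapter. The Redis URL
// is illustrative.
import { createServer } from 'node:http';
import { Server } from 'socket.io';
import { createClient } from 'redis';
import { createAdapter } from '@socket.io/redis-adapter';

const pubClient = createClient({ url: 'redis://localhost:6379' });
const subClient = pubClient.duplicate();
await Promise.all([pubClient.connect(), subClient.connect()]);

const io = new Server(createServer(), {
  adapter: createAdapter(pubClient, subClient),
});
// io.to('chat').emit(...) now reaches sockets on every node via Redis.
```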

Socket.IO provides at-most-once delivery. If a client is disconnected when a message is sent, that message is lost. There is no server-side queue, no replay from a position, no catch-up mechanism.

Connection state recovery, added in v4.6, stores missed packets in memory for up to two minutes. If the client reconnects within that window and the server has not restarted, it receives the buffered messages. This helps with brief network blips. It does not help with server deployments, node failures, or disconnections longer than two minutes. It also does not work with every adapter.
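
Enabling it is a server option; a sketch with the default two-minute window spelled out:

```javascript
// Sketch: enabling connection state recovery (Socket.IO v4.6+).
import { Server } from 'socket.io';

const io = new Server(3000, {
  connectionStateRecovery: {
    maxDisconnectionDuration: 2 * 60 * 1000, // buffer packets for up to 2 min
    skipMiddlewares: true,                   // skip middleware on recovery
  },
});

io.on('connection', (socket) => {
  if (socket.recovered) {
    // session restored: missed packets were delivered on reconnect
  }
});
```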

Socket.IO has no built-in multi-region support. If your users are global, you either route everyone to a single region (adding latency) or you build your own cross-region message routing on top of Socket.IO — at which point you are building your own messaging infrastructure.

The official Socket.IO server is Node.js-only. Community ports exist for Java, Python, Go, and other languages, but they are maintained independently and often lag behind the official release. If your backend is not Node.js, you are depending on volunteers to keep your server library compatible.

Socket.IO provides no token management, no capability-based access control, no end-to-end encryption. Authentication is left to middleware you write yourself. This is fine for internal tools but becomes a significant engineering burden for user-facing applications where you need per-channel permissions, token rotation, and audit logs.
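
A sketch of the middleware you end up writing; `verifyToken` is a placeholder for your own JWT or session check:

```javascript
// Sketch of hand-written auth middleware. Socket.IO runs the middleware
// but does no validation itself; verifyToken is a placeholder.
import { Server } from 'socket.io';

const io = new Server(3000);

io.use((socket, next) => {
  // the client sends this via io(url, { auth: { token } })
  const token = socket.handshake.auth.token;
  if (!verifyToken(token)) {                // your code, your rules
    return next(new Error('unauthorized')); // rejects the connection
  }
  next();
});
```

Per-channel permissions, token rotation, and audit logging all have to be layered on top of this hook by hand.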

Most teams go through a predictable progression:

Layer 1: Raw WebSocket. You open a connection, send JSON, receive JSON. This is great for learning how WebSockets work and for experiments. It is not production-ready. You have no reconnection, no rooms, no delivery guarantees, no scaling story.

Layer 2: Socket.IO (or similar library). You get a protocol on top of WebSocket: reconnection, rooms, acknowledgements, fallback transports. You self-host it. This is a legitimate production choice for many applications — internal dashboards, prototypes, tools where you accept at-most-once delivery and single-region deployment.

Layer 3: Managed realtime infrastructure. Services like Ably, PubNub, or Pusher handle the infrastructure: guaranteed delivery, ordering, global edge distribution, multi-protocol support, SLAs. You stop managing WebSocket servers and focus on your application.

This is not a maturity ladder where everyone must reach the top. Layer 2 is the right answer for plenty of applications. But if you find yourself building Redis adapters, writing cross-region routing, implementing message replay, and debugging sticky session failures — you are rebuilding Layer 3 from scratch, and the economics stop making sense.

At Ably, we see this pattern constantly. Teams start with Socket.IO, ship successfully, then hit scaling walls or delivery reliability issues as their user base grows. The migration drivers are predictable: message ordering under load, delivery guarantees during deploys, and latency for users far from the primary region.

| Feature             | Raw WebSocket | Socket.IO                 | Managed service (e.g. Ably) |
| ------------------- | ------------- | ------------------------- | --------------------------- |
| Reconnection        | Manual        | Built-in                  | Built-in                    |
| Rooms / channels    | Manual        | Built-in                  | Built-in                    |
| Delivery guarantee  | None          | At-most-once              | Exactly-once (Ably)         |
| Message ordering    | None          | Per-socket                | Global ordering             |
| Horizontal scaling  | Manual        | Sticky sessions + adapter | Handled                     |
| Global distribution | Manual        | Not built-in              | Edge network                |
| Server languages    | Any           | Node.js (official)        | Any (SDKs)                  |
| Protocol interop    | Standard      | Custom (Engine.IO)        | Multiple protocols          |
| SLA                 | None          | None (OSS)                | 99.999% (Ably)              |
| Connection recovery | Manual        | 2 min max                 | Persistent                  |

Is Socket.IO just a wrapper around WebSocket?

No. Socket.IO runs its own protocol, Engine.IO, which defines packet types, a connection handshake, and a transport upgrade mechanism. It starts every connection with HTTP long-polling and only promotes to WebSocket after the first data exchange completes. The wire format is incompatible with standard WebSocket — you cannot point a browser’s new WebSocket() at a Socket.IO server and expect it to work. This is a deliberate design trade-off: Socket.IO chose broad network compatibility over protocol-level interop.

Why does Socket.IO start with long-polling?

Socket.IO prioritises connection success rate over initial latency. HTTP long-polling works through virtually any network environment — corporate proxies, captive portals, firewalls that inspect and block WebSocket upgrade headers. By establishing the connection over HTTP first and upgrading later, Socket.IO avoids the failure mode where a WebSocket handshake is silently blocked and the client hangs. The cost is an extra round trip on first connection. If you control your network and know WebSocket works, this cost buys you nothing.

Can I scale Socket.IO to thousands of concurrent users?

Yes, with effort. You need sticky sessions at the load balancer (IP-hash or cookie-based), a pub/sub adapter like the @socket.io/redis-adapter for cross-node event distribution, and independent monitoring for each layer. Many teams run Socket.IO at this scale successfully. The complexity curve steepens past tens of thousands of connections, where adapter throughput, garbage collection pauses, and sticky-session rebalancing start to interact in hard-to-predict ways.

Does Socket.IO guarantee message delivery?

No. Socket.IO is at-most-once by default. Messages emitted while a client is disconnected are dropped. The v4.6 connection state recovery feature buffers missed packets in server memory for up to two minutes, but this has constraints: it does not survive server restarts, it does not work with all adapter configurations, and the window is too short for many production scenarios. If your application requires that no message is ever lost, you need a system with server-side persistence and replay — which is outside Socket.IO’s scope.
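
If you need stronger guarantees while staying on Socket.IO, one common pattern is an at-least-once layer built on acknowledgements. A sketch; `emitWithAck` here stands for any function that resolves on ack and rejects on timeout, such as `socket.timeout(ms).emitWithAck(...)` from socket.io-client v4.6+:

```javascript
// Sketch of at-least-once delivery on top of acknowledgements: retry an
// emit until the server acks, up to maxRetries extra attempts.
async function sendAtLeastOnce(emitWithAck, event, payload, maxRetries = 3) {
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      return await emitWithAck(event, payload); // resolves when acked
    } catch {
      // Timed out waiting for the ack; retry. The server may already have
      // processed the message, so server handlers must be idempotent.
    }
  }
  throw new Error(`no ack after ${maxRetries + 1} attempts`);
}
```

Note what this does not give you: retries stop when the client process dies, so it is still not durable delivery, only a narrower loss window.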

When should I choose Socket.IO over a managed service?

Choose Socket.IO when you want full control over your infrastructure, your team is comfortable operating Node.js at scale, and you can tolerate at-most-once delivery. It is excellent for internal tools, prototypes, hackathon projects, and applications where occasional message loss during deploys or disconnections is acceptable. Move to a managed service when you need delivery guarantees, multi-region distribution, multi-language server support, or a contractual SLA. The tipping point is usually when you start building your own reliability layer on top of Socket.IO — that is the signal that you have outgrown it.


Written by Matthew O’Riordan, Co-founder & CEO of Ably, with experience building real-time systems reaching 2 billion+ devices monthly.