mirror of https://github.com/nodejs/node.git synced 2024-11-29 15:06:33 +01:00
nodejs/doc/api/cluster.markdown

# Cluster

    Stability: 1 - Experimental

A single instance of Node runs in a single thread. To take advantage of
multi-core systems, the user will sometimes want to launch a cluster of Node
processes to handle the load.

The cluster module allows you to easily create a network of processes that
all share server ports.

    var cluster = require('cluster');
    var http = require('http');
    var numCPUs = require('os').cpus().length;

    if (cluster.isMaster) {
      // Fork workers.
      for (var i = 0; i < numCPUs; i++) {
        cluster.fork();
      }

      cluster.on('exit', function(worker, code, signal) {
        console.log('worker ' + worker.process.pid + ' died');
      });
    } else {
      // Workers can share any TCP connection.
      // In this case it's an HTTP server.
      http.createServer(function(req, res) {
        res.writeHead(200);
        res.end("hello world\n");
      }).listen(8000);
    }

Running node will now share port 8000 between the workers:

    % node server.js
    Worker 2438 online
    Worker 2437 online

This feature was introduced recently, and may change in future versions.
Please try it out and provide feedback.

## How It Works

<!--type=misc-->

The worker processes are spawned using the `child_process.fork` method,
so that they can communicate with the parent via IPC and pass server
handles back and forth.

When you call `server.listen(...)` in a worker, it serializes the
arguments and passes the request to the master process. If the master
process already has a listening server matching the worker's
requirements, then it passes the handle to the worker. If it does not
already have a listening server matching that requirement, then it will
create one, and pass the handle to the child.

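The lookup described above can be pictured as a small registry keyed by the serialized `listen()` arguments. This is an illustrative model only, not Node's internal implementation: `createHandleRegistry`, `acquire` and the `createHandle` callback are hypothetical names, and `JSON.stringify` merely stands in for whatever serialization the module really uses.

```javascript
// Illustrative model only: this is NOT Node's internal implementation.
// `createHandle` is a hypothetical stand-in for "create a listening server".
function createHandleRegistry(createHandle) {
  var handles = {};

  return {
    // Conceptually called when a worker's listen() request reaches the master.
    acquire: function(listenArgs) {
      // The serialized arguments act as the lookup key.
      var key = JSON.stringify(listenArgs);
      if (!Object.prototype.hasOwnProperty.call(handles, key)) {
        handles[key] = createHandle(listenArgs);
      }
      return handles[key];
    }
  };
}

// Dry run with a fake handle factory that counts how often it is invoked.
var created = 0;
var registry = createHandleRegistry(function(args) {
  created += 1;
  return { args: args, serial: created };
});

var first = registry.acquire([8000]);
var second = registry.acquire([8000]);  // same request from another worker
var third = registry.acquire([9000]);   // different port, so a new handle

console.log(first === second);  // true
console.log(first === third);   // false
console.log(created);           // 2
```
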
This causes potentially surprising behavior in three edge cases:

1. `server.listen({fd: 7})` Because the message is passed to the master,
   file descriptor 7 **in the parent** will be listened on, and the
   handle passed to the worker, rather than listening to the worker's
   idea of what the number 7 file descriptor references.
2. `server.listen(handle)` Listening on handles explicitly will cause
   the worker to use the supplied handle, rather than talk to the master
   process. If the worker already has the handle, then it's presumed
   that you know what you are doing.
3. `server.listen(0)` Normally, this will cause servers to listen on a
   random port. However, in a cluster, each worker will receive the
   same "random" port each time they do `listen(0)`. In essence, the
   port is random the first time, but predictable thereafter. If you
   want to listen on a unique port, generate a port number based on the
   cluster worker ID.

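For the third case, one possible way to derive a unique port from the worker ID is sketched below. `portForWorker` is a hypothetical helper, and the sketch assumes worker ids are non-negative integers or numeric strings.

```javascript
// Hypothetical helper: derive a distinct port for each worker from its id.
// Assumes ids are non-negative integers (or numeric strings).
function portForWorker(basePort, workerId) {
  var id = Number(workerId);
  if (id !== Math.floor(id) || id < 0) {
    throw new Error('unexpected worker id: ' + workerId);
  }
  var port = basePort + id;
  if (port > 65535) {
    throw new RangeError('port out of range: ' + port);
  }
  return port;
}

console.log(portForWorker(8000, 1));    // 8001
console.log(portForWorker(8000, '2'));  // 8002
```

In a worker this might be used as `server.listen(portForWorker(8000, cluster.worker.id))`.
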
When multiple processes are all `accept()`ing on the same underlying
resource, the operating system load-balances across them very
efficiently. There is no routing logic in Node.js, or in your program,
and no shared state between the workers. Therefore, it is important to
design your program such that it does not rely too heavily on in-memory
data objects for things like sessions and login.

Because workers are all separate processes, they can be killed or
re-spawned depending on your program's needs, without affecting other
workers. As long as there are some workers still alive, the server will
continue to accept connections. Node does not automatically manage the
number of workers for you, however. It is your responsibility to manage
the worker pool for your application's needs.

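Since pool management is left to the application, a restart policy has to be written by hand. The following is a minimal sketch of one such policy, a cap on the number of restarts. `createRespawner`, `forkFn` and `maxRestarts` are hypothetical names, and the fork function is injected so the policy can be exercised without spawning processes.

```javascript
// Hypothetical policy helper: restart dead workers, but only up to a limit.
function createRespawner(forkFn, maxRestarts) {
  var restarts = 0;

  // Intended to be registered as an 'exit' listener.
  return function onExit(worker, code, signal) {
    if (restarts >= maxRestarts) {
      return false;  // give up; something is probably wrong
    }
    restarts += 1;
    forkFn();
    return true;
  };
}

// Dry run with a fake fork function.
var forks = 0;
var onExit = createRespawner(function() { forks += 1; }, 2);
var results = [onExit(), onExit(), onExit()];

console.log(results);  // the third call is refused
console.log(forks);    // 2
```

In a master this might be wired up as `cluster.on('exit', createRespawner(function() { cluster.fork(); }, 10))`.
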
## cluster.settings

* {Object}
  * `exec` {String} file path to worker file. (Default=`__filename`)
  * `args` {Array} string arguments passed to worker.
    (Default=`process.argv.slice(2)`)
  * `silent` {Boolean} whether or not to send output to parent's stdio.
    (Default=`false`)

All settings set by `.setupMaster` are stored in this settings object.
This object is not supposed to be changed or set manually.

## cluster.isMaster

* {Boolean}

True if the process is a master. This is determined
by the `process.env.NODE_UNIQUE_ID`. If `process.env.NODE_UNIQUE_ID` is
undefined, then `isMaster` is `true`.

## cluster.isWorker

* {Boolean}

This boolean flag is true if the process is a worker forked from a master.
If the `process.env.NODE_UNIQUE_ID` is set to a value, then
`isWorker` is `true`.

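The rule shared by `isMaster` and `isWorker` can be restated as a small function. This mirrors the description above and is not the module's source code. `roleFromEnv` is a hypothetical name, and the environment is passed in so the rule can be checked without forking.

```javascript
// Restates the documented rule; not the cluster module's actual code.
function roleFromEnv(env) {
  return env.NODE_UNIQUE_ID === undefined ? 'master' : 'worker';
}

console.log(roleFromEnv({}));                       // master
console.log(roleFromEnv({ NODE_UNIQUE_ID: '1' }));  // worker
```
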
## Event: 'fork'

* `worker` {Worker object}

When a new worker is forked the cluster module will emit a 'fork' event.
This can be used to log worker activity, and create your own timeout.

    var timeouts = [];
    function errorMsg() {
      console.error("Something must be wrong with the connection ...");
    }

    cluster.on('fork', function(worker) {
      timeouts[worker.id] = setTimeout(errorMsg, 2000);
    });
    cluster.on('listening', function(worker, address) {
      clearTimeout(timeouts[worker.id]);
    });
    cluster.on('exit', function(worker, code, signal) {
      clearTimeout(timeouts[worker.id]);
      errorMsg();
    });

## Event: 'online'

* `worker` {Worker object}

After forking a new worker, the worker should respond with an online message.
When the master receives an online message it will emit this event.
The difference between 'fork' and 'online' is that 'fork' is emitted when the
master tries to fork a worker, and 'online' is emitted when the worker is
being executed.

    cluster.on('online', function(worker) {
      console.log("Yay, the worker responded after it was forked");
    });

## Event: 'listening'

* `worker` {Worker object}
* `address` {Object}

When calling `listen()` from a worker, a 'listening' event is automatically assigned
to the server instance. When the server is listening, a message is sent to the master
where the 'listening' event is emitted.

The event handler is executed with two arguments: `worker` contains the worker
object, and `address` contains the following connection properties:
`address`, `port` and `addressType`. This is very useful if the worker is listening
on more than one address.

    cluster.on('listening', function(worker, address) {
      console.log("A worker is now connected to " + address.address + ":" + address.port);
    });

## Event: 'disconnect'

* `worker` {Worker object}

When a worker's IPC channel has disconnected, this event is emitted. This will happen
when the worker dies, usually after calling `.destroy()`.

When calling `.disconnect()`, there may be a delay between the
`disconnect` and `exit` events. This event can be used to detect if
the process is stuck in a cleanup or if there are long-living
connections.

    cluster.on('disconnect', function(worker) {
      console.log('The worker #' + worker.id + ' has disconnected');
    });

## Event: 'exit'

* `worker` {Worker object}
* `code` {Number} the exit code, if it exited normally.
* `signal` {String} the name of the signal (e.g. `'SIGHUP'`) that caused
  the process to be killed.

When any of the workers die the cluster module will emit the 'exit' event.
This can be used to restart the worker by calling `fork()` again.

    cluster.on('exit', function(worker, code, signal) {
      var exitCode = worker.process.exitCode;
      console.log('worker ' + worker.process.pid + ' died ('+exitCode+'). restarting...');
      cluster.fork();
    });

## Event: 'setup'

* `worker` {Worker object}

When the `.setupMaster()` function has been executed this event emits.
If `.setupMaster()` was not executed before `fork()`, then `fork()` will
call `.setupMaster()` with no arguments.

## cluster.setupMaster([settings])

* `settings` {Object}
  * `exec` {String} file path to worker file. (Default=`__filename`)
  * `args` {Array} string arguments passed to worker.
    (Default=`process.argv.slice(2)`)
  * `silent` {Boolean} whether or not to send output to parent's stdio.
    (Default=`false`)

`setupMaster` is used to change the default 'fork' behavior. It takes
one option object argument.

Example:

    var cluster = require("cluster");
    cluster.setupMaster({
      exec : "worker.js",
      args : ["--use", "https"],
      silent : true
    });
    cluster.fork();

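To make the defaults listed above concrete, here is a sketch of how they could be applied to a partial settings object. `resolveSettings` is a hypothetical helper written for illustration. The cluster module applies its defaults internally and its real code may differ. The main file path and argument vector are passed in explicitly as stand-ins for `__filename` and `process.argv`.

```javascript
// Hypothetical helper that applies the documented defaults.
// `mainFile` stands in for __filename and `argv` for process.argv.
function resolveSettings(settings, mainFile, argv) {
  settings = settings || {};
  return {
    exec: settings.exec !== undefined ? settings.exec : mainFile,
    args: settings.args !== undefined ? settings.args : argv.slice(2),
    silent: settings.silent !== undefined ? settings.silent : false
  };
}

var resolved = resolveSettings(
  { exec: 'worker.js' },
  '/app/server.js',
  ['node', '/app/server.js', '--port', '8000']
);

console.log(resolved.exec);    // worker.js
console.log(resolved.args);    // the arguments after the script path
console.log(resolved.silent);  // false
```
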
## cluster.fork([env])

* `env` {Object} Key/value pairs to add to child process environment.
* return {Worker object}

Spawn a new worker process. This can only be called from the master process.

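The `env` argument can carry per-worker configuration. The sketch below builds such an object. `buildWorkerEnv` and the variable names `WORKER_INDEX` and `WORKER_COUNT` are invented for this example. Values are converted to strings because environment variables are strings.

```javascript
// Hypothetical helper: builds the key/value pairs passed to cluster.fork().
// WORKER_INDEX and WORKER_COUNT are made-up names for illustration.
function buildWorkerEnv(index, total) {
  return {
    WORKER_INDEX: String(index),
    WORKER_COUNT: String(total)
  };
}

var env = buildWorkerEnv(0, 4);
console.log(env.WORKER_INDEX);  // 0
console.log(env.WORKER_COUNT);  // 4
```

In the master this might be used as `cluster.fork(buildWorkerEnv(i, numCPUs))`, and a worker would then read `process.env.WORKER_INDEX`.
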
## cluster.disconnect([callback])

* `callback` {Function} called when all workers are disconnected and handlers are closed

When calling this method, all workers will commit a graceful suicide. When they are
disconnected, all internal handlers will be closed, allowing the master process to
die gracefully if no other event is waiting.

The method takes an optional callback argument which will be called when finished.

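Because a graceful disconnect can take arbitrarily long, the callback is often paired with a deadline. The following sketch shows one way to do that. `disconnectWithDeadline` is a hypothetical wrapper, and `clusterLike` is any object with a `disconnect(callback)` method, so either the real cluster module or a test double can be passed in.

```javascript
// Hypothetical wrapper: report whether disconnect finished before a deadline.
function disconnectWithDeadline(clusterLike, timeoutMs, done) {
  var finished = false;

  var timer = setTimeout(function() {
    if (finished) return;
    finished = true;
    done(new Error('disconnect timed out after ' + timeoutMs + 'ms'));
  }, timeoutMs);

  clusterLike.disconnect(function() {
    if (finished) return;
    finished = true;
    clearTimeout(timer);
    done(null);
  });
}

// Dry run with a fake that reports disconnection immediately.
var outcome;
var fakeCluster = { disconnect: function(callback) { callback(); } };
disconnectWithDeadline(fakeCluster, 1000, function(err) {
  outcome = err ? 'timeout' : 'graceful';
});

console.log(outcome);  // graceful
```

What to do on timeout is left to the caller. Destroying the remaining workers is one option.
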
## cluster.workers

* {Object}

In the cluster all living worker objects are stored in this object by their
`id` as the key. This makes it easy to loop through all living workers.

    // Go through all workers
    function eachWorker(callback) {
      for (var id in cluster.workers) {
        callback(cluster.workers[id]);
      }
    }
    eachWorker(function(worker) {
      worker.send('big announcement to all workers');
    });

Should you wish to reference a worker over a communication channel, using
the worker's unique id is the easiest way to find the worker.

    socket.on('data', function(id) {
      var worker = cluster.workers[id];
    });

## Class: Worker

A Worker object contains all public information and methods about a worker.
In the master it can be obtained using `cluster.workers`. In a worker
it can be obtained using `cluster.worker`.

### worker.id

* {String}

Each new worker is given its own unique id, which is stored in the
`id` property.

While a worker is alive, this is the key that indexes it in
`cluster.workers`.

### worker.process

* {ChildProcess object}

All workers are created using `child_process.fork()`. The object returned
by this function is stored in `process`.

See: [Child Process module](child_process.html)

### worker.suicide

* {Boolean}

This property is a boolean. It is set when a worker dies after calling `.destroy()`
or immediately after calling the `.disconnect()` method. Until then it is `undefined`.

### worker.send(message, [sendHandle])

* `message` {Object}
* `sendHandle` {Handle object}

This function is equal to the send method provided by
`child_process.fork()`. In the master you should use this function to
send a message to a specific worker. However, in a worker you can also use
`process.send(message)`, since this is the same function.

This example will echo back all messages from the master:

    if (cluster.isMaster) {
      var worker = cluster.fork();
      worker.send('hi there');

    } else if (cluster.isWorker) {
      process.on('message', function(msg) {
        process.send(msg);
      });
    }

### worker.destroy()

This function will kill the worker, and inform the master to not spawn a
new worker. The boolean `suicide` lets you distinguish between voluntary
and accidental exit.

    cluster.on('exit', function(worker, code, signal) {
      if (worker.suicide === true) {
        console.log('Oh, it was just suicide; no need to worry');
      }
    });

    // destroy worker
    worker.destroy();

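The `suicide` flag can be combined with the exit code and signal to decide how to react to an exit. A minimal sketch follows. `classifyExit` is a hypothetical helper that reads only the `suicide` property, so plain objects are enough to try it out.

```javascript
// Hypothetical helper: summarize why a worker exited.
function classifyExit(worker, code, signal) {
  if (worker.suicide === true) {
    return 'voluntary';
  }
  if (signal) {
    return 'killed by ' + signal;
  }
  if (code !== 0) {
    return 'failed with code ' + code;
  }
  return 'exited cleanly';
}

console.log(classifyExit({ suicide: true }, 0, null));   // voluntary
console.log(classifyExit({}, null, 'SIGKILL'));          // killed by SIGKILL
console.log(classifyExit({}, 1, null));                  // failed with code 1
console.log(classifyExit({}, 0, null));                  // exited cleanly
```
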
### worker.disconnect()

When calling this function the worker will no longer accept new connections, but
they will be handled by any other listening worker. Existing connections will be
allowed to close as usual. When no more connections exist, the IPC channel to the worker
will close, allowing it to die gracefully. When the IPC channel is closed the `disconnect`
event will emit. This is then followed by the `exit` event, which is emitted when
the worker finally dies.

Because there might be long-living connections, it is useful to implement a timeout.
This example asks the worker to disconnect, and after 2 seconds it tells the worker to
destroy its remaining connections. An alternative would be to execute `worker.destroy()`
after 2 seconds, but that would normally not allow the worker to do any cleanup if needed.

    var cluster = require('cluster');

    if (cluster.isMaster) {
      var worker = cluster.fork();
      var timeout;

      worker.on('listening', function(address) {
        worker.disconnect();
        timeout = setTimeout(function() {
          worker.send('force kill');
        }, 2000);
      });

      worker.on('disconnect', function() {
        clearTimeout(timeout);
      });

    } else if (cluster.isWorker) {
      var net = require('net');
      var sockets = [];
      var server = net.createServer(function(socket) {
        // connections never end, so remember them for a forced shutdown
        sockets.push(socket);
      });

      server.listen(8000);

      server.on('close', function() {
        // cleanup
      });

      process.on('message', function(msg) {
        if (msg === 'force kill') {
          // net.Server has no destroy(); destroy the open sockets instead
          sockets.forEach(function(socket) {
            socket.destroy();
          });
        }
      });
    }

### Event: 'message'

* `message` {Object}

This event is the same as the one provided by `child_process.fork()`.
In the master you should use this event; in a worker you can also use
`process.on('message')`.

As an example, here is a cluster that keeps count of the number of requests
in the master process using the message system:

    var cluster = require('cluster');
    var http = require('http');

    if (cluster.isMaster) {

      // Keep track of http requests
      var numReqs = 0;
      setInterval(function() {
        console.log("numReqs =", numReqs);
      }, 1000);

      // Count requests
      function messageHandler(msg) {
        if (msg.cmd && msg.cmd == 'notifyRequest') {
          numReqs += 1;
        }
      }

      // Start workers and listen for messages containing notifyRequest
      var numCPUs = require('os').cpus().length;
      for (var i = 0; i < numCPUs; i++) {
        cluster.fork();
      }

      Object.keys(cluster.workers).forEach(function(id) {
        cluster.workers[id].on('message', messageHandler);
      });

    } else {

      // Worker processes have a http server.
      http.Server(function(req, res) {
        res.writeHead(200);
        res.end("hello world\n");

        // notify master about the request
        process.send({ cmd: 'notifyRequest' });
      }).listen(8000);
    }

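When more than one kind of message is exchanged, the `cmd` convention used above can be generalized into a small dispatcher. This is a sketch of one possible design, not part of the cluster API. `createDispatcher` is a hypothetical name.

```javascript
// Hypothetical helper: route incoming messages by their `cmd` field.
function createDispatcher(handlers) {
  return function(msg) {
    if (!msg || typeof msg.cmd !== 'string') {
      return false;  // not a command message
    }
    if (!Object.prototype.hasOwnProperty.call(handlers, msg.cmd)) {
      return false;  // unknown command
    }
    handlers[msg.cmd](msg);
    return true;
  };
}

// Dry run without any worker processes.
var numReqs = 0;
var dispatch = createDispatcher({
  notifyRequest: function() { numReqs += 1; }
});

dispatch({ cmd: 'notifyRequest' });
dispatch({ cmd: 'unknown' });
dispatch('not an object');

console.log(numReqs);  // 1
```

The returned function could be registered with `worker.on('message', dispatch)` in the master.
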
### Event: 'online'

Same as the `cluster.on('online')` event, but emits only when the state changes
on the specified worker.

    cluster.fork().on('online', function() {
      // Worker is online
    });

### Event: 'listening'

* `address` {Object}

Same as the `cluster.on('listening')` event, but emits only when the state changes
on the specified worker.

    cluster.fork().on('listening', function(address) {
      // Worker is listening
    });

### Event: 'disconnect'

Same as the `cluster.on('disconnect')` event, but emits only when the state changes
on the specified worker.

    cluster.fork().on('disconnect', function() {
      // Worker has disconnected
    });

### Event: 'exit'

* `code` {Number} the exit code, if it exited normally.
* `signal` {String} the name of the signal (e.g. `'SIGHUP'`) that caused
  the process to be killed.

Emitted by the individual worker instance, when the underlying child process
is terminated. See [child_process event: 'exit'](child_process.html#child_process_event_exit).

    var worker = cluster.fork();
    worker.on('exit', function(code, signal) {
      if( signal ) {
        console.log("worker was killed by signal: "+signal);
      } else if( code !== 0 ) {
        console.log("worker exited with error code: "+code);
      } else {
        console.log("worker success!");
      }
    });