page.title=Implementing GCM Server @jd:body
The server side of Google Cloud Messaging (GCM) consists of two components:
A full GCM implementation requires both a client implementation and a server implementation. For more information about implementing the client side, see Implementing GCM Client.
Before you can write client apps that use the GCM feature, you must have an application server that meets the following criteria:
Here are the basic steps you follow to implement your 3rd-party app server:
Currently GCM provides two connection servers: HTTP and XMPP (CCS). You can use them separately or in tandem. XMPP (CCS) messaging differs from HTTP messaging in the following ways:
This section gives an overview of sending messages. For details of message syntax, see Server Reference.
Here is the general sequence of events that occurs when a 3rd-party application server sends a message (the details vary depending on the platform):
The following sections describe the basic components involved in sending a request. See the Server Reference for details.
Required. When your app server sends a message in GCM, it must specify a target.
For HTTP you must specify the target as one of the following:
registration_ids
: For sending to 1 or more devices (up to 1000).
When you send a message to multiple registration IDs, that is called a multicast message.notification_key
: For sending to multiple devices owned by a single user.For CCS (XMPP) you must specify the target as:
There are various options the 3rd-party app server can set when sending a downstream message to a client app. See the Server Reference for details. Here are a few examples of possible options:
If you want to test your request (either JSON or plain text) without delivering
the message to the devices, you can set an optional HTTP parameter called
dry_run
with the value true
. The result will be almost
identical to running the request without this parameter, except that the message
will not be delivered to the devices. Consequently, the response will contain fake
IDs for the message and multicast parameters.
Optional. If you are including a payload in the message, you use the data
parameter to include the payload. This applies for both HTTP and XMPP.
See the Server Reference for details on sending and receiving messages.
This section has a discussion of general messaging topics.
Every message sent in GCM has the following characteristics:
But despite these similarities, messages can behave very differently depending on their particular settings. One major distinction between messages is whether they are collapsed (where each new message replaces the preceding message) or not collapsed (where each individual message is delivered). Every message sent in GCM is either a "send-to-sync" (collapsible) message or a "message with payload" (non-collapsible message).
A send-to-sync (collapsible) message is often a "tickle" that tells a mobile application to sync data from the server. For example, suppose you have an email application. When a user receives new email on the server, the server pings the mobile application with a "New mail" message. This tells the application to sync to the server to pick up the new email. The server might send this message multiple times as new mail continues to accumulate, before the application has had a chance to sync. But if the user has received 25 new emails, there's no need to preserve every "New mail" message. One is sufficient. Another example would be a sports application that updates users with the latest score. Only the most recent message is relevant.
GCM allows a maximum of 4 different collapse keys to be used by the GCM server at any given time. In other words, the GCM server can simultaneously store 4 different send-to-sync messages per device, each with a different collapse key. For example, Device A can have A1, A2, A3, and A4. Device B can have B1, B2, B3, and B4, and so on. If you exceed this number GCM will only keep 4 collapse keys, with no guarantees about which ones they will be.
Unlike a send-to-sync message, every "message with payload" (non-collapsible message) is delivered. The payload the message contains can be up to 4kb. For example, here is a JSON-formatted message in an IM application in which spectators are discussing a sporting event:
{ "registration_id" : "APA91bHun4MxP5egoKMwt2KZFBaFUH-1RYqx...", "data" : { "Nick" : "Mario", "Text" : "great match!", "Room" : "PortugalVSDenmark", }, }
A "message with payload" is not simply a "ping" to the
mobile application to contact the server to fetch data. In the aforementioned IM
application, for example, you would want to deliver every message, because every
message has different content. To specify a non-collapsible message, you simply
omit the collapse_key
parameter. Thus GCM will send each message
individually. Note that the order of delivery is not guaranteed.
GCM will store up to 100 non-collapsible messages. After that, all messages are discarded from GCM, and a new message is created that tells the client how far behind it is.
The application should respond by syncing with the server to recover the discarded messages.
If your application does not need to use non-collapsible messages, collapsible messages are a better choice from a performance standpoint. However, if you use collapsible messages, remember that GCM only allows a maximum of 4 different collapse keys to be used by the GCM server per registration ID at any given time. You must not exceed this number, or it could cause unpredictable consequences.
You can use the time_to_live
parameter in the send request
to specify the maximum lifespan of a message.
The value of this parameter must be a duration from 0 to 2,419,200 seconds, and
it corresponds to the maximum period of time for which GCM will store and try to
deliver the message. Requests that don't contain this field default to the maximum
period of 4 weeks.
Here are some possible uses for this feature:
GCM usually delivers messages immediately after they are sent. However,
this might not always be possible. For example, if the platform is Android,
the device could be turned off, offline, or otherwise unavailable.
Or the sender itself might request
that messages not be delivered until the device becomes active by using the
delay_while_idle
flag. Finally, GCM might intentionally delay messages
to prevent an application from consuming excessive resources and negatively
impacting battery life.
When this happens, GCM will store the message and deliver it as soon as it's feasible. While this is fine in most cases, there are some applications for which a late message might as well never be delivered. For example, if the message is an incoming call or video chat notification, it will only be meaningful for a small period of time before the call is terminated. Or if the message is an invitation to an event, it will be useless if received after the event has ended.
Another advantage of specifying the expiration date for a message is that GCM
will never throttle messages with a time_to_live
value of 0 seconds.
In other words, GCM will guarantee best effort for messages that must be delivered
"now or never." Keep in mind that a time_to_live
value of
0 means messages that can't be delivered immediately will be discarded. However,
because such messages are never stored, this provides the best latency for
sending notifications.
Here is an example of a JSON-formatted request that includes TTL:
{ "collapse_key" : "demo", "delay_while_idle" : true, "registration_ids" : ["xyz"], "data" : { "key1" : "value1", "key2" : "value2", }, "time_to_live" : 3 },
GCM allows multiple parties to send messages to the same application. For example, suppose your application is an articles aggregator with multiple contributors, and you want each of them to be able to send a message when they publish a new article. This message might contain a URL so that the application can download the article. Instead of having to centralize all sending activity in one location, GCM gives you the ability to let each of these contributors send its own messages.
To make this possible, all you need to do is have each sender generate its own project number. Then include those IDs in the sender field, separated by commas, when requesting a registration. Finally, share the registration ID with your partners, and they'll be able to send messages to your application using their own authentication keys.
Note that there is limit of 100 multiple senders.
When a 3rd-party server posts a message to GCM and receives a message ID back, it does not mean that the message was already delivered to the device. Rather, it means that it was accepted for delivery. What happens to the message after it is accepted depends on many factors.
In the best-case scenario, if the device is connected to GCM, the screen is on, and there are no throttling restrictions (see Throttling), the message will be delivered right away.
If the device is connected but idle, the message will still be
delivered right away unless the delay_while_idle
flag is set to true.
Otherwise, it will be stored in the GCM servers until the device is awake. And
that's where the collapse_key
flag plays a role: if there is already
a message with the same collapse key (and registration ID) stored and waiting for
delivery, the old message will be discarded and the new message will take its place
(that is, the old message will be collapsed by the new one). However, if the collapse
key is not set, both the new and old messages are stored for future delivery.
Collapsible messages are also called send-to-sync messages.
Note: There is a limit on how many messages can
be stored without collapsing. That limit is currently 100. If the limit is reached,
all stored messages are discarded. Then when the device is back online, it receives
a special message indicating that the limit was reached. The application can then
handle the situation properly, typically by requesting a full sync.
Likewise, there is a limit on how many collapse_key
s you can have for
a particular device. GCM allows a maximum of 4 different collapse keys to be used
by the GCM server per device
any given time. In other words, the GCM server can simultaneously store 4 different
send-to-sync messages, each with a different collapse key. If you exceed this number
GCM will only keep 4 collapse keys, with no guarantees about which ones they will be.
See Send-to-sync messages for more information.
If the device is not connected to GCM, the message will be stored until a
connection is established (again respecting the collapse key rules). When a connection
is established, GCM will deliver all pending messages to the device, regardless of
the delay_while_idle
flag. If the device never gets connected again
(for instance, if it was factory reset), the message will eventually time out and
be discarded from GCM storage. The default timeout is 4 weeks, unless the
time_to_live
flag is set.
Finally, when GCM attempts to deliver a message to the device and the
application was uninstalled, GCM will discard that message right away and
invalidate the registration ID. Future attempts to send a message to that device
will get a NotRegistered
error. See
How Unregistration Works for more information.
Although is not possible to track the status of each individual message, the Google Cloud Console stats are broken down by messages sent to device, messages collapsed, and messages waiting for delivery.
To prevent abuse (such as sending a flood of messages to a device) and to optimize for the overall network efficiency and battery life of devices, GCM implements throttling of messages using a token bucket scheme. Messages are throttled on a per application and per collapse key basis (including non-collapsible messages). Each application collapse key is granted some initial tokens, and new tokens are granted periodically therefter. Each token is valid for a single message sent to the device. If an application collapse key exhausts its supply of available tokens, new messages are buffered in a pending queue until new tokens become available at the time of the periodic grant. Thus throttling in between periodic grant intervals may add to the latency of message delivery for an application collapse key that sends a large number of messages within a short period of time. Messages in the pending queue of an application collapse key may be delivered before the time of the next periodic grant, if they are piggybacked with messages belonging to a non-throttled category by GCM for network and battery efficiency reasons.