Back to index
An Architecture for Wide Area Multicast Routing
Steve Deering (Xerox), Deborah Estrin (USC), Dino Farinacci (Cisco), Van
Jacobson (LBL), Ching-Gung Liu (USC), Liming Wei (USC)
One-line summary:
Mcast routing that can dynamically switch between core-based
trees and shortest-path trees to provide efficient behavior for both
sparse and dense mcast groups, using explicit joins (rather than
explicit prunes) and periodic refreshes of soft state in routers.
Overview/Main Points
- Motivation:
- link-state and distance-vector mcast routing scale
poorly for
large but sparse groups.
- Explicit pruning scales terribly for large but sparse groups.
- core-based trees trade less router state for longer
delivery paths and traffic concentration; shortest-path
delivery trees trade more router state for shorter paths
and traffic dispersal.
- CBT's work well for sparse groups, SPT's for dense
groups. Want a way to flexibly choose between them
without breaking compatibility/interoperability or tearing
down the group and recomputing its routes.
- Requirements:
- efficient sparse group support
- high quality data distribution (application can choose low
delay delivery if needed)
- unicast routing protocol independence (why?)
- interoperability w/existing mcast routing
- robustness: no single point of failure or hard state
- Sparse group support: designate some routers as RP's (rendezvous
points) for subtree below them.
- To join an mcast group RP-tree:
- receiver sends PIM-join upstream to nearest RP (may be
1 or more router hops away)
- RP is in the mcast tree of an upstream source. When the
source starts sending, RP will receive a PIM-register
message.
- At that point it forwards the original PIM-join towards
the source.
- Note that the RP is now acting as the forwarding agent for
packets going to the receiver.
- To switch from RP-tree to shortest-path tree:
- PIM-join-SPT message sent upstream towards RP.
- RP propagates message up to mcast source S.
- S then starts forwarding multicast directly on a
shortest-path tree to the receiver (which probably
circumvents RP).
- Once receiver starts receiving mcast packets that were not
sent through RP, it sends RP a PIM-prune message, which
(a) is forwarded to S, and (b) causes the RP to set up a
"negative cache" entry for the receiver (i.e. no longer
forward mcast packets for that receiver).
- Fault tolerance:
- Reachability beacons periodically broadcast downstream
from RP's. If receivers detect an RP has gone away, they
can target some other RP.
- Since PIM messages are periodically refreshed, they don't
need to be positively acked.
- Therefore all state in routers is effectively soft state,
though a few mcast packets may be missed if an RP reboots.
Relevance
Adaptive mcast routing that leverage the best of both CBT and SPT, and
uses the more efficient explicit-join rather than explicit-prune. A
step forward for efficiency and scalability of wide area multicast.
Flaws
- No discussion of how policy (when to switch) might be
implemented.
- Might the intermediate routers want to control this policy
(i.e. "Please switch from RP to SP because I am running out of
routing table entries!") There seems to be no way for them to do
this since everything is strictly receiver-driven (all messages
originate at receivers).
Back to index