PowerMedia XMS

Rich Media Processing and WebRTC for Developers and Operators

What is server-side WebRTC media processing?

PowerMedia XMS serves the needs of application developers and network operators with:

  • Cloud-ready, virtualized media processing software
  • Production-ready media server interfaces for web and traditional media applications
  • A full suite of media application capabilities with WebRTC support
  • Comprehensive audio, video, and contact center features

Offloading difficult media handling to PowerMedia’s flexible rich media processing software helps to improve the user experience, optimize network efficiency, and reduce development time.

Typical Media Processing applications

PowerMedia XMS handles an extensive range of real-time media processing needs, including:

  • Multi-party conferencing – low-latency mixing of audio and video, including HD, and Multi-point Control Unit (MCU) for group communications with the ability to adapt individual streams to optimize the experience for each user
  • Transcoding – any-to-any audio and video codec conversion for a wide range of fixed, wireless, and web-oriented codecs, including transrating and transizing for video
  • Media interworking – conversion of underlying transport protocols and encryption interworking including support for a WebRTC Media Gateway
  • Recording – local audio and video recording or interface with 3rd party recording systems
  • Stream processing – analyze, insert, and modify the audio or video stream including speech recognition, DTMF, video overlays, and much more
  • Person-to-Machine – connect to computer-controlled interfaces, not just other people, for applications such as Interactive Voice (and Video) Response (IVR) systems, speech interaction, biometrics, and more

PowerMedia XMS for Developers

Developers can use PowerMedia XMS’s programmable APIs to rapidly add sophisticated media handling capabilities to their applications. By offloading difficult media handling requirements to PowerMedia XMS’s specialized software, developers can focus on the unique aspects of their application without having to become media processing experts. PowerMedia XMS supports web-oriented APIs, a standardized Java interface, and media server scripting languages for easy development with powerful media controls.

PowerMedia XMS for Network Operators

Service providers and Network Operators can quickly plug PowerMedia XMS into their existing networks using its standardized media control interfaces. PowerMedia XMS is 100% software based and supports widely used hypervisors for easy integration into virtualized cloud environments. PowerMedia XMS is deployed in a variety of environments, including service providers, enterprises, and contact centers.

PowerMedia XMS for WebRTC

WebRTC is revolutionizing communications by providing a powerful open source technology, backed by a large community, for embedding communications in web browsers and mobile applications. PowerMedia XMS includes support for WebRTC audio, video, and media plane, allowing developers and network operators to extend their applications and services to the web. PowerMedia’s web-oriented APIs and virtualized software are well suited to the dynamic, cloud-based environments where WebRTC is typically deployed alongside other web services.

PowerMedia XMS WebRTC Media Gateway

One of the most common WebRTC applications is extending existing SIP-based systems to communicate anywhere over the web. PowerMedia XMS acts as a WebRTC Media Gateway to mediate WebRTC media-plane differences from those of typical existing VoIP networks including encryption interworking, transcoding, and client-based NAT traversal support. PowerMedia XMS is pre-integrated with a variety of application servers and signaling gateways with HTTP-to-SIP (H2S) functionality and rapidly integrates with others using its web API or standard interfaces.

PowerMedia XMS Media Resource Function (MRF)

The 3GPP’s IMS architecture is the de facto specification for large scale telco networks and is a fundamental requirement for future federated communications services in that domain. PowerMedia XMS is conformant with 3GPP-specified media server interfaces. PowerMedia XMS provides critical media processing capabilities required by IMS-based services such as VoLTE and RCS. It simplifies and streamlines media processing management by consolidating disparate application-server based media processing into a centralized media server architecture as specified by the 3GPP.

See our IMS migration page for more information on Dialogic's MRF and IMS portfolio. 

Transcoding Gateway

New codecs are introduced into the network far faster than older ones are retired. PowerMedia XMS can act as a transcoding gateway, providing interworking of a wide variety of audio and video codecs. Because PowerMedia XMS is software, new codec support can be added rapidly without changing physical DSPs or performing complicated firmware upgrades. PowerMedia XMS’s virtualization support also allows for much more flexible and cost-effective cloud-oriented network models.



  • Feature: Support for IETF and W3C WebRTC codecs (Opus audio, VP8 video) and technologies to provide real-time video and audio media mixing, security, signaling, and NAT/firewall traversal with WebRTC-enabled HTML5 browsers
    Benefit: Facilitates the development and deployment of rich communication apps and services across Web, VoIP/SIP, and PSTN networks to virtually all connected endpoints
  • Feature: Up to 2,000 sessions of high performance software media processing with excellent call rates, voice quality PESQ scores, and low media latency on common datacenter server infrastructure
    Benefit: Supports the development and deployment of demanding service provider applications such as conferencing, messaging, and IVR services without the need for specialized hardware
  • Feature: Advanced multimedia processing functionality, including HD Voice and real-time video; video transcoding, transrating, and transizing up to HD720p resolution with H.264 and VP8 codecs
    Benefit: Allows solution providers to attract new users and increase target revenue by offering multimedia solutions with a high user Quality of Experience (QoE)
  • Feature: RESTful interface for multimedia processing and SIP call control via HTTP control messages
    Benefit: Simplifies development for programmers who do not have an in-depth familiarity with traditional telecommunications interfaces
  • Feature: MSML (RFC 5707) interface for multimedia control protocol support
    Benefit: Enables the delivery of standards-based solutions in IMS and other SIP-based environments
  • Feature: JSR 309 connector software, compliant with the Java Community Process specification for a protocol agnostic API that provides media server control
    Benefit: Allows development of real-time multimedia applications and services on standard Java Platform, Enterprise Edition (J2EE) converged (SIP & HTTP) application servers, leveraging application redundancy and high availability capabilities of J2EE servers
  • Feature: NetAnn (RFC4240) and VXML interfaces
    Benefit: Provides standards-based control interfaces for announcements, IVR, and speech enabled applications
  • Feature: Virtualization (VMware ESXi 5.x, KVM, and Oracle VM) support
    Benefit: Promotes lower CAPEX and deployment of communications solutions on virtual machines in private and service provider datacenters
  • Feature: User-friendly OA&M that includes a web-based GUI and HTTP RESTful interface for real-time control and monitoring
    Benefit: Helps reduce deployment costs and OPEX by enabling the quick resolution of operational issues
  • Feature: Simple yet flexible licensing model that allows scaling from ten ports to thousands of ports per server by software upgrade
    Benefit: Can save CAPEX by allowing solutions to be scaled easily as demand grows

Technical Specifications

Session Capacity
Typical media sessions per server (specific per-server results will depend on a variety of factors, including but not limited to deployment conditions, configurations, and equipment):
  • Audio — Up to 2000 sessions of G.711 or 1000 sessions with full-duplex (RTP-RTP) transcoding

  • Video — Up to 450 unidirectional sessions per server (also includes audio transcoding), depending on system capacity, codec, resolution, and frame rate
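As a rough planning aid, the quoted ceilings can be turned into a server-count estimate. The following is only a sketch: the function name and the session mixes are hypothetical, the per-server figures are the "up to" values from the list above, and the assumption that a server's capacity is shared linearly between workloads is not something the data sheet states. Real results will vary with the deployment factors already noted.

```python
import math

# "Up to" per-server ceilings quoted in the session capacity list above.
MAX_G711_SESSIONS = 2000        # G.711 audio, no transcoding
MAX_TRANSCODED_SESSIONS = 1000  # full-duplex (RTP-RTP) audio transcoding
MAX_VIDEO_SESSIONS = 450        # unidirectional video sessions


def servers_needed(g711=0, transcoded=0, video=0):
    """Best-case lower bound on the number of servers for a session mix.

    Assumption (not from the data sheet): each workload consumes a linear
    fraction of a server, so the fractional loads can simply be added.
    """
    load = (g711 / MAX_G711_SESSIONS
            + transcoded / MAX_TRANSCODED_SESSIONS
            + video / MAX_VIDEO_SESSIONS)
    return max(1, math.ceil(load))
```

For example, `servers_needed(g711=3000, transcoded=500)` adds 1.5 and 0.5 server loads and returns 2. Because the inputs are ceilings, the result should be read as a minimum rather than a sizing guarantee.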

Signaling, Protocol, & Control Interfaces
Signaling and Session Protocols
  • SIP (RFC3261)
  • WebRTC JavaScript API
  • RTSP client support for streaming multimedia content from RTSP servers
  • MRCPv2 for connection to speech servers for ASR/TTS - see “Third Party MRCP Speech Vendor Compatibility” below
 Media Protocols
  • IPv4
  • ICE Lite, Trickle ICE 
 Remote Application Interfaces
  • RESTful API - HTTP-based RESTful web services interface for controlling media processing and SIP signaling operations
  • MSML (RFC5707) – Media Server Markup Language, an XML-based media control API
  • JSR 309 Connector – A Java Community Process Java API software connector to control external servers running PowerMedia XMS
  • VXML - A standard XML interface for specifying interactive voice dialogs for IVR or speech enabled applications.
  • NetAnn (RFC4240) – Basic Network Media Services with SIP for announcements, dialogs, and simple conferences
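To give a feel for what an XML-based media control request looks like, the sketch below builds a small MSML document that asks a media server to play one audio file to a connection. The element and attribute names follow RFC 5707; the helper name, the connection identifier, the dialog name, and the file URI are hypothetical, and the example makes no claim about which MSML elements PowerMedia XMS supports or how the document is transported to it.

```python
import xml.etree.ElementTree as ET


def build_play_request(connection_id, audio_uri):
    """Return an MSML document that starts a dialog playing one audio file."""
    msml = ET.Element("msml", version="1.1")
    dialog = ET.SubElement(
        msml,
        "dialogstart",
        target=f"conn:{connection_id}",   # connection the dialog attaches to
        type="application/moml+xml",
        name="announcement",              # hypothetical dialog name
    )
    play = ET.SubElement(dialog, "play")
    ET.SubElement(play, "audio", uri=audio_uri)
    return ET.tostring(msml, encoding="unicode")


# Hypothetical connection identifier and file location.
print(build_play_request("abc123", "file://welcome.wav"))
```

In SIP-based deployments, documents like this are typically carried in the body of SIP messages exchanged between the application server and the media server.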

 Media and Coders

  • Voice & HD Voice play/record
  • Tone generation/detection (DTMF, RFC2833/RFC4733)
  • Call progress analysis
  • Positive Voice Detection (PVD) and Positive Answering Machine Detection (PAMD) for outbound calling scenarios
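For readers unfamiliar with RFC 2833/RFC 4733 tone signaling, the sketch below packs the 4-byte telephone-event payload that carries one DTMF digit inside an RTP packet: an event code, an end flag plus a 6-bit volume, and a 16-bit duration. The function name and default volume are hypothetical, and the code only illustrates the payload layout defined by the RFC, not how PowerMedia XMS generates or detects tones.

```python
import struct

# Event codes that RFC 4733 assigns to the DTMF digits.
DTMF_EVENTS = {**{str(d): d for d in range(10)},
               "*": 10, "#": 11, "A": 12, "B": 13, "C": 14, "D": 15}


def pack_telephone_event(digit, duration, volume=10, end=False):
    """Build the 4-byte telephone-event payload for one DTMF digit.

    duration is in RTP timestamp units (8000 per second for an 8 kHz clock);
    volume is the tone power in dBm0 with the sign dropped (0 to 63).
    """
    if not 0 <= volume <= 63:
        raise ValueError("volume must fit in 6 bits")
    if not 0 <= duration <= 0xFFFF:
        raise ValueError("duration must fit in 16 bits")
    # Bit 7 is the end-of-event flag, bit 6 is reserved (0), bits 0-5 are volume.
    flags = (0x80 if end else 0) | volume
    return struct.pack("!BBH", DTMF_EVENTS[digit], flags, duration)
```

With an 8 kHz clock, `pack_telephone_event("5", 800)` describes the digit 5 after 100 ms of tone; a sender sets `end=True` on the final packets of the event.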

  Audio Codecs

  • Narrowband codecs: G.711u/a, G.723, G.726, G.729a, G.729b, GSM-FR, GSM-EFR, AMR-NB and iLBC
  • Wideband codecs: Opus, G.722, AMR-WB (G.722.2) – see “Obtaining Third-Party Licenses” below
  • Voice activity detection, silence suppression, comfort noise generation
  Audio Conferencing
  • N-way (including HD Voice) audio mixing
  • Automatic Gain Control (AGC)
  • Per party gain/volume control
  • Active talker detection
  • DTMF clamping
  • Coach-pupil (whisper) mode
  • Loudest N-party mixing
  • Privileged party mixing
  • Echo cancellation
  • Play/record
  • Video transcoding, transrating, and transizing
  • Video overlays (text and image overlay with scrolling)
  Video Codecs
  • H.264 Baseline Profile
  • VP8
  • MPEG 4 Baseline Profile
  • H.263, H.263+, H.263++ Baseline Profile
  • Image sizes: HD720p, 4CIF, VGA, CIF, QVGA, QCIF, SQCIF (and custom resolutions) 
  • Frame rates: Up to 30 FPS
  • Bit rates: Up to 2Mbps
  • Video Fast Update (VFU): Configurable responses to I-Frame Update requests
  • Fully adaptive video jitter buffer
  • Packet Loss Concealment (PLC)
  • Dynamic Bitrate Adaptive Encoding
  • RTCP feedback support (TMMBR, FIR)
  Media Handling
  • File operations: HTTP1.1, HTTPS, and/or NFS; RTSP/RTP
  • Audio File Containers: .wav, .pcm, .vox, .aud
    • Codec Formats: 8k lin PCM, 11k lin PCM, 16k lin PCM, 8k alaw PCM, 8k mulaw PCM
  • Multimedia File Formats: .3gp via .vid/.aud
Language Support
  • Variable content announcement / language phrasing:
    • "date", "digits", "duration", "month", "money", "number", "silence", "time", "weekday"
  • Customizable to support any language or dialect
  • Built-in professional voice files: US English, Mandarin Chinese, Spanish
Virtualization Support
  • VMware ESXi 5.x
  • Kernel-based Virtual Machine (KVM)
  • Oracle VM
  • XEN Virtual Machine
System Management
  • Web GUI
  • Real-time monitoring and management via HTTP RESTful control interface
  • Remotely managed tracing and logging

Hardware Support and Minimum System Requirements

  • Hardware: Intel Architecture-based server
  • Operating System (64-bit OS):
    CentOS Release 6.4 ISO installation OR
    RedHat Enterprise Linux 6.4
    Oracle Enterprise Linux 6.4
  • Processor: Intel Dual 56xx or greater
  • Ethernet: Dual 1000Base-TX (RJ-45)
  • Memory: 8 GB RAM minimum
  • Storage: 120 GB HD minimum
Third Party MRCP Speech Vendor Compatibility
  • Lumenvox (ASR & TTS)
  • Nuance (ASR & TTS)
  • Vestec (ASR)
Obtaining Third-Party Licenses
Using the AMR-WB resource in connection with a Dialogic® product described herein does not grant the right to practice the AMR-WB standard. To seek a patent license agreement to practice the standard, contact (as of January 2013) the VoiceAge Corporation at


Data Sheet

  • Dialogic® PowerMedia™ XMS energizes application delivery by boosting performance with state-of-the-art mixing of media-rich communications. The software provides a seamless transition to virtualization and cloud delivery, along with telco-hardened scalability.

Technology Brief

  • Dialogic’s WebRTC Codec Survey results are in! We targeted the survey at Dialogic’s developer community. Respondents included an equitable mix of developers from traditional service providers, established systems integrators, and newer upstarts.
  • Dialogic is pleased to present the results of its 2013 WebRTC Impact Survey that asked 169 key executives of Service Providers and Application Developers to assess the impact of WebRTC on their business models.
  • Delivering carrier class business communications services, such as Interactive Voice Response (IVR) for customer self-service and conferencing applications, has become a profitable opportunity for service providers. Value-added business communication services have proven to be a natural portfolio fit for service providers delivering core communications bandwidth and wide area networking services to large commercial accounts and government agencies. Thanks to the combined solution set of the Oracle Communications Converged Application Server paired with Dialogic® PowerMedia™ XMS, carriers and service providers can introduce value-added communication services, such as IVR self-service and conferencing solutions, to their portfolio in a cost-effective and efficient manner.

White Paper

  • This white paper provides an introduction to HD Voice and discusses its current adoption rate and future potential, including use case examples suggesting that HD Voice upgrades to certain network and enterprise infrastructure will be seen as important, and perhaps as a necessity, by many.

  • As the industry moves toward full-scale IP networks, the issue of interconnecting diverse networks and different service providers, each with its own IP network(s), becomes more important to resolve in order to support the seamless delivery of next generation services. Security and end-to-end quality of service (QoS) concerns will need to be addressed to make sure the different “moving parts” that make up these services are delivered to the customer in the expected manner. It will also be important to set up the mechanism by which all the participants in the service delivery supply chain are compensated for the value they provide to the overall end-to-end service.

    The GSMA IP eXchange (IPX) provides a visionary approach to the challenges fixed and mobile service providers have with interworking IP-based services and networks. It provides multiple, service-aware connectivity models that provide QoS as well as cost efficient, optimal routing of IP traffic. The same platform can be extended to bridge the gap between IP and legacy TDM networks.

Use the Purchase page linked below to find partners to purchase Dialogic products. The order codes below are for your reference when making a purchase.

Where to Buy

Product              Order Code  Description
XMS-MServer-10       G01-120-01  PowerMedia XMS license for 10 ports of Voice Media Server (includes WebRTC)
XMS-HDAudio-10       G04-120-01  PowerMedia XMS license for 10 ports of Opus, G.722, AMR-WB
XMS-LBRAudio-10      G03-120-01  PowerMedia XMS license for 10 ports of G.723, G.729, iLBC
XMS-GSMAMRAudio-10   G05-120-01  PowerMedia XMS license for 10 ports of GSM-FR, GSM-EFR, AMR-NB
XMS-MRCPSpeech-10    G07-120-01  PowerMedia XMS license for 10 ports of MRCP Speech Server connectivity
XMS-Adv-Video-10     G02-120-01  PowerMedia XMS license for 10 ports of Advanced video up to CIF (for H.263, H.264, MPEG4, VP8)
XMS-HighResVideo-10  G06-120-01  PowerMedia XMS license for 10 ports of High Resolution Video (up to VGA/720p with H.264, VP8)
XMS-JSR309           G08-120-01  PowerMedia XMS license to enable JSR309 (per server)
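
Since the per-port order codes above each cover 10 ports, the number of license units for a deployment is the required port count rounded up to the next multiple of ten. The helper below is a hypothetical illustration of that arithmetic; it assumes only the 10-port units listed here are available and says nothing about pricing or other bundle sizes.

```python
PORTS_PER_LICENSE = 10  # each per-port order code above covers 10 ports


def license_units(ports_required):
    """Number of 10-port license units needed to cover ports_required."""
    if ports_required < 0:
        raise ValueError("ports_required must not be negative")
    # Integer ceiling division: round up to the next whole unit.
    return -(-ports_required // PORTS_PER_LICENSE)


# Hypothetical deployment: 250 voice ports, 25 of them with HD audio codecs.
order = {
    "G01-120-01": license_units(250),  # XMS-MServer-10
    "G04-120-01": license_units(25),   # XMS-HDAudio-10
}
```

In this example the order comes to 25 units of the voice media server license and 3 units of the HD audio license, since 25 ports do not fit in two 10-port units.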