Top 50 WebRTC Terms Putting WebRTC Media in the Cloud Finally, an intelligent migration path to IMS WebRTC Whitepaper WebRTC:Preparing for a Traffic Explosion
Business Useful WebRTC
Dialogic WebRTC Expo On-stage Demo Transforming Contact Centers
Contact Sales

Related Press Releases

Dialogic’s PowerMedia XMS Enables Reduced CAPEX and OPEX and Paves the Way for Future Virtualization and Any-to-Any Media Processing Dialogic Earns 2014 Best Conferencing Award at WebRTC Conference & Expo VCreaLog Improves Call Center Technology with Dialogic’s PowerMedia XMS VoltDelta Expands Call Center Offerings with Dialogic’s PowerMedia XMS to Enhance Enterprise Customer ServiceTropo and Dialogic Announce Strategic Technology, WebRTC CollaborationDialogic Receives 2014 INTERNET TELEPHONY Product of the Year AwardDialogic, Gintel Upgrade Network Norway without Service InterruptionDialogic Wins CTI Forum and Editor’s Choice Awards for Mobile Video Solutions and Media Gateway Technology Dialogic Earns Ready Now Award at WebRTC Conference & Expo for Showcasing Real-World ImplementationsDialogic to Present Best Practices and Demonstrate Collaborative Communication with Gintel and TeleStax at WebRTC World Exhibition & Conference in ParisAPEX and Dialogic Collaborate on WebRTC Service Delivery PlatformDialogic Introduces Network Fuel to Elevate Performance of Media-Rich Communications Dialogic Wins CTI Forum Editor’s Choice Awards for Media Gateway Technology and Mobile Video SolutionsDialogic Launches Powerful Software Media Server for Mobile, Broadband and Cloud Environments Tolly Group Awards Performance Certification to Dialogic PowerMedia Extended Media Server (XMS)Dialogic Selected as a Light Reading Leading Lights Award Finalist for Best New Product (Mobile)

PowerMedia XMS

PowerMedia XMS

Rich Media Processing and WebRTC

for Developers and Operators.

What is server-side WebRTC media processing?

PowerMedia XMS serves the needs of application developers and network operators with:

  • Cloud-ready, virtualized media processing software
  • Production-ready media server interfaces for web and traditional media applications
  • A full suite of media application capabilities with WebRTC support
  • Comprehensive audio, video, and contact center features

Offloading difficult media handling to PowerMedia XMS’s flexible rich media processing software helps to improve user experience, optimize network efficiency, and reduce development time.

Typical Media Processing applications

PowerMedia XMS can handle an extensive range of real-time media processing needs, including:

  • Multi-party conferencing – low-latency mixing of audio and video, including HD voice and high-resolution video up to HD 720p, and Multi-point Control Unit (MCU) for group communications with the ability to adapt individual streams to optimize the experience for each user
  • Transcoding – any-to-any audio and video codec conversion for a wide-range of fixed, wireless, and web-oriented codecs, including transrating and transizing for video
  • Media interworking – conversion of underlying transport protocols and encryption interworking, including support for a WebRTC Media Gateway
  • Recording – local audio and video recording or interfacing with third party recording systems
  • Stream processing – analyze, insert, and modify the audio or video stream, including for speech recognition, DTMF, video overlays, and much more
  • Person-to-Machine – connect to computer-controlled interfaces, not just other people, for applications such as Interactive Voice (and Video) Response (IVR and IVVR) systems, and speech interaction, or potential use cases such as biometrics, among others.

 Media Processing Examples 

PowerMedia XMS for Developers

Developers can leverage PowerMedia XMS’s programmable APIs to rapidly add sophisticated media handling capabilities to their applications. By offloading difficult media handling requirements to PowerMedia XMS’s specialized software, developers are able to focus on unique aspects of their application without having to become media processing experts. PowerMedia XMS supports web-oriented APIs, a standardized Java interface, and media server scripting languages for easy development with powerful media controls.

PowerMedia XMS for Network Operators

Service Providers and Network Operators can quickly plug PowerMedia XMS into their existing networks using its standardized media control interfaces. PowerMedia XMS is software-based and supports widely used hypervisors for easy integration into virtualized cloud environments. PowerMedia XMS is deployed in a variety of environments, including service providers, enterprises, and contact centers.

PowerMedia XMS for WebRTC

WebRTC is revolutionizing communications by providing a powerful and large open source community-based technology for embedding communications in web browsers and mobile applications. PowerMedia XMS includes support for WebRTC audio, video, and media plane technologies, allowing developers and network operators to extend their applications and services to the web. PowerMedia XMS’s web-oriented APIs and virtualized software are well suited for dynamic, cloud-based environments where WebRTC is typically deployed alongside other web services.

PowerMedia XMS WebRTC Media Gateway

A common WebRTC application is to extend existing SIP-based systems to communicate anywhere over the web. PowerMedia XMS can act as a WebRTC Media Gateway to mediate WebRTC media-plane differences from those of typical existing VoIP networks including encryption interworking, transcoding, and client-based NAT traversal support. PowerMedia XMS has been integrated with a variety of application servers and signaling gateways with HTTP-to-SIP (H2S) functionality and rapidly integrates with others using its web API or standard interfaces.

PowerMedia XMS Media Resource Function (MRF)

Many large-scale telco networks are designed to follow the guidelines and definitions of network functions and interfaces set forth by the 3GPP IMS architectural specifications. PowerMedia XMS is conformant with 3GPP-specified media interfaces and supports the IMS profiles for voice services defined by the GSMA for Long Term Evolution (LTE) networks. This allows XMS to be deployed as a Media Resource Function (MRF) providing key media processing capabilities that may be required by IMS-based services such as VoLTE and RCS. Additionally the conformance to IMS specifications promotes compatibility between legacy telephony networks and evolving IP telecommunication standards.

See our IMS Integration page for more information on Dialogic's MRF and IMS portfolio.

Transcoding Gateway

New codecs are being introduced into the network faster than older ones are being retired. PowerMedia XMS can act as a transcoding gateway, providing interworking of a wide variety of audio and video codecs. PowerMedia XMS’s software nature also means that new codec support can be rapidly added without changing physical DSPs or necessitating complicated firmware upgrades. PowerMedia XMS’s virtualization support also allows for flexible and cost effective cloud-oriented network models.



High performance software-only media processing with excellent call rates, high voice quality PESQ scores and low media latency

Supports the development and deployment of demanding service provider applications (such as conferencing, messaging and IVR services) on common datacenter server infrastructure without the need for specialized hardware

Support for IETF, 3GPP and W3C WebRTC codecs (Opus audio, VP8 video) and technologies to provide real-time video and audio media mixing, security, signaling and NAT/firewall traversal with WebRTC-enabled HTML5 browsers

Facilitates the development and deployment of rich communication apps and services across Web, VoIP/SIP, Mobile and PSTN networks to virtually all connected endpoints
Advanced multimedia processing functionality, including HD Voice and real-time video; video transcoding, transrating and transizing up to HD720p resolution with H.264 and VP8 codecs

Positions solution providers to attract new users and increase target revenue by offering multimedia solutions with a high user Quality of Experience (QoE)

Virtualization (VMware ESXI 5.x, KVM, Oracle VM, Xenserver) support

Promotes lower CAPEX, and deployment of communications solutions on virtual machines in private and service provider datacenters

User-friendly OA&M that includes a web-based GUI and HTTP RESTful Management interface for real-time control and monitoring Helps reduce deployment costs and OPEX by enabling the quick resolution of operational issues
Simple yet flexible licensing model that allows scaling from ten ports to thousands of ports per server Can save CAPEX by allowing solutions to be scaled easily by software upgrade as demand grows

Technical  Specifications

 Session Capacity Typical media sessions per server (specific per server results will depend on a variety of factors, including but not limited to deployment conditions, configurations and equipment):
  • Audio — Up to 2000 sessions of G.711 or 1000 sessions with full-duplex (RTP-RTP) transcoding

  • Video — Up to 450 unidirectional sessions (also includes audio transcoding), depending on system capacity, codec, resolution, and frame rate, etc.

Signaling, Protocol, & Control Interfaces
Control Protocols and Specification Compatibility
  • SIP (RFC3261)
  • GSMA IR.92 for Voice over LTE (VoLTE)
  • GSMA IR.94 for Video over LTE
  • 3GPP TS23.288 for IMS (Mr/Mr’ and Cr interfaces)
  • WebRTC JavaScript API
  • MSRP for multimedia chat and RCS message services
  • RTSP client support for streaming multimedia content from RTSP servers
  • MRCP v2.0/v1.0 for connection to speech servers for ASR/TTS - see “Third Party MRCP Speech Vendor Capability” below
 Media Protocols
  • IPv4, IPv6, and mixed-mode IPv4/IPv6
  • ICE Lite, Trickle ICE
  • HTTP 
 Remote Application Interfaces
  • RESTful API - HTTP-based RESTful web services interface for controlling media processing and SIP signaling operations
  • MSML (RFC5707) – Media Server Markup Language, an XML-based media control API
  • JSR 309 Connector – A Java Community Process Java API software connector to control external servers running PowerMedia XMS
  • VXML v2.1/v2.0 - A W3Cstandard XML interface for specifying interactive voice dialogs for IVR or speech enabled applications.
  • NetAnn (RFC4240) – Basic Network Media Services with SIP for announcements, dialogs, and simple conferences

 Media and Coders

  • Voice & HD Voice play/record
  • Tone generation/detection (DTMF, RFC2833/RFC4733)
  • Call progress analysis
  • Positive Voice Detection (PVD) and Positive Answering Machine Detection (PAMD) for outbound calling scenarios
  • Fax Tone Detection

  Audio Codecs

  • Narrowband codecs: G.711u/a, G.723, G.726, G.729a, G.729b, GSM-FR, GSM-EFR, AMR-NB and iLBC
  • Wideband codecs: Opus, G.722, AMR-WB (G.722.2) – see “Obtaining Third Party Licensing” below
  • Voice activity detection, silence suppression, comfort noise generation
  Audio Conferencing
  • N-way (including HD Voice) audio mixing
  • Conference Recording
  • Automatic Gain Control (AGC)
  • Per party gain/volume control
  • Active talker detection
  • DTMF clamping
  • Coach-pupil (whisper) mode
  • Loudest N-party mixing
  • Privileged party mixing
  • Echo cancellation
  • Play/record
  • Video transcoding, transrating, and transizing
  • Video overlays (text and image overlay with scrolling)
  Video Codecs
  • H.264 Baseline Profile, up to Level 3.1 (HD720p)
  • VP8, up to HD720p
  • MPEG 4 Simple Profile, up to Level 4 (VGA)
  • H.263, H.263+, H.263++ Baseline Profile, up to CIF 
  • Image sizes: HD720p, 4CIF, VGA, CIF, QVGA, QCIF, SQCIF (and custom resolutions) 
  • Frame rates: Up to 30 FPS
  • Bit rates: Up to 2Mbps
  • Video Fast Update (VFU): Configurable responses to I-Frame Update requests
  • Fully adaptive video jitter buffer
  • Packet Loss Concealment (PLC)
  • Dynamic Bitrate Adaptive Encoding
  • RTCP feedback support (TMMBR, FIR, PLI, REMB, Generic NACK)
  Media Handling
  • File operations: HTTP1.1, HTTPS, and/or NFS; RTSP/RTP
  • Audio File Containers: .wav, .pcm, .vox, .aud, .amr, .amb
  • WAV/PCM Codec Formats:  
    • 8k lin PCM, 11k lin PCM, 16 lin PCM, 8k alaw PCM, 8k mulaw PC
  • AMR Codec Formats (RFC4867:
    • AMR-NB(.amr), AMR-WB(.amb)
  • Multimedia File Formats: .3gp via .vid/.aud
Language Support
  • Variable content announcement / language phrasing:
    • "date", "digits", "duration", "month", "money", "number", "silence", "time", "weekday"
  • Customizable to support virtually any language or dialect
  • Built-in professional voice files: US English, Mandarin Chinese, Spanish
  • VMWare ESXi 5.x
  • Kernel-based Virtual Machine (KVM)
  • Oracle VM
  • XEN Virtual Machine
System Management
  • Web GUI
  • Real-time monitoring and management via HTTP RESTful control interface
  • Remotely managed tracing and logging
  • SNMP v2c/v3 for management and traps 

Hardware Support and Minimum System Requirements

  • Hardware: Intel Architecture-based server
  • Operating System (64-bit OS):
    CentOS Release 6.4 ISO installation OR
    RedHat Enterprise Linux 6.4
    Oracle Enterprise Linux 6.4
  • Processor: Intel Dual 56xx or greater
  • Ethernet: Dual 1000Base-TX (RJ-45)
  • Memory: 8 GB RAM minimum
  • Storage: 120 GB HD minimum
Third Party MRCP Speech Vendor Compatability
  • Lumenvox (ASR & TTS)
  • Nuance (ASR & TTS)
  • Vestec (ASR)


  • Refer to the documents listed below for specific details on Dialogic® PowerMedia™ XMS Release 2.3. The documents may periodically be updated with new and updated content throughout the lifetime of the release.

  • Refer to the documents listed below for specific details on Dialogic® PowerMedia™ XMS Release 2.2. The documents may periodically be updated with new and updated content throughout the lifetime of the release.

  • Refer to the documents listed below for specific details on Dialogic® PowerMedia™ XMS Release 2.1. The documents may periodically be updated with new and updated content throughout the lifetime of the release.

  • Refer to the documents listed below for specific details on Dialogic® PowerMedia™ XMS Release 2.0.

  • Refer to the documents listed below for specific details on Dialogic® PowerMedia™ XMS Release 1.1.

Data Sheet

  • Dialogic® PowerMedia™ XMS energizes application delivery by boosting performance with state of the art mixing of media-rich communications; software providing a seamless transition to virtualization and cloud delivery and telco hardened scalability.

  • The Dialogic® PowerMedia™ Media Resource Broker (MRB), a standards compliant, software-based Media Resource Broker that allows application developers, service providers and enterprises to dynamically scale and distribute a media server resource pool across disparate applications and datacenters. When used with one or more PowerMedia™ XMS media servers, the PowerMedia MRB provides a reliable and resource-efficient software-based media-processing platform for scaling rich media applications such as conferencing, transcoding, and Interactive Voice Response (IVR).

Technology Brief

  • Dialogic’s WebRTC Codec Survey results are in! We targeted the survey at Dialogic’s developer community. Respondents included an equitable mix of developers from traditional service providers, established systems integrators, and newer upstarts.
  • Dialogic is pleased to present the results of its 2013 WebRTC Impact Survey that asked 169 key 
    executives of Service Providers and Application Developers to assess the impact of WebRTC on 
    their business models.
  • Delivering carrier class business communications services, such as Interactive Voice Response (IVR), for customer self-service and conferencing applications have become a profitable opportunity for service providers. Value added business communication services have proven to be a natural portfolio fit for service providers delivering core communications bandwidth and wide area networking services to large commercial accounts and government agencies. Thanks to the combined solution set of the Oracle Communications Converged Application Server paired with Dialogic® PowerMedia™ XMS, carriers and service providers can introduce value-added communication services, such as IVR self-service and conferencing solutions, to their portfolio in a cost effective and efficient manner.

White Paper

  • This white paper provides an introduction to HD Voice and discusses its current adoption rate and future potential, including use case examples which paint a picture that HD Voice upgrades to certain network and enterprise infrastructure will be seen as important, and perhaps as a necessity to many.

  • As the industry moves more towards full-scale IP networks, the issue of interconnecting diverse networks and different service providers - each with its own IP network(s) - becomes more important to resolve in order to support the seamless delivery of next generation services. There will be concerns for security and end-to-end quality of service (QoS) to be addressed to make sure the different “moving parts” that make up these services are delivered to the customer in the expected manner.  It will be important to set up the mechanism by which all the participants in the service delivery supply chain are compensated for the value they provide to the overall end-to-end service.

    The GSMA IP eXchange (IPX) provides a visionary approach to the challenges fixed and mobile service providers have with interworking IP-based services and networks. It provides multiple, service-aware connectivity models that provide QoS as well as cost efficient, optimal routing of IP traffic. The same platform can be extended to bridge the gap between IP and legacy TDM networks.

Use the Purchase page linked below to find partners to purchase Dialogic products. The order codes below are for your reference when making a purchase.

Where to Buy

ProductOrder CodeDescription
XMS-MServer-10G01-120-01PowerMedia XMS license for 10 sessions of Voice Media Server (includes WebRTC)
XMS-HDAudio-10G04-120-01PowerMedia XMS license for 10 sessions of Opus, G.722, AMR-WB
XMS-LBRAudio-10G03-120-01PowerMedia XMS license for 10 sessions of G.723, G.729, iLBC
XMS-GSMAMRAudio-10G05-120-01PowerMedia XMS license for 10 sessions of GSM-FR, GSM-EFR, AMR-NB
XMS-MRCPSpeech-10G07-120-01PowerMedia XMS license for 10 sessions of MRCP Speech Server connectivity
XMS-MsgSessMSRP-10G12-120-01PowerMedia XMS license for 10 sessions of Text/Multimedia messaging with MSRP peer
XMS-Adv-Video-10G02-120-01PowerMedia XMS license for 10 sessions of Advanced video up to CIF (for H.263, H.264, MPEG4, VP8)
XMS-HighResVideo-10G06-120-01PowerMedia XMS license for 10 sessions High Resolution Video (up to VGA/720p with H.264, VP8)
XMS-JSR309G08-120-01PowerMedia XMS license to enable JSR309 (per server)