Author | Zhou Xiaoli
Editor | Liu Jingfeng
On October 27, the RTE2022 Real-Time Internet Conference Media Day was held in Beijing. Shengwang released the first professional book “Real-time Vientiane” focusing on the analysis of application scenarios in the real-time interactive industry. It deeply analyzed 20+ tracks in the real-time interactive industry, nearly 200 scene, and reveal the audio and video big data of Shengwang for the first time, providing a comprehensive reference and reference for developers and entrepreneurs in the global RTE industry.
The Real-Time Internet Conference started in 2015 and was held by Shengwang, the founder of the Asia-Pacific Audio and Video Technology Conference. At that time, Shengwang has been promoted to a unicorn in the audio and video segment, and broke the “three noes” state of RTC technology evangelism in China, that is, no industry conferences, no professional books, and no professional media community.
By this year, Shengwang’s Real-time Internet Conference has been held for the eighth time. At the media day, Peng Xiaohuan, vice president of Shengwang Marketing, also introduced the highlights of the 8th Real-Time Internet Conference to be held online from November 1-4 this year, and reviewed the development of the conference over the past eight years.
For example, in the first conference, it was predicted that Lianmai Live Broadcasting would promote the popularity of live broadcasting. As a result, in 2016, there was a thousand broadcast wars, and Lianmai Interactive became the outlet of live broadcasting; in the third conference in 2017, the agenda of the conference included the topic of e-commerce live broadcasting. , and then the e-commerce live broadcast began to explode in 2018; in 2019, online concerts became a reality…
Peng Xiaohuan, Vice President of Shengwang Marketing, is introducing Shengwang’s real-time Internet conference
“This year’s RTE conference is also the one with the richest agenda in the past eight years. The number of online registrations for the real-name system has exceeded 5,000. Centering on the theme of “Gathering in Vientiane”, the conference will be divided into a main forum and three themed days, which are Developer Day, Industry Day and four sub-forums, with guests from many well-known enterprises focusing on and discussing the hottest topics of the year. In addition, this year’s conference also moved the traditional exhibition visit to the online, allowing the audience to experience an immersive experience based on real-time A virtual exhibition of interactive technology.” Peng Xiaohuan introduced.
1. RTE hot list, providing new ideas for overseas enterprises
At last year’s RTE conference, Shengwang released the “RTE Vientiane Map” based on real-time interactive scenarios, covering 20+ industry tracks and 200+ scenarios such as pan-entertainment, IoT, education, finance, medical care, and enterprise collaboration.
This year, Shengwang compiled the “RTE Vientiane Map” into a book “Real-time Vientiane”, which is also the first professional book in the real-time interactive industry focusing on application scenario analysis. “Real-time Vientiane” mainly includes three core contents:
In-depth analysis of nearly 200 scenarios of real-time interaction: a more detailed introduction to real-time interaction scenarios in the fields of pan-entertainment, IoT, education, finance, medical care, enterprise collaboration, and industry, and further analysis of the technical difficulties of real-time interaction in different scenarios.
Multi-dimensionally revealing the audio and video big data of Shengwang: For the first time, the big data of terminal equipment using RTC in seven regions of mainland China, the Middle East, North America, South America, Europe, India, and Southeast Asia is revealed, including the list of the TOP30 models of RTC consumption in each region, and each region. The proportion of RTC usage of low-end machines, the overlap rate of terminal equipment in various regions, and the proportion of commonly used networks for equipment, etc., provide corresponding strategies for developers and enterprises going overseas for their local business layout, model adaptation, and performance optimization.
“Real-time Vientiane” also analyzes the correlation analysis of audio and video freezing rates on user business indicators in scenarios such as chat rooms, game voice, werewolf killing, 1V1 video calls, live shows, e-commerce live broadcasts, and video blind dates. , which clearly shows the impact of audio and video freezing rates on users’ channel staying time and the use of RTC retention rates in different scenarios, which will bring practical guiding reference value to the entire industry and enterprises.
In addition, the conference also brought the 2022 global regional RTE scene hot list: a picture showing the hot and innovative RTE scenes in different regions of the world.
Popular RTE scenarios and innovative scenarios in different regions of the world
For example, last year’s “online karaoke room” was only one of the emerging scenarios in mainland China. This year, it has spread to Southeast Asia, the Middle East, Africa and other regions, becoming a local emerging scenario. At the same time, “Metaverse” is developing rapidly in Japan and South Korea. It has become a popular scene in this year’s TOP1 from an emerging scene last year. The RTE heat list released by Shengwang can provide new scenario ideas for overseas enterprises to a certain extent.
2. Institute of Information and Communications Technology & Sound Network, released real-time audio and video certification standards
At the media day, Zhang Rui, director of the Intelligent Product Evaluation Department of the Tel Terminal Laboratory of the China Academy of Information and Communications Technology, shared the standard system established by the real-time interactive joint laboratory.
He said, “The real-time interactive standard system is divided into two parts: general standards for audio and video experience and scene standards. The general standard is an evaluation framework that combines the communication capabilities of real-time audio and video and user experience, covering audio and video experience, subjective and objective evaluation. The scene The standard will be more integrated with practical applications, establish multiple network simulation scenarios, real-time interactive system network adaptability in various network environments, and evaluate end-to-end experience.”
According to Shengwang’s statistics on nearly 10,000 applications in several major domestic application stores in education, pan-entertainment, shopping, finance, medical care, corporate communications and other industries, the penetration rate of real-time audio and video in 2021 will exceed 30%. In the future, the penetration rate of real-time audio and video is likely to reach 50% in some key industries.
Therefore, the establishment of a real-time interactive standard system will help to promote the healthy development of the entire audio and video industry from the perspective of the user side, the developer client side, and the practitioners side.
From the user side, with the continuous explosion of new scenarios of real-time interaction, the experience of each scenario is different, which requires enterprises to continuously improve the user experience and begin to transform from online to presence, to low-latency, low-card pause, high audio and video quality change;
· From the perspective of developers and customers, the end-to-end performance needs to be improved, and users’ needs need to be converted into standard indicators to improve product performance;
· From the perspective of the practitioners, the complexity of the scenarios increases, which requires multi-party collaboration, and the standards need to be unified. Developers need to establish a perceptible, traceable, and comparable performance indicator system that uses different scenarios.
For example, video conferencing is one of the most frequently used real-time interactive application scenarios on a daily basis. In the context of the digital transformation of enterprises, the collaborative office platform has become a super entrance for B-side traffic. As an important tool to reduce costs and increase efficiency, video conferencing is the most indispensable part, but at the same time, the conference scene is also one of the most technically challenging scenes in real-time interactive applications, mainly reflected in the following aspects:
The first is the diversity of terminals. Conference scenarios need to be able to flexibly initiate or join conferences anytime, anywhere. Therefore, full-scene coverage is required, combining software and hardware, not only general-purpose mobile phones or PCs, but also many dedicated hardware devices. meeting room. Scenarios where many people participate in a conference room together will have higher requirements for audio and video, such as ultra-high-definition, anti-noise reduction, etc.;
The second is the complexity of the environment, including different acoustic environments, different lighting conditions, and different network environments;
The third is multi-person concurrency, which mainly includes the number of participants in a channel, the number of people who turn on the camera and microphone at the same time, and the downstream inflow data that subscribes and watches at the same time, all of which will be more concurrent than other real-time interactive application scenarios;
The fourth is screen sharing, which is used in at least half of the meetings. The stability, clarity, and model adaptation of screen sharing greatly affect the user experience.
Therefore, in response to these challenges, the video conferencing scene of SoundNet will add special test items under the key usage scenarios of the industry on the basis of the general standard of real-time audio and video experience, such as multi-person conference, screen sharing quality, 4k60 frames per second ultra-high definition effect, etc.
In addition, the multi-device in the same place, AI noise reduction, de-reverberation, two-way suppression effects, and spatial audio low-network high-definition capabilities have also been improved.
In terms of scene standards, audio and video are more integrated with practical applications. This year, smart doorbell door locks have become popular, video calls have become standard in smart door locks, and video doorbell door lock scenarios have also become an indicator that defines the end-to-end experience.
Finally, Zhang Rui introduced that four series of common standards for real-time interactive audio and video experience will also be officially released in November.