头图

Every day in the Olympic Games, ice and snow miracles are staged, and the moment of occurrence is captured to condense the most exciting and moving pictures of sports and humanities, making "ice dance", "snow dance", "beauty of speed" and "beauty of volley" like a feast Bloom, let the "golden moment" and "national general style" become eternal echoes of time and space. The realization of all this comes from the intelligent production capability on the cloud provided by the AI editorial department of Alibaba Cloud Video Cloud - AI Cloud Smart Cut.

If the opening ceremony of the 2022 Winter Olympics is a story told by China to the world, unfolding a Chinese-style romance, then the instant beauty of the Winter Olympics is more like the Qingchuan wooden slips that record history, unfolding characters and stories of the extraordinary.

The Beijing Winter Olympics are drawing to a close, and various events are on full display, with ice and snow miracles being staged every day. In this Winter Olympics, for the first time, the Chinese delegation has achieved 7 major events and 15 sub-items "all-event participation", of which 35 are the first time to stand on the Winter Olympics stage. Rich material.

In order to take into account the effectiveness, splendor, humanity and aesthetics of the short video content of the Winter Olympics, CCTV Sports New Media and the technical team of the main station have joined forces with Alibaba Cloud Video Cloud and Alibaba Dharma Academy to introduce the AI editorial department's intelligent production tool "AI Cloud". "Smart Cut" can complete the intelligent content understanding of multiple events in real time, intelligently and automatically generate a large number of wonderful video materials in a very short period of time, covering multiple description dimensions such as arena actions, event content, various shots, etc., and generate collection materials of aesthetic themes .

Chinese athletes Su Yiming, Gu Ailing, Jin Boyang, Sui Wenjing/Han Cong, etc. all performed well in the women's freestyle skiing platform, men's snowboarding platform and figure skating. Whether it is a gold medal or a breakthrough in self, the Winter Olympics they pass on The spirit is as inspiring as fire this winter.

AI Yunzhijian conducted a multi-dimensional analysis of the video content as soon as the competition was completed, and completed the generation of exciting materials. At the same time, based on the cross-video highlight production capability, the theme highlight video was produced for the audience at the first time. So far, it has been automated. 200+ games, more than 30,000 clips were produced, and a large number of thematic videos generated were instantly presented on CCTV sports new media and spread rapidly.

In the dissemination of sports event content, AI Cloud Smart Clipper can efficiently, quickly and comprehensively provide strong productivity for the Winter Olympics event broadcast, quickly seize the opportunity to release, and also bring timely and high-quality event sensation to the global event audience. It has created more possibilities for the media industry to deeply develop the value of sports media copyright content.

For the content of the Olympic events, AI Yunzhijian has set a wealth of intelligent templates for aesthetic themes, such as ice hockey, figure skating, speed skating, short track speed skating, etc. , to create the theme "Snow Dance", and at the same time, from the special perspective of speed events, such as figure skating spins, ice hockey goals, etc., to present the "beauty of speed", and for ski events with rich jumping actions, to create "the beauty of the sky" ”, it can be said that through intelligent video cloud technology, it can fully capture the instant aesthetic light and shadow of the event.

Productivity of new content for the Winter Olympics

The application of AI and machine learning in the field of sports media video production is the general trend of the industry. With the rapid evolution of digital media and the continuous change of audience media content consumption habits, fragmented short video content has become the mainstream of various content consumption fields. The media content space is no exception.

This Winter Olympics is even more focused on the Winter Olympics of science and technology, in which AI plays a crucial role. Based on the AI editorial department, its cloud-based intelligent production capability "AI Cloud Smart Scissors" has played a huge role in the production of event content. It has become a productive force for the new content of the Winter Olympics in science and technology.

Taking the competition itself as the core, AI Yunzhijian defines and extracts the exciting information of the content of the competition, identifies and analyzes it from various dimensions such as competition video, commentary audio, character field notes, etc., and uses multi-modal fusion technology. Feature collection effects in complex scenes. AI Cloud Smart Clipper can perform efficient AI content analysis on sports event videos, and can generate various types of highlights in real time. In addition to important clips in a single event such as exciting action shots and athlete highlights, it also supports the beauty of national generals and volleys. The production of high-quality videos of various complex themes such as the expedition of teenagers, etc., realizes the multi-level short video production capacity coverage of video content analysis, multi-type video material production, and cross-video complex theme video generation.

AI Yunzhijian relies on the powerful streaming media processing capabilities of Alibaba Cloud Video Cloud to ensure that the highlights of each game are generated within 3-5 minutes, and then quickly released by the platform, which greatly improves the media's ability to seize opportunities Enjoy the feeling of the Winter Olympics with the public.

Figure 1 AI cloud smart scissor flow chart

As shown in the figure above, the intelligent production process of AI cloud smart scissors mainly includes two steps:

First, the AI model needs to understand the video of the event. Based on the long-term accumulation in the AI field, AI Cloud Smart Clipper can deeply understand the fine-grained behavior, field events, cultural events, and shot types of various sports events. The clips are evaluated for aesthetics, action splendor, and diversity, which are equivalent to the eyes and brains of the entire system. Only by seeing a lot, seeing carefully, thinking fully, and thinking fast can they be able to compete in the fierce Winter Olympics. , to present the audience with exquisite content as soon as possible. Second, based on the various types of clips and multi-index evaluations output by the AI model, the material production module will select materials based on the matching weights, produce a large number of selected materials, and also output a variety of themed highlights.

At the same time, in response to the theme of the Green Winter Olympics, AI Yunzhijian adopted a single-video understanding model for the first time to analyze the content of multi-event, multi-source, and multi-type videos, produce multi-type video materials, and generate videos with complex themes across videos. multi-level short video production.
The video understanding model has three outstanding content values:

• Can identify numerous fine-grained movements across multiple events such as freestyle skiing, figure skating, snowboarding, ice hockey, speed skating, short track speed skating, and capture wonderful moments;

• It can identify the non-competitive actions in the video of the competition, perceive the cheers of the audience, the emotions of the players, and the key moments such as awards and gold medals;

• The lens type can be distinguished, and the intelligent combination of multiple types of materials can be carried out.

Putting the burden of completing such multiple complex tasks in one model also brings huge challenges to the generalization ability of AI Cloud Smart Scissors' AI models.

New Algorithms Behind New Content on Winter Olympics Cloud

New content is presented through AI cloud smart scissors, and new intelligent algorithm technology is used in the cloud-based intelligent production of the Winter Olympics. In essence, AI Yunzhijian deconstructs, analyzes, and scores video events based on intelligent algorithm models, and finally generates intelligent video materials based on diversity strategies and the diversity scores output by AI models.

Relying on cutting-edge technology, the AI model can realize content analysis and highlight material production for multi-event, multi-source, and multi-type videos with less computing resource requirements.

In collaboration with the technical output of Alibaba Cloud Video Cloud, the algorithm engineers of Alibaba DAMO Academy adopted Alibaba's newly developed pre-training model technology LOOK (this technology has been accepted by ICLR 2022, a top conference in the field of artificial intelligence). Compared with the common training method that requires all samples of the same category to be close to a central feature, LOOK can only require similar samples of the same type to be closer in the model training process, retaining more feature degrees of freedom.

It can be considered that this is an improvement from a process of "seeking common ground and eliminating differences" to "seeking common ground while reserving differences". Because more effective information is retained in the training process, it also makes the representation ability of model features more general. Finally, based on this general representation Based on the basic model, a number of lightweight multi-branch task models are constructed to complete multiple tasks.

Because the same basic representation model is shared, the additional computational burden of multiple task branches is almost negligible compared to a single task branch in terms of computational consumption, but it can achieve the same AI capabilities as using multiple models directly.

It is based on this technology that AI Cloud Smart Cut can support the short video production tasks of the Winter Olympics faster, higher and stronger.

Figure 2 Schematic diagram of pre-training model technology LOOK

In addition to using the pre-training model technology, since the video data of the Winter Olympics is "never seen" by the model, in order to ensure the robustness of the model and the stability of the calculation results, Alibaba's newly developed open set recognition technology NGC (which has been accepted as an oral presentation at ICCV 2021, a top conference in computer vision) is also introduced. The AI model will use both the confidence of the model prediction and the geometric structure of the feature to jointly determine the final result, which also makes the AI cloud smart scissor debut at the Winter Olympics for the first time, but it is also quite "stable".

Figure 3 Schematic diagram of the open set recognition algorithm NGC

In addition, Alibaba DAMO Academy has accumulated a large number of technologies in the field of video understanding, including basic model representation, time series feature modeling, self-supervision representation, etc., through the ability output of Alibaba Cloud Video Cloud AI Cloud Smart Cut, all in this Winter Olympics It was featured in the conference, and it was also open sourced in the EssentialMC2 technical framework ( https://github.com/alibaba/EssentialMC2 ), in order to promote the technical development of the community in the field of video content understanding.

Created new audiovisuals for top events many times

As early as during the 2018 World Cup, Alibaba Cloud Video Cloud AI editorial department focused on using the technology of "video AI + cloud editing + media asset management" to produce highlights and star highlights in real time to meet the needs of fans to relive the games and chase stars. .

In the 2018 World Cup, CCTV5 adopted the video AI technology of Alibaba Cloud Video Cloud AI editorial department to realize the first pass detection, playback detection, dangerous shot detection, foul detection, movement trajectory analysis and attack rhythm analysis, etc. AI technology has replaced huge and complex high-definition live production equipment, and efficiently and real-time production of event highlights, so that the wonderful can not be missed.

After four years of technical tempering and product polishing, the AI editorial department has successively supported the featured highlights and themed production of various events such as football, basketball, curling, figure skating, short track speed skating, and skiing, helping users to effectively improve the production of videos Efficiency, making content faster, more exciting, and more beautiful.

The Winter Olympics is coming to an end, and the video AI technology of the AI editorial department has been successfully implemented in this Olympics. This is another milestone in the application of the event, and it is also a broad beginning for the application of video AI in the sports industry and other industries. Having experienced the technical support for such a large-scale event as the Centennial Olympics, Alibaba Cloud Video Cloud can more maturely and stably handle video analysis and processing in event scenarios. AI technology will also penetrate into various industries, helping industry customers to efficiently improve the quality of new content. Production efficiency allows each event to have a completely different new audio-visual experience, and also allows the humanistic beauty of the event to bloom.


【AI editorial department】
As an intelligent media production product of Alibaba Cloud Video Cloud, the AI editorial department is the infrastructure of the content production industry in the intelligent era and an end-to-end product that can be delivered locally. The AI editorial department delivers an intelligent content production line for new media. With the help of big data technology and artificial intelligence, it can realize the automatic, batch and intelligent production of video manuscripts and graphic manuscripts, so as to provide faster, better and wider Seize the new media market services.

【AI Yunzhi Scissors】
As the intelligent production capability of the AI editorial department for sports event theme highlights, AI Cloud Smart Clipper can produce materials in real time during the live broadcast of events, providing high-quality and efficient short video content production technology for exciting events.

Cloud Video Cloud Multimedia AI Experience Center

"Video Cloud Technology", your most noteworthy public account of audio and video technology, pushes practical technical articles from the frontline of Alibaba Cloud every week, where you can communicate with first-class engineers in the audio and video field. Reply to [Technology] in the background of the official account, you can join the Alibaba Cloud video cloud product technology exchange group, discuss audio and video technology with industry leaders, and obtain more latest industry information.

CloudImagine
222 声望1.5k 粉丝