Author: Bai

As a world-renowned vocational training platform service provider, Weidong Cloud Education focuses on educational applications, provides resources, products and services for vocational education and higher education, and builds a "digital talent development platform". Provide relevant personnel training services for governments, colleges, enterprises, and institutions in 25 countries around the world and 25 domestic provincial administrative regions. Bridging the regional digital divide, assisting the construction of education informatization, and contributing to the balanced development of global education.

在这里插入图片描述

As a unicorn in the education industry, facing ToB customers and many ToC end users in different regions of the country and the world, how to ensure the terminal experience and platform availability becomes the key. During the service process, the Weidong cloud service team encountered the following problems.

problem found

In the process of building an educational informatization platform for a certain place, abnormal access of local users often occurs. In order to solve this problem, the Weidong cloud service team checks the website performance and each link of the network link one by one. After confirming that there is no problem with the platform availability, the Weidong cloud service team will focus on the network environment.

在这里插入图片描述

(The picture comes from the Internet, for illustration only)

Although the above problem was finally solved with the operator, the root cause of the abnormality was the limitation of the local network environment, which caused abnormal user access. However, the Weidong cloud service team checked normally during the remote test, which made it impossible to locate the problem location more quickly, making the fault recovery time longer. Faced with such a problem, how to ensure the availability of users in different regions of the country and even the world, fully grasp the local real network environment and various indicators of website performance, and compress the failure recovery time as much as possible, has become an important pain point for the Weidong cloud education service team.

In the process of serving a customer in a certain place in the southwest, the Weidong cloud service team received a report from a user in a certain region, and would jump to an illegal gambling platform when browsing the platform website, which would cause the risk of loss of user assets. The Weidong cloud service team was also unable to reproduce the related problems during remote testing and related testing through VPN proxy.

在这里插入图片描述

(The picture comes from the Internet, for illustration only)

With the in-depth troubleshooting, when conducting research and interviews on users who reported abnormality, the Weidong cloud service team found that the users who reported abnormality all used the broadband of a small local network operator. After testing, it was found that the hijacking was indeed caused by the operator.

solution

Although there are various monitoring methods, how to monitor more comprehensively to check for leaks and make up. How to ensure the stability and security of daily services has become an important topic of the Weidong cloud service team. After understanding the above problems, Alibaba Cloud communicated with the Weidong cloud service team and agreed that it is the best product with global mass monitoring nodes and non-intrusive "cloud dialing and testing" to solve the problem.

Active monitoring of city availability in key cities:

By configuring network monitoring tasks, select IDC monitoring points in major key cities, monitor the network connectivity of key pages of the target website, and configure a faster monitoring frequency. Once availability problems occur, alarms will be notified in time. The IDC monitoring point is more stable than the LastMile monitoring point, which can reduce the probability of false positives.

page access performance analysis:

The speed of opening web pages is also a key issue that Weidong Cloud Education needs to focus on. For their customers, the speed of web pages directly affects the quality of their customers' online education. For the web page opening speed, Weidong Cloud selected the LastMile monitoring points in all major provinces and cities, and configured browsing tasks to analyze the performance of the website homepage and key pages, focusing on network connection delay, the total number of page request elements and CDN analysis. For quality, after locating the root cause of the problem, relevant suppliers or internal R&D teams will be pushed to optimize.

final effect

With the help of cloud dialing test, Weidong cloud education service team further improves the monitoring system. Use the lowest cost to fully grasp the actual access experience of end users in different regions of the country and even the world. The fault recovery time is shortened by more than 20%, and the fault response efficiency and user satisfaction are greatly improved.

About cloud dial test

As a business-oriented non-intrusive cloud-native monitoring product, Cloud Test has become the best choice. Through Alibaba Cloud's global service network, simulate real user behavior, and continuously monitor the availability and performance of websites and their networks, services, and API ports around the clock. Realize fine-grained problem location at page element level, network request level, and network link level. A wealth of monitoring correlation items and analysis models help enterprises discover and locate performance bottlenecks and experience dark spots in a timely manner, reduce operational risks, and improve service experience and efficiency.

  • Global monitoring node coverage

More than 200,000 LMs worldwide, more than 500 IDC terminal monitoring nodes, 400+ operators at home and abroad, and hundreds of thousands of registered members ensure that the monitoring scale can meet the growing business scale.

  • No need to embed code, out of the box

Zero intrusive monitoring, just enter the URL and perform simple configuration, no R&D support required. Get a complete analysis of website performance data in minutes. Resource package & pay-as-you-go multiple purchase modes to meet the needs of operation and maintenance testing.

  • Business-oriented, preset multiple analysis models

The monitoring period is as fine as the minute level, with more than 20 monitoring-related parameter settings in 7 categories, supporting a variety of mainstream protocols, and providing 7×24-hour fine-grained real-time fault monitoring, alarming and performance analysis services for sites and service ports. From the perspective of the end customer, through the multi-dimensional combined analysis of regions and operators, drill down to analyze the details of a single sample, and use the rich indicator system and chart types to intuitively locate the problem, the affected area and its root cause, analyze the time of pressure drop, and improve the Operational efficiency. Really fine-tuned monitoring.

  • Intelligent alarm, precise positioning

Real-time alarms are realized for the first screen time, overall performance, and availability, rich alarm policy settings, and deep integration with Alibaba Cloud Alarm Center, effectively shortening MTTR. Supports the discovery of page element-level errors, and accurately locates the problem attribution to a single network request process, improving the efficiency of problem location.

In order to meet the testing needs of more enterprises and independent webmasters, Cloud Testing has released monthly resource packages of different specifications and launched limited-time discounts. Click here to see more offers!


阿里云云原生
1k 声望302 粉丝