Help us improve your experience.

Let us know what you think.

Do you have time for a two-minute survey?

 
 

AI 네이티브 운영 개요

요약 이 주제에서는 Juniper Mist™ 포털의 AI 네이티브 운영 기능의 이점을 소개합니다.

업무가 문제 해결, 사용자 불만 조사 또는 네트워크 성능 추적과 관련된 업무인 경우 Juniper Mist 포털의 AI 네이티브 운영(AIOps) 기능을 사용하면 이러한 모든 작업이 더 쉬워진다는 것을 알게 될 것입니다.

AIOps가 Juniper Mist에 내장되어 있어 IT 운영팀이 분산된 네트워크의 모든 복잡성을 파악하고 관리할 수 있습니다. Mist AI는 빅데이터, 분석, 머신러닝 기능을 적용하여 네트워크 정보를 지능적으로 선별함으로써 이벤트를 정확히 찾아내고 잠재적인 문제를 나타내는 패턴을 인식합니다. 또한 Mist AI는 문제의 근본 원인을 진단하고 조치를 권고할 수 있습니다.

이러한 기능은 문제 해결에 소요되는 시간을 단축하고 긍정적인 사용자 경험을 보장하기 위해 사전 조치를 취할 수 있도록 합니다. 더 이상 사건의 범위에 대해 추측할 필요가 없습니다. 근본 원인을 파악하기 위해 더 이상 건초 더미에서 바늘 찾기를 통해 로그 파일을 검색할 필요가 없습니다. 더 이상 패킷을 캡처할 수 있도록 문제를 재현하는 데 어려움을 겪지 않아도 됩니다.

10분 문제 해결 비디오 데모

이 데모에서는 문제 해결을 위해 모니터 페이지, Marvis 작업 및 Marvis 쿼리 언어를 사용하는 방법을 보여줍니다.

Hey, this is Joel at Juniper Networks. And in this video, I'm going to show you how to troubleshoot wireless and wired problems on your network using the Mist cloud. Now, in this case, we've got a network that's deployed using Mist access points and Juniper switches, and all those access points and switches are reporting data up to the Mist cloud that we can use to troubleshoot problems.

So let's come up with a scenario. Let's say that you've just received a phone call or a support ticket from a teacher who has class that's starting in just a few minutes, and they have a device that's critical for the class that they can't get connected to the wireless network, and so they need you to fix this problem, and you know that you have a relatively short time span to get this resolved because class is starting really soon. So the first place that you might look is you might come to the monitor view in the Mist dashboard, and you might look at the service level expectations or SLEs for short.

Basically, these service level expectations are here to help you understand whether the users on your network are having a good experience or not, and the way that we do this is that we measure over 150 unique client states. We look at all these different states that a wifi device can be in to help you understand what's going on with that client and to help you understand network-wide what the experience looks like for all the users. For example, we measure how long it takes devices to get connected on the network.

That's both for the wired side and the wireless side that counts both sides of the network. We look to see how often things are able to get successfully connected to the network coverage, roaming performance, throughput capacity, and so on and so forth. So let's say that, uh, let's say that you want to find this problem.

So one way that you might be able to do this is that if the teacher says that they aren't able to get connected to the network, you might first look at the successful connect service level expectation. Now for the last seven days worth of data, we're looking at a seven day time span right now. We've seen a 73% success rate.

Well, that means that there's been a 23% failure rate, and so we might be able to find the device that's having a hard time in here. And remember, I'm showing you the manual way of doing this first. I'll show you the automatic way in just a bit.

So let's drill down into successful connects a little bit to take a closer look at what's going on now for all of our service level expectations, we get classifiers that show us why devices are failing that service level expectation. So for example, for successful connects, we see that there's an association, an authorization, and a DHCP classifier. So let's do some quick math.

If we see a 73% success rate, that means that there's a 27% failure rate. Again, that's for the last seven days worth of data. 1% of that 27% was association problems, maybe driver problems, things like that.

99% of the time it was due to authorization issues. That might be a bad passphrase. Maybe we can't reach a radius server and so on and so forth.

And 0% of the time it wasn't due to, there were no DHCP problems that we saw. So then from here, we can go to the distribution tab and we can understand which aspects of the network are affected by this problem. Like for example, if we look at the list of access points, we can see which access points are seeing failures here.

And we can see that LD_GLN_AP and LD_JSW_AP are seeing a little bit more failures than usual, whereas the rest of the APs are pretty much behaving normally. Uh, things look pretty standard. Uh, therefore, you know, this is what the typical kind of performance that we expect on the network.

You can also look to see which frequency bands are experiencing problems, which device operating systems are having trouble, which device types are having problems, and which of your SSIDs are having issues with Successful Connects. Now notice that right now I've just clicked on Successful Connects and I haven't drilled down into a classifier, but you can do that at any time. For example, we could click on authorization and now we're only looking at the authorization issues.

So the next thing I want to do is I want to come over here to the affected items list. This is going to show us a list of host names that are affected due to unsuccessful connections. These are devices that are failing this service level expectation.

You can see at this device, we only get a Mac address for it because it probably hasn't successfully connected before, has failed a hundred percent of the time. This device has failed 14% of the time. And this device has failed 3% of the time.

So let's take a closer look at one of these devices to see exactly what's going on. I'm going to click on this one that just has a Mac address to take a closer look. Uh, and so first we get, uh, first we get an answer from Marvis.

Marvis tells us, and by the way, Marvis is a name that we assign our AI and machine learning engine that helps us understand what's going on automatically. I'll show you more about Marvis in just a little bit. It says the client failed to connect on a hundred percent of attempts due to authorization problems.

Interesting. Let's take a closer look to see what's going on. If we click on the view insights button, that's going to show us all the events that have happened for this device within whatever time span that we have selected.

Now, right now I have seven days selected, but you could select the last hour. You could select, you know, between Monday and, and a Thursday, you can look at any time span you want within the last seven days, and you can see that this device has been going through a brutal process where it deauthenticates from the access point, uh, and then there's an authorization failure and then it deauth, it's, it gets deauth. And then there's another authorization failure.

And this happens over and over and over. And the, the reason code here is that there was a WPA four-way handshake timeout. That sounds a lot like a bad passphrase to me.

It looks like this device just has the incorrect WPA2 passphrase. Now notice that there's also a little paperclip icon next to each one of these bad events that occurs. Mist will automatically take a packet capture whenever a wireless client enters a bad state, and this is just a normal standard old, plain old PCAP file that you can download, that you can open up in your favorite packet analysis tool, like, uh, like Wireshark or IPA or OmniPeak or any of your favorite packet analysis tools.

And so this is taken automatically to help you get to the root of the problems very quickly. Now, uh, like I mentioned earlier, we're going to show you any events that the client device goes through here and notice all of these different events that will show, will show you positive events like a DHCP success, neutral events like a disassociation. It's not really good or bad.

It's just something that happens. We'll show you negative events, events like a DHCP aborted or a DNS failure. We measure over 150 unique client states, and we'll expose many of those events here so you can look and see exactly what happened to a device, uh, within, uh, within a seven day period.

Now that feature applies to a lot more than just access points or rather clients. You can look at access points as well. Uh, so for example, we could go look up a specific access point.

Like let's look at LD_testbed_MB and we can go look and see what has been going on with this access point. Have there been any, uh, have there been any RadSec changes? Uh, have there been any certificate certificates regenerated? Have there been any DNS failures or maybe man in the middle attacks that have had, that have been detected? We'll show you anything that has happened for this client or for this access point within the last seven days that applies to clients, switches, uh, wired clients, and of course, wireless clients, you can look at all that data to see what's going on. So now let me show you the automatic way of doing troubleshooting.

And that's going to be with Marvis. Remember, I just showed you the manual way. Let's look at the automatic way.

Now, if we go to Marvis, this is where we can ask Marvis questions about what is going on on the wireless network. Like for example, you can say, show unhappy clients and Marvis is going to return a list of devices that are correlated with success and a list of devices that are correlated with failure. As you can see, Kilimanjaro has been highly correlated with failure.

And so let's check this device out to see what's going on. We can click on it and click on troubleshoot and Marvis is going to automatically using the power of AI and machine learning in the cloud. It's going to automatically troubleshoot this client device to figure out what's going on.

It looks like the client failed to connect on 100% of attempts due to authentication failures because of a possible PSK mismatch. Yeah, that looks like a bad passphrase to me. And from here we can investigate this further.

For example, we can go look to see what other aspects of the network correlate with this failure. Is it a client specific problem? Is it happening to everyone on the network? Or we can go look at all of the different events that have occurred for this client device, including the dynamic packet captures that I talked about earlier. Now this goes way deeper than just troubleshooting wireless devices.

We can actually troubleshoot the wired side of the network as well. Now we have the capability to, uh, to, uh, manage all of our Juniper EX switches. And so for example, we can go look at a specific switch.

This one's got three access points plugged into it, and we can even go look at client devices on our switches, uh, to see what kind of events have been happening to those, those devices over time. And Marvis brings this information in as well. We can ask it to troubleshoot access points.

Like for example, we can say, how is, um, let's say, how is Marvis. And this is actually the name of one of the access points on our demo network here. It looks like the AP has a low ethernet speed of a hundred megabits per second.

Here's the switch it's connected to. Here's the port it's connected to as well. Now, if you don't want to ask Marvis questions about the network, um, you can also go to Marvis actions and Marvis actions will automatically find things in the network that need to be fixed.

For example, if we look under the switch actions here, we can see that there's a missing VLAN action. So it looks like one of our switches here has two missing VLANs and Marvis detected this for us automatically. This AP connected to on port 21 is missing three, these three VLANs.

And so is this access point. They're both missing VLANs and we use machine learning to find those. We also use machine learning to automatically find bad cables and negotiation mismatches.

So Marvis can help you find these issues automatically. Thanks for watching our video about how to do troubleshooting with mist and Marvis. If you have any questions, please be sure to let us know, and I hope you have a great rest of your day.

Bye now.

대시보드

Juniper Mist 대시보드를 사용하면 다음을 확인할 수 있습니다.

  • 한 눈에 해석할 수 있는 성공/실패 지표

  • 문제가 발생한 시기와 위치를 정확히 보여주는 시각화

  • 모든 인시던트에 대한 패킷 캡처

  • 근본 원인 분석

더 좋은 점은 영향을 미치기 전에 많은 문제를 발견할 수 있다는 것입니다. SLE(Service Level Expectations) 대시보드를 사용하면 기대에 미치지 못하는 조건을 빠르게 파악할 수 있습니다. 사고가 발생하기 전에 조치를 취하십시오.

마비스

Marvis Virtual Network Assistant 서브스크립션이 있는 경우 다음과 같은 혜택도 얻을 수 있습니다.

  • 네트워크 성능 및 사용자 경험 개선을 위한 AI 권장 조치

  • 문제 식별 및 문제 해결을 통한 대화 지원

  • 보다 체계적인 문의를 위한 강력한 쿼리 언어

  • 선제적 잠재적 문제 식별