Building AI-Powered Network Diagnostics

What if an AI could diagnose network issues faster than your best network engineer? That’s the question we set out to answer.

The Problem Space

Our support team handles hundreds of network-related tickets per week. “WiFi is slow.” “POS can’t connect.” “Kitchen display is offline.” Each one requires someone to:

Log into the ticketing system
Look up the location’s network devices
SSH into routers and access points
Run various diagnostic commands
Interpret the results
Formulate a response

Average time: 15-20 minutes per ticket. Multiply that by 400 tickets a week, and you’re looking at 100+ hours of skilled engineer time just on diagnostics.

Enter MCP

Model Context Protocol (MCP) is Anthropic’s standard for connecting AI models to external tools. Instead of fine-tuning a model on network diagnostics, we give it access to the same tools our engineers use.

The architecture is simple:

User describes problem
        ↓
Claude analyzes and decides what to check
        ↓
MCP Server executes network API calls
        ↓
Results returned to Claude
        ↓
Claude interprets and responds

Building the MCP Server

Our MCP server exposes tools for:

Device lookup - Find all network devices at a location
Ping tests - Check connectivity between devices
Throughput tests - Measure actual bandwidth
Client enumeration - See what’s connected to the network
Event logs - Check recent network events

Each tool maps to an API call to either Meraki (our wireless vendor) or Pronto (our router management platform).

The Results

Early testing shows the AI can correctly diagnose common issues about 80% of the time. More importantly, it does it in under 60 seconds instead of 15 minutes.

The types of issues it handles well:

ISP outages - Detects when the WAN link is down
WiFi interference - Identifies channel congestion
Client density - Spots when too many devices are on one AP
DHCP exhaustion - Recognizes when IP pools are full

What’s Next

We’re working on:

Automated remediation - For simple fixes, let the AI take action
Symptom pattern matching - Learn from past tickets to get smarter
Proactive alerting - Detect issues before they become tickets

The goal isn’t to replace network engineers. It’s to handle the routine 80% so engineers can focus on the interesting 20%.