| 故障一点通 >> 交换产品线 >> 3. 二层技术相关故障 >> 3.4 接口类 >> 3.4.2 POE >> POE无法供电故障处理 >> 【快速完整】收集排查方案 >> |
快速收集排查方案
1、确认故障现象:
1)确认故障出现时间点:是运行一段时间突然出现的,还是刚部署就出现的;
2)确认故障范围:是多台交换机都出现无法供电的问题,还是单台交换机出现;一台交换机上是所有受电设备都无法被供电,还是部分受电设备;
3)确认故障设备信息:
3.1)确认出现问题的交换机型号以及无法被供电的受电设备型号;
3.2)收集一台交换机端口接入多少个受电设备,受电设备的功耗。若是AP,需确认现场AP除自身裸机外,外挂天线等负载都有哪些;
注:各个设备的接地情况,拓扑中的poe设备、网线走线过程、AP以及天线,建议都进行拍照;
2、若是刚部署就出现无法供电,需先确认PD设备和交换机是否存在兼容性问题:
1)供电/受电线序
1.1)我司部分交换机支持用数据线供电(1、2、3、6),部分交换机支持用空闲线供电(4、5、7、8);
1.2)所以要确认受电设备是否只支持数据线或只支持空闲线受电,还是数据线空闲线都支持;
2)供电/受电电流、电压
确认我司交换机供电电流、电压范围,以及受电设备受电的电流、电源范围,比对是否存在不兼容的情况;
3、替换网线观察:
找1-2个故障受电设备,将交换机和受电设备之间的网线替换为自带的超五类网线测试,需要确保自带网线的质量;(poe供电对于网线的要求较高,客户现场的网线可能是批量制作,不排除批量网线质量问题)
4、核对交换机配置:
4.1)确认接口是否开启poe供电
Ruijie#show run int Gx/y //查看接口是否开启poe enable;GX/Y表示具体的接口号,需根据客户环境调整配置
Ruijie(config)#int Gx/y
//GX/Y表示具体的接口号,需根据客户环境调整配置
Ruijie(config-if-GigabitEthernet
x/y)#poe enable
//开启poe供电功能
注意:在没有接入PD 设备的端口上使用这个命令,可能导致对端设备被错误的上电烧毁,请确保端口在接入PD 设备的时候使用该命令。
Ruijie# show poe interface fastEthernet [interface-id]
查看legacy选项是开启还是关闭的,如果是关闭的,开启兼容性功能检查看故障是否解决,命令如下:
Ruijie(config)#int Gx/y
//GX/Y表示具体的接口号,需根据客户环境调整配置
Ruijie(config-if-GigabitEthernet
x/y)#poe legacy //开启端口的PD设备兼容功能
Ruijie(config-if-GigabitEthernet
x/y)#no poe legacy //关闭端口的PD设备兼容功能
Ruijie(config)#poe mode energy-saving //尝试将交换机的供电模式更改为节能模式,若要改回自动模式,命令为poe
mode auto
Ruijie(config)#poe
disconnect-mode {ac | dc}
//修改交换机断接侦测模式观察(AC和DC)
5、show命令确认基本信息:
------------先打开日志显示开关----------
Ruijie#terminal monitor
Ruijie#conf t
Ruijie(config)#logging on
Ruijie(config)#logging console
Ruijie(config)#logging monitor
--------以下在特权模式下收集(ruijie#)-----------
show version
show run
show int status
show
mac-address-table
show
mac-address-table count
show int count
rate
show int count
summary
show poe
interfaces GX/Y //GX/Y表示具体的故障接口号,需根据客户环境调整配置;查看故障端口的供电状态
show poe
interface configuration
show poe
interface status
//需要在故障时多收集几次,最好每隔5s收集一次,收集5次
show poe
powersupply
//需要在故障时多收集几次,最好每隔5s收集一次,收集5次
show
running-config interface GX/Y //GX/Y表示具体的故障接口号,需根据客户环境调整配置;
show
temperature
show log
6、底层信息收集:
注意:底层信息风险性较高,收集不当可能导致设备死机,需要重启交换机才能恢复,请联系4008111000工程师协助收集
1)收集poe错误信息
Ruijie(config)# poe support
show-errors
Ruijie#sd
Ruijie(sd)#
-------------如下在sd模式(Ruijie(sd)#)下收集------------
sh console on
sh pdt dump
sys //读取全局状态数据结构信息
sh pdt dump
port [port_id] //如1口,sh pdt dump port 1
sh pdt ups //读取当前ups环境变量的数值
sh pdt push on //开启或关闭ti驱动上报信息调试开关。注:该信息可能信息量比较大,可以在后面执行
sh pdt push off
//开启或关闭ti驱动上报信息调试开关。注:该信息可能信息量比较大,可以在后面执行
sh pdt read
0x28 0x00 1
sh pdt read
0x28 0x02 1
sh pdt read
0x28 0x04 1
sh pdt read
0x28 0x06 1
sh pdt read
0x28 0x08 1
sh pdt read
0x28 0x0a 1
sh pdt read
0x28 0x0c 1
sh pdt read
0x28 0x0d 1
sh pdt read
0x28 0x0e 1
sh pdt read
0x28 0x0f 1
sh pdt read
0x28 0x10 1
sh pdt read
0x28 0x12 1
sh pdt read
0x28 0x14 1
sh pdt read
0x28 0x2A 1
sh pdt read
0x28 0x2B 1
sh pdt read
0x28 0x30 2
sh pdt read
0x28 0x32 2
sh pdt read
0x28 0x34 2
sh pdt read
0x28 0x36 2
sh pdt read
0x28 0x38 2
sh pdt read
0x28 0x3a 2
sh pdt read
0x28 0x3c 2
sh pdt read
0x28 0x3e 2
sh pdt read
0x29 0x00 1
sh pdt read
0x29 0x02 1
sh pdt read
0x29 0x04 1
sh pdt read
0x29 0x06 1
sh pdt read
0x29 0x08 1
sh pdt read
0x29 0x0a 1
sh pdt read
0x29 0x0c 1
sh pdt read
0x29 0x0d 1
sh pdt read
0x29 0x0e 1
sh pdt read
0x29 0x0f 1
sh pdt read
0x29 0x10 1
sh pdt read
0x29 0x12 1
sh pdt read
0x29 0x14 1
sh pdt read
0x29 0x2A 1
sh pdt read
0x29 0x2B 1
sh pdt read
0x29 0x30 2
sh pdt read
0x29 0x32 2
sh pdt read
0x29 0x34 2
sh pdt read
0x29 0x36 2
sh pdt read 0x29
0x38 2
sh pdt read
0x29 0x3a 2
sh pdt read
0x29 0x3c 2
sh pdt read
0x29 0x3e 2
sh pdt read
0x2a 0x00 1
sh pdt read
0x2a 0x02 1
sh pdt read
0x2a 0x04 1
sh pdt read
0x2a 0x06 1
sh pdt read
0x2a 0x08 1
sh pdt read
0x2a 0x0a 1
sh pdt read
0x2a 0x0c 1
sh pdt read
0x2a 0x0d 1
sh pdt read
0x2a 0x0e 1
sh pdt read
0x2a 0x0f 1
sh pdt read
0x2a 0x10 1
sh pdt read
0x2a 0x12 1
sh pdt read
0x2a 0x14 1
sh pdt read
0x2a 0x2A 1
sh pdt read
0x2a 0x2B 1
sh pdt read
0x2a 0x30 2
sh pdt read
0x2a 0x32 2
sh pdt read
0x2a 0x34 2
sh pdt read
0x2a 0x36 2
sh pdt read
0x2a 0x38 2
sh pdt read
0x2a 0x3a 2
sh pdt read
0x2a 0x3c 2
sh pdt read
0x2a 0x3e 2
sh pdt read
0x2b 0x00 1
sh pdt read
0x2b 0x02 1
sh pdt read
0x2b 0x04 1
sh pdt read
0x2b 0x06 1
sh pdt read 0x2b
0x08 1
sh pdt read
0x2b 0x0a 1
sh pdt read
0x2b 0x0c 1
sh pdt read
0x2b 0x0d 1
sh pdt read
0x2b 0x0e 1
sh pdt read
0x2b 0x0f 1
sh pdt read
0x2b 0x10 1
sh pdt read
0x2b 0x12 1
sh pdt read
0x2b 0x14 1
sh pdt read
0x2b 0x2A 1
sh pdt read
0x2b 0x2B 1
sh pdt read
0x2b 0x30 2
sh pdt read
0x2b 0x32 2
sh pdt read
0x2b 0x34 2
sh pdt read
0x2b 0x36 2
sh pdt read
0x2b 0x38 2
sh pdt read
0x2b 0x3a 2
sh pdt read
0x2b 0x3c 2
sh pdt read
0x2b 0x3e 2
sh pdt read
0x2c 0x00 1
sh pdt read
0x2c 0x02 1
sh pdt read 0x2c
0x04 1
sh pdt read
0x2c 0x06 1
sh pdt read
0x2c 0x08 1
sh pdt read
0x2c 0x0a 1
sh pdt read
0x2c 0x0c 1
sh pdt read
0x2c 0x0d 1
sh pdt read
0x2c 0x0e 1
sh pdt read
0x2c 0x0f 1
sh pdt read
0x2c 0x10 1
sh pdt read
0x2c 0x12 1
sh pdt read
0x2c 0x14 1
sh pdt read
0x2c 0x2A 1
sh pdt read
0x2c 0x2B 1
sh pdt read
0x2c 0x30 2
sh pdt read
0x2c 0x32 2
sh pdt read
0x2c 0x34 2
sh pdt read
0x2c 0x36 2
sh pdt read
0x2c 0x38 2
sh pdt read
0x2c 0x3a 2
sh pdt read
0x2c 0x3c 2
sh pdt read
0x2c 0x3e 2
sh pdt read
0x2d 0x00 1
sh pdt read
0x2d 0x02 1
sh pdt read
0x2d 0x04 1
sh pdt read
0x2d 0x06 1
sh pdt read
0x2d 0x08 1
sh pdt read
0x2d 0x0a 1
sh pdt read
0x2d 0x0c 1
sh pdt read
0x2d 0x0d 1
sh pdt read
0x2d 0x0e 1
sh pdt read
0x2d 0x0f 1
sh pdt read
0x2d 0x10 1
sh pdt read
0x2d 0x12 1
sh pdt read
0x2d 0x14 1
sh pdt read
0x2d 0x2A 1
sh pdt read
0x2d 0x2B 1
sh pdt read
0x2d 0x30 2
sh pdt read
0x2d 0x32 2
sh pdt read
0x2d 0x34 2
sh pdt read
0x2d 0x36 2
sh pdt read
0x2d 0x38 2
sh pdt read
0x2d 0x3a 2
sh pdt read
0x2d 0x3c 2
sh pdt read
0x2d 0x3e 2
sh console off
sh dexit
2)底层debug信息收集
开启debug命令:
Ruijie#debug support
Ruijie(support)#exec
poedb_msg
Ruijie(support)#exec
poedb_range
Ruijie(support)#exec
poedb_error
Ruijie(support)#exec
poedb_debug
Ruijie(support)#end
Ruijie#sd
Ruijie(sd)#sh console on
Ruijie(sd)#sh poe debug i2c
Ruijie(sd)# sh poe debug
push
Ruijie(sd)#sh poe debug conf
Ruijie(sd)#sh poe debug pd
Ruijie(sd)#sh poe debug st
如果是poe设备无法供电问题,关闭poe,打开poe进行信息收集;如果是POE导致AP掉线问题,则收集周期为AP掉线前到AP掉线后。
关闭debug命令:
Ruijie#debug su
Ruijie(support)#exec
poedb_msg
Ruijie(support)#exec
poedb_range
Ruijie(support)#exec
poedb_error
Ruijie(support)#exec
poedb_debug
Ruijie#sd
Ruijie(sd)#sh poe debug i2c
Ruijie(sd)# sh poe debug
push
Ruijie(sd)#sh poe debug conf
Ruijie(sd)#sh poe debug pd
(sd)#sh poe debug st
(sd)#sh console off
(sd)#sh dexit
注:该信息可能信息量比较大,可以在后面执行
2、对于S5750P交换机(老款的S5750P交换机)
注:该信息可能信息量比较大,可以在后面执行
Ruijie#sd
Ruijie(sd)#
-------------如下在sd模式(Ruijie(sd)#)下收集------------
sh console on
sh poe
debug //如果是poe设备无法供电的问题,关闭poe,打开poe进行信息收集;如果是POE导致AP掉线的问题,则收集周期为AP掉线前到AP掉线后。
关闭debug命令,执行:
(sd)#sh poe debug
(sd)#sh dexit
1)查看硬件芯片设置是否与软件一致:
特权模式下:
Ruijie#Debug ssp loc
Poedump 0
Poedump 1
2)收集底层硬件端口信息:
特权模式下:
Ruijie#debug ssp rem
i2c_read slot 0 type 4
dev_id 1 0x11aa 2 24
i2c_read slot 0 type 4
dev_id 0 0x11aa 2 24
i2c_read slot 0 type 4
dev_id 1 0x11c2 2 24
i2c_read slot 0 type 4
dev_id 0 0x11c2 2 24
该步骤是收集端口状态信息,读取硬件端口状态寄存器以及端口pd等级寄存器
Ruijie#debug ssp rem
Ruijie(ssp-debug)#debug poe warn
关闭命令:
Ruijie(ssp-debug)#debug poe off
如果有i2c读errer的信息读出,则可直接定位为i2c硬件问题。
例如:
DFJN_JR_RG5750_05# debug ssp rem
DFJN_JR_RG5750_05(ssp-debug)#debug
poe warn
DFJN_JR_RG5750_05(ssp-debug)#
$ssp[198781207,hal_poe_pd69_read_register,379]:i2c
read error
$ssp[198781213,hal_poe_pd69_read_register,379]:i2c
read error
$ssp[198781219,lml_poe_pm_update_port_hwinfo,1288]:
$ssp[198781225,hal_poe_pd69_read_register,379]:i2c
read error
$ssp[198781232,lml_poe_pm_update_port_hwinfo,1293]:
$ssp[198781237,hal_poe_pd69_read_register,379]:i2c
read error
$ssp[198781244,lml_poe_pm_update_port_hwinfo,1298]:
$ssp[198781249,hal_poe_pd69_write_reg_field,286]:i2c
read error
$ssp[198781256,hal_poe_pd69_write_reg_field,286]:i2c
read error
$ssp[198781263,hal_poe_pd69_write_reg_field,286]:i2c
read error