The Honeywell ControlEdge HC900 is a modular controller that integrates loop regulation and logic control, widely used in mining, metal, chemical, pharmaceutical, power, and combustion management fields. It supports both non redundant and redundant architectures (Split Rack Redundancy), and provides rich analog/digital I/O modules, HART function blocks, as well as flexible Ethernet (Modbus TCP) and serial communication capabilities. Although the platform is known for its high reliability and ease of use, there may still be issues such as CPU redundancy switching failure, I/O channel signal loss, communication interruption, or configuration damage during long-term continuous operation.
This article systematically elaborates on the diagnostic process and standard troubleshooting steps for various common faults based on the hardware structure, redundancy mechanism, I/O characteristics, and communication protocol of the ControlEdge HC900 controller, combined with practical maintenance experience. All operations mentioned in the article must be carried out by engineers with SIL2 safety knowledge and experience using Honeywell Designer software, and strictly comply with on-site safety regulations.
Review of System Core Components and Redundant Architecture
Before starting troubleshooting, it is necessary to review the key components and operating principles of the HC900 system, which will help understand the subsequent diagnostic logic.
CPU module: including C30, C50 (non redundant), C70 (dual network port), and C75 (redundant CPU). All CPUs are based on a 32-bit PowerPC architecture, with battery backed DDR2 memory (64 MB for C30/C50 and 128 MB for C70/C75), and support ECC error checking. The program execution environment is protected by an independent watchdog timer.
Redundant architecture: The C75 redundant system uses independent controller racks (without local I/O), with two C75 CPUs powered by independent power modules and interconnected through redundant switching modules (RSMs). The main controller (Lead) synchronizes the operating data to the backup controller (Reserve) every scanning cycle, and performs undisturbed switching when the main controller fails, RSM key switch is switched, or function block instructions are triggered.
I/O system: Supports local racks (4/8/12 slots) and remote I/O racks (connected through dedicated Ethernet ports using copper or fiber optic cables). The I/O module includes general-purpose analog inputs (8 channels, compatible with thermocouple/RTD/voltage/current), high-level analog inputs (16 channels), analog outputs (4/8/16 channels), digital inputs (16/32 channels), digital outputs (16/32 channels), relay outputs, and pulse/frequency/quadrature encoder modules. Important feature: Supports hot swapping (inserting or removing modules during system operation, automatically recognized and configured by the controller).
Communication interface: Each CPU provides Ethernet 10/100Base-T ports (C70/C75 dual ports), supporting Modbus TCP、OPC、HART IP、Peer-to-Peer(UDP)。 The isolated RS-485 port supports Modbus RTU master or slave mode, with a maximum baud rate of 115200.
Alarm and Event: The controller supports up to 360 alarm points (divided into 30 groups) and 64 event points, with a timestamp resolution of 1 second. Some configurations support high-resolution SOE (1 ms), but it is only valid in non redundant UIO configurations.

Typical fault diagnosis and troubleshooting
The following fault scenarios are organized based on the high incidence problems on site, providing symptoms, possible causes, and detailed handling steps for each item.
3.1 C75 Redundant CPU Switching Failure or Uninterrupted Switching Interruption
phenomenon
After the main controller fails (such as power loss, CPU internal error), the backup controller does not automatically take over, and the process loses control.
When manually switching through the RSM key switch, the Lead status indicator light does not change, or the output briefly jumps after switching.
The redundancy status function block in the Designer software displays "Reserve NOT synchronized" or "Communication loss".
Possible reasons:
The redundant synchronization link (dedicated Gigabit Ethernet port or backplane connection) between two C75 CPUs is physically disconnected.
The real-time operating system or application version of the backup controller is inconsistent with that of the main controller.
The battery of the backup controller has been depleted, resulting in the loss of dynamic data and the inability to receive synchronized data from the main controller.
The mode switch on RSM is mistakenly placed in the "Force" or "Disable" position.
The I/O communication network (remote rack) is interrupted, causing the backup controller to be unable to obtain complete I/O status.
Exclusion steps:
Check physical connection: Confirm that the redundant Ethernet ports (usually Port 2) of two C75 CPUs are directly connected using standard CAT5e jumpers (or through dedicated redundant switches, but Honeywell recommends direct connection). Observe the port LED, which should flash to indicate activity.
Verify version consistency: Use Designer software to connect the main control and backup control separately, click "Controller" ->"Properties" to view the firmware version and project configuration version number (checked through File Properties). If there is inconsistency, upgrade the backup control to be exactly the same as the main control. Note: Redundant systems require that the hardware model, firmware, and configuration file hash values of two CPUs are completely identical.
Test battery status: Open the front cover of the C75 module and measure the battery voltage. The TL5903 lithium battery has a nominal voltage of 3.6V and must be replaced when it is below 2.5V. Place the controller in Program mode and disconnect the power before replacement, and record all configurations (backed up to SD or PC). After replacement, download the configuration again and resume operation.
RSM switch check: Rotate the key switch on the RSM to the "Auto" position (not "Force"). Refer to the output parameters of the "Redundancy Status" function block in Designer and confirm that "Switch_Snable" is set to TRUE.
Force switch test: Manually trigger the switch using the "Redundancy Control" function block in Designer (or through the buttons on RSM). Observing LED changes: The "Lead" LED of the main control goes off, and the "Lead" LED of the backup control lights up. If the I/O communication is still abnormal after switching, use the diagnostic function block "System Monitor" to read the heartbeat status of each remote rack (see the remote I/O fault section below).
Restore synchronization: If the backup controller is offline for a long time, full synchronization needs to be performed again. Firstly, ensure that the backup control is in "Program" mode, and then select "Redundancy" ->"Synchronize now" in Designer. The system will copy all the configuration and operating parameters of the main control to the backup control, which may take several minutes, during which the control will not be affected.
Preventive measures: Conduct a manual redundancy switch test once every quarter and check the battery voltage records of both CPUs. Regularly backup configuration files to the project server.
3.2 Unable to recognize or maintain bad channel data values after hot plugging of I/O modules
phenomenon
After replacing the faulty analog input module (such as 8-channel UIO), the corresponding channel in the Designer software displays "????" or "Bad", and the actual on-site signal (such as 4-20 mA) cannot be read.
After replacing the digital output module, the corresponding DO point cannot control the on-site equipment, and the module status LED displays red.
An entry for 'I/O module mismatch' or 'Configuration error' appears in the controller log (Event Log).
Possible reasons:
The newly inserted module model does not match the model specified in the configuration (for example, the original module was a 16 channel high-level AI, mistakenly inserted as an 8-channel universal AI).
The module was not fully secured during insertion, resulting in poor contact of the backplane connector.
The controller scanning program failed to automatically trigger module enumeration (commonly seen in some early firmware versions or redundant systems).
The channel configuration of the module (such as input type, range, filtering) is lost due to EEPROM damage.
Exclusion steps:
Confirm model consistency: Check the product code on the side of the module (e.g. 900AI8-0100 is an 8-channel universal AI, 900AI16-0100 is a 16 channel high-level AI). Compare the "Module Type" setting of the slot in Designer. If there is inconsistency, it is necessary to replace the correct model or modify the configuration (modifying the configuration requires online download, which may cause temporary disturbance).
Re plug and observe self-test: While the system is running, completely unplug the module, wait for 5 seconds, and then forcefully push it in until the upper and lower buckles "click" to lock. Observe the "OK" or "PWR" LED on the module: normally it should be green and constantly on. If the LED flashes red, it indicates a module self-test failure or backplane communication error.
Force reinitialization: In Designer, enter the "I/O Configuration" view, right-click on the corresponding I/O rack, and select "Re initialize I/O. This operation will force the controller to re enumerate all modules on this rack, without affecting other racks. Note: For redundant CPU systems, this operation must be performed on both CPUs (or wait for automatic repair of redundant synchronization).
Check channel diagnostic information: Double click the I/O module to open the module details window. HC900 provides diagnostic status for each channel (such as "Open Wire", "Overlange", "Underrange", "Input Error"). For analog inputs, the most common is "Open Wire" - check if the sensor wiring on site is disconnected. When configuring the UIO module as a digital output, a "Short Circuit" appears - check if the load impedance is too low.
Restore default configuration: If the module is recognized but all channels have bad values, it may be due to a damaged internal configuration table of the module. Use the 'Upload I/O Config' feature of Designer to read the correct configuration from the controller, then execute 'Factory Defaults' on the module and download it again.
Hot plug precautions:
Hot swapping is only applicable to I/O modules and does not support hot swapping of CPU or scanner modules (S50/S75 Scanner). Replacing the scanner must be powered off.
When replacing non redundant I/O modules in a redundant system, both the primary and backup controllers will sense the removal/insertion of the module, but data will only be restored when the serial number of the new module matches the configuration.
After hot plugging, the output state of the digital output module returns to the safe default value (usually preset in the configuration), and the command must be reissued by the operation station.
3.3 Modbus TCP communication interruption or response timeout
phenomenon
The upper computer (such as SCADA, HMI) cannot read the real-time data of HC900, and the Designer software cannot connect to the controller through Ethernet.
The Link/Activity LED on the Ethernet port of the controller is not lit or only flashing yellow.
Modbus TCP client reports' Connection timeout 'or' Socket error '.
Possible reasons:
IP address conflict or subnet mask/gateway configuration error.
Ethernet switch failure or port blockage (such as broadcast storm).
Controller Modbus/TCP service crash (internal watchdog not reset).
The number of hosts connected simultaneously exceeds the limit (C50/C70/C75 up to 10 concurrent connections, C30 up to 5).
The firewall or network security policy intercepted the Modbus TCP port (default 502).
Exclusion steps:
Physical layer inspection: Use a cable tester to verify the continuity and line sequence of Ethernet cables (recommended to use shielded CAT5e). Observe the corresponding port LED of the switch. If it is constantly on but not flashing, there may be a link negotiation issue. The HC900 Ethernet port supports MDIX auto flip, but if connecting to an old switch, try using a crossover cable.
Verify IP configuration: Connect the controller through the serial port (RS-485) of Designer (USB-RS485 converter is required), Honeywell p/n 50089787-501)。 View the IP address, subnet mask, and default gateway of the Ethernet port in "Controller" ->"Communications". Confirm that it is on the same logical subnet as the upper computer. Note: The two Ethernet ports of C70/C75 must be configured on different subnets and cannot be the same.
Ping test: Use the command line to ping the controller IP from the upper computer. If it doesn't work, check the switch VLAN settings and port security. If it is connected but Modbus TCP is not, use a port scanning tool (such as Nmap) to check if port 502 is open.
Check concurrent connections: On the diagnostic page of Designer, check the "Ethernet Host Connections" counter. If the maximum value is reached, close unused SCADA clients or add upper computer connection pool reuse. The maximum number of connections allowed by Modbus TCP includes Designer connections. It is recommended to disconnect unnecessary Designer sessions during production.
Restart communication stack (without shutdown): Use the function block "System Reset" to reset only Ethernet communication (without restarting the controller). Specifically, add a temporary pulse to trigger the "CfgReset" parameter in the control strategy (caution should be exercised, only resetting the communication submodule). A safer way: Send Modbus commands through the RS-485 port to reset the TCP stack.
Firmware upgrade: If Modbus TCP service freezes frequently, please upgrade to v6.300 or above, which enhances the TCP stack's ability to recover from exceptions.
HART IP communication fault supplement: If using HART IP to communicate with FDM, the default TCP port is 5094. Check if the HART IP server is enabled (check "Enable HART IP Server" in Designer). Simultaneously confirm that there is no conflict with the HART field device address (0-15). When using the Wireless HART adapter, it is necessary to check the signal strength.
3.4 Power redundancy failure or frequent controller restarts
phenomenon
The "Power supply fail" alarm appears in the system log, but there is still voltage input on site.
The controller restarts periodically, and the I/O state briefly disappears before recovering.
The "OK" LED of one module in the redundant power module is turned off.
Possible reasons:
Internal faults within a single power module, such as aging capacitors and blown fuses.
External power supply voltage fluctuation exceeds the allowable range (P01 module: 90~264VAC; P24 module: 21-29VDC).
The redundant power connectors on the rack expansion backplane are loose.
High ambient temperature leads to overheating protection of the power module.
Exclusion steps:
Measure input voltage: Use a multimeter to measure the voltage at the input end of the power module. For P01, ensure the voltage is between 90~264VAC and the frequency is between 47~63Hz; for P24, ensure the voltage is between 21~29VDC and the ripple is less than 5%.
Replacing faulty power modules: For redundant configurations (using expansion racks to install two power supplies), a single power module can be replaced while the system is running. Unplug the input connector of the faulty module, then release the locking screw to remove the module. After inserting the new module, power it on again and observe the LED turn green. Non redundant systems (single power supply) must be shut down and replaced.
Check load current: Calculate the total power consumption of all I/O modules. The maximum output of a power module is 60W. If it approaches the upper limit, consider adding redundant power supplies or removing non essential modules. Real time power load percentage can be viewed in the "System Monitor" of Designer.
Temperature check: HC900 allows an ambient temperature of 0-60 ° C (for some modules such as 16 channel AI, the temperature level is T6, with a maximum of 60 ° C). Use an infrared thermometer to measure the temperature inside the cabinet. If it exceeds 60 ° C, install a fan or air conditioner.
3.5 Configuration loss or controller failure to start
phenomenon
After the controller is powered on, the "Run" LED does not light up, and all I/O module outputs are in a safe state (configured fault safety value).
When the Designer software attempts to connect, it prompts "Controller contains no configuration" or "Configuration checksum error".
The low battery voltage alarm had already appeared but was ignored.
Possible reasons:
After the backup battery is depleted and the main power supply is interrupted, the configuration in the dynamic memory is lost.
User configuration damage in Flash memory (rare, possibly caused by static electricity or firmware upgrade interruption).
CPU module hardware failure.
Exclusion steps:
Download configuration again from PC: If the configuration file (*. cde) was previously saved, use Designer to download it via serial port or Ethernet. Attention: Before downloading, place the CPU in "Program" mode (via the key switch on the RSM or through software selection). After downloading, switch to 'Run'.
Attempt to upload without backup: If the PC file is lost but there is still configuration in the controller Flash (only battery loss), you can try to recover it through the "Upload from Controller" function in Designer. Check 'Upload full configuration including text descriptions'. Attention: The uploaded file may lack some comments, but the functional blocks and I/O configuration are complete.
After resetting to factory settings, reconfigure: If the upload fails, you can perform a full controller erase. Select "Controller" ->"Erase Configuration" through Designer, and then manually reconfigure (or import previously exported CSV point tables). For SIL2 security configuration, it is recommended to only use validated backup files and not to recreate the configuration in production.
Configuration migration after replacing CPU module: If the CPU is physically damaged, move the SD card (if any) or external storage on the old CPU to the new CPU. HC900 does not support direct insertion and removal of SD card migration configuration, and must be downloaded through Designer. After replacing C75, redundant synchronization needs to be performed again.
Best practice: After each configuration modification, use the "Export" function to save the configuration as. cde and. pdf documents and archive them on the engineering server. Set up periodic (monthly) automatic upload of configurations to network locations (via Designer scripts).

Advanced diagnostic tools and techniques
4.1 Utilizing Function Blocks for Online Diagnosis
HC900 provides rich diagnostic function blocks that can be embedded in control strategies to monitor health status in real-time:
System Monitor Block: Output CPU load, scan cycle, battery voltage, redundancy status, and I/O bus error count.
Redundancy Status Block: Provides primary and backup roles, synchronization status, and switch counters.
Rack Monitor Block: Monitor the communication quality and module presence of each I/O rack.
Modbus Device Block: When acting as a Modbus master, it can read the status register of the slave station and detect the number of communication failures.
After creating these functional blocks in Designer, their output values can be displayed on the 900 Control Station operator interface for easy daily inspections.
4.2 Event Log Analysis
The controller can store up to 5000 events (SOE) or 360 alarms internally. The "Event Log" view in Designer allows filtering by time, priority, and type. Common key events:
Event code description suggests actions
E011 Redundant link loss check for redundant cables
E045 I/O module removed check hot plug record
E102 Battery Low: Replace the battery within 48 hours
E207 Modbus TCP Connection Dropped Check Network or Restart Communication
E301 Watchdog reset CPU abnormal reset, firmware upgrade
4.3 Remote access and modem troubleshooting
For unmanned stations, HC900 supports remote diagnosis through an external modem (RS485 to RS232, connected to standard telephone lines or cellular modules). Key points for troubleshooting:
Ensure that the RS485 port of the controller is configured in "modem" mode (not "direct") and the timeout is set to 30 seconds or more (due to dialing delay).
Use the AT command to test the modem initialization string (e.g. "AT&F0").
In Designer, select "Communication" ->"modem" and enter the other party's phone number. If frequently disconnected after connection, reduce the baud rate to 9600.
Preventive maintenance recommendations
To minimize unplanned downtime, it is recommended to develop and implement the following maintenance plan (with cycles adjusted based on the severity of the operating environment):
Quarterly:
Test redundant CPU switching (manually triggered, verify no interference).
Measure the input voltage and ripple of all power modules.
Check the LED status of all I/O modules and record any abnormal flashing.
Generate a report using "Diagnostics" ->"I/O Health" in Designer.
Every six months:
Hot plug each I/O module (during planned downtime), clean the gold fingers and reinsert to remove the oxide layer.
Calibrate key analog input channels (verify the engineering values corresponding to 4mA and 20mA using a precision signal source).
Backup the controller configuration to two independent media (network driver and USB).
Every year:
Replace the CPU battery (even if low voltage is not reported, TL5903 is designed to have a lifespan of 5 years, and it is recommended to replace it preventively for 3 years).
Upgrade the firmware to the latest stable version (download from Honeywell official website, follow the upgrade guide).
Check all grounding resistances (frame grounding should be less than 1 Ω).
