

Bergische Universität Wuppertal FAKULTÄT FÜR Elektrotechnik, Informationstechnik und Medientechnik

Development and characterization of the Monitoring of Pixel System (MOPS) chip to monitor the ATLAS ITk Pixel Detector

PhD Thesis of: Rizwan Ahmad

20.09.2023

# Abstract

A High Luminosity(HL) upgrade is planned for the Large Hadron Collider (LHC) at CERN, which will increase the originally designed luminosity by a factor of ten. The ATLAS experiment will get a completely new all-silicon Inner Tracking Detector (ITk) by 2026 to work with the new HL-LHC. The ITk consists of a Pixel Detector as the inner and a strip detector as the outer part. The ITk Pixel Detector design comprises five layers and over 10000 readout modules. The new Pixel Detector will use serial powering chains with a maximum of 16 readout modules in a single chain to reduce services inside the detector. A new Detector Control System (DCS) is required to monitor the temperature and voltages across the detector modules for reliable operation and data taking.

Due to the limited space, very stringent radiation requirements, and to reduce the material inside the detector volume, also in the DCS, a newly designed chip is required that can collect the monitoring data locally and send it over a long communication channel to a central station outside the detector volume. This dramatically reduces the number of service lines. The chip must be radiation hard to an ionizing dose of up to 500 Mrad, tolerant to Single-Event Effects (SEE), and requires a wide operational temperature range of -40  $^{\circ}$ C to +60  $^{\circ}$ C.

The Monitoring of Pixel System (MOPS) chip is an Application Specific Integrated Circuit (ASIC) that has been developed using the TSMC 65 nm CMOS process to provide the temperature and the voltage monitoring data of detector modules to the DCS. The chip offers 32 ADC channels using the Successive Approximation Register (SAR) based 12-bit ADC. The chip implements Controller Area Network (CAN) and CANopen protocols in a hardwired logic, uses a non-standard low voltage physical layer, and provides additional features like remote reset without a power cycle and automated on-chip frequency trimming using CAN messages. Irradiation studies have been performed to prove Total Ionizing Dose (TID) tolerance of up to 500 MGy and tolerance to Single Event Effect(SEE). The chip has proven to work reliably under irradiation at high operating temperatures of up to  $40\degree C$  as well as for an operational temperature range of −40 °C to 60 °C.

# **Contents**







# <span id="page-8-0"></span>Chapter 1 CERN and the Large Hadron Collider

# <span id="page-8-1"></span>1.1 Research at CERN and the LHC

Conseil Européen pour la Recherche Nucléaire (CERN) is an international research institute founded in 1954 by 12 member countries. Currently, the CERN provides one of a kind infrastructure to conduct research in high-energy physics. The institute hosts the world's largest high-energy particle collider. Over time, many experiment facilities were built, contributing a tremendous amount of scientific knowledge for humanity. The puzzle of the standard model of elementary particles was partially solved by the discovery of the W boson, Z boson, and Higgs boson. In addition to that, the institute provides the infrastructure to research in extra dimensions and antimatter, etc. Beyond the research in fundamental particle physics, CERN contributes towards the development of future technologies and provides scientists with tools to research; e.g., the World Wide Web is one of the key achievements at CERN, which has revolutionized information exchange. Technologies developed at CERN are used in the medical field to provide diagnoses and therapies and in the space industry to build radiation-hard electronics and tools. Figure [1.1](#page-9-1) shows an overview of the complete CERN accelerator complex with all the installed experiment facilities.

The Large Hadron Collider The LHC is one of the largest and most complex machines ever built by mankind. The machine is 100 m under the ground in a circular tunnel with a circumference of approximately 27 km and a diameter of 3.8 m. Protons are acquired by removing electrons from the hydrogen atoms. Shown in Figure [1.1,](#page-9-1) the protons are then fed into the Booster and subsequently into the Proton Synchrotron(PS) and Super Proton Synchrotron (SPS), where they get accelerated before finally transferring to the LHC. Two proton beams accelerated up to 7 TeV in a counter direction collide at specific points around which the detectors are built to track the trajectories of the particles after a collision. In total, 9600 magnets are used in the LHC to steer and optimize the trajectories of the particles [\[2\]](#page-138-1). LHC is designed for a maximum collision energy of 14 TeV, equivalent to  $22.4 \times 10^{-7}$ J. However, it can also be operated using a Lead-nuclei beam, generating even higher collision energy of up to 1150 TeV. Although circular, the LHC is not symmetrical. The whole structure is divided into eight arcs and also eight insertion points. The dipole bending magnets are used in the arcs to guide the beam. Part of the machine lying inside an insertion point is entirely straight and goes through different experiments. It is used

<span id="page-9-1"></span>

Figure 1.1: The CERN accelerator complex [\[1\]](#page-138-2)

for dumping, cleaning, and injection of the beam. Combined, this incredible piece of engineering delivers a Luminosity of  $1 \times 10^{34}$ cm<sup>-2</sup>s<sup>-1</sup> and up to 1 billion collisions per second [\[2\]](#page-138-1).

### <span id="page-9-0"></span>1.2 Experiments running at the LHC

Figure [1.1](#page-9-1) shows locations around the ring where the four big experiments and related facilities are installed at the LHC. These are the Compact Muon Solenoid (CMS), Large Hadron Collider Beauty (LHCb), A Toroidal LHC ApparatuS (ATLAS), and A Large Ion Collider Experiment (ALICE). The ALICE experiment is a dedicated machine of 10000 tonnes, 26 m long and 16 m high/wide, to study strongly interacting matters. It uses the lead-nuclei beam at the LHC to study conditions just after the Big Bang. The LHCb is dedicated to researching a particular type of particle called beauty quark to find differences between matter and antimatter. Instead of a single detector centered around a collision point, the LHCb is composed of a series of small detectors placed in a single direction over up to 20 m. The other two large detectors, ATLAS and CMS, are general-purpose detectors that provide data to study the interaction of the particles. Although doing the same thing, ATLAS and CMS are built differently by two independent collaborations. This allows cross-confirmation of the scientific achievements made by the two experiments. The data from the CMS and the ATLAS is used to study the standard model, dark matter, and extra dimensions [\[3\]](#page-138-3). Section [2.1](#page-13-0) describes the ATLAS experiment in detail.

# <span id="page-10-0"></span>1.3 LHC high luminosity upgrade

A high luminosity upgrade is planned for the LHC, which will increase the originally designed luminosity by a factor of 10. The new collider will start its operation in 2029 per the current plan. After several years of operation at 13 TeV, the machine was able to produce  $160 \text{ fb}^{-1}$  of data, while after the upgrade, the HL-LHC will produce  $250 \text{ fb}^{-1}$  of data per year and 4000 fb<sup>-1</sup> of data over its lifetime of 12 years. The upgrade will allow producing a massive amount of collision data compared to the current system, e.g., 15 million Higgs bosons per year can be generated compared to a maximum of 3 million in 2017. To achieve the upgrade capabilities, 24 superconducting magnets made of niobium-titanium will be used with an intensity of 12 T compared to the current 8 T. In addition to that, the fundamental structure of the machine also needs an upgrade, e.g., Crab cavities to tilt the beam, reinforced machine protection to defend against a stray beam, and crystal collimators for better cleaning [\[4\]](#page-138-4).

# <span id="page-12-0"></span>Chapter 2 The ATLAS experiment

ATLAS is a general purpose and the largest detector at CERN. Parallel to the CMS experiment, it also studies dark matter, Higgs boson, and extra dimensions. The whole machine is composed of several sub-detectors and built around a center point of collision. The whole detector is symmetric around the interaction/collision point. It is 46 m long, 25 m high, 25 m wide and weighs around 7000 t.

<span id="page-12-1"></span>

Figure 2.1: The ATLAS experiment [\[5\]](#page-138-5)

### <span id="page-13-0"></span>2.1 Overview and sub-detectors of the ATLAS experiment

Figure [2.1](#page-12-1) shows the computer-aided design of the ATLAS experiment and its subdetectors. As shown in Figure [2.2,](#page-13-1) the center point of collision/interaction is defined as the origin of the coordinate system where the z-axis is used to define the beam direction, the positive x-axis is defined from the interaction point to the center of the LHC, while the positive y-axis is going in the upward direction. The pseudorapidity is defined as  $\eta = -\ln \tan(\theta/2)$ , where  $\theta$  is the polar angle from the beam axis [\[6\]](#page-138-6). A larger pseudorapidity means closer to the beam pipe, as seen in Figure [2.4.](#page-16-1) Figure [2.3](#page-14-0) shows a visualization of different particles detected inside the sub-detectors of the ATLAS experiment. The whole detector can be categorized into four sub-systems. Starting from the center point and going outward, the first is called the Inner Detector(ID). The second one is the Calorimeter, while the outermost and the third one is called the Muon Spectrometer. The fourth part is the magnetic system, which lies between the detectors to provide bending power. Brief details about each part are mentioned in the following paragraphs.

<span id="page-13-1"></span>

Figure 2.2: ATLAS experiment coordinates [\[7\]](#page-138-7)

**Inner Detector** The inner detector(ID) is further divided into three parts: the Pixel detector, the Semiconductor Tracker(SCT), and the Transition Radiation Tracker (TRT). The complete ID is immersed in a magnetic field of 2 T provided by the central solenoid, briefly explained in paragraph [2.1,](#page-15-0) and can be seen in Figure [2.1.](#page-12-1) The ID provides pattern recognition, momentum, and precise vertex measurements for the particles produced by a collision every 25 ns with a pseudorapidity of  $|\eta| < 2.5$ . This thesis will focus on the monitoring chip for the future Pixel detector. Therefore, the Pixel detector is explained in more detail in section [2.2](#page-16-0) [\[6\]](#page-138-6).

Calorimeter The Calorimeter is made up of two sub-detectors: the liquid argon, an electromagnetic calorimeter, and the hadronic Calorimeter. The Electromagnetic Calorimeter is used to find the energy and trajectory of electrons and photons. It provides a pseudorapidity of  $|\eta| < 3.2$ . The Hadronic Calorimeter is subdivided into a barrel section and an endcap section. It gives a pseudorapidity range of  $|\eta| < 1.7$  and is used for hadron measurements like proton and neutron. Cumulative, the complete Calorimeter provides high granularity and a range of pseudorapidity of  $|\eta| < 4.9$  [\[6\]](#page-138-6).

<span id="page-14-0"></span>

Figure 2.3: Visualization of particles inside the ATLAS's sub-detectors [\[8\]](#page-138-8)

Muon detector As shown in Figure [2.1,](#page-12-1) The Muon detector is ATLAS's largest and outermost part. It is divided into an endcap Muon chamber, barrel Muon chambers, and a New Small Wheel (NSW) Muon chamber. The Muon detector detects the momentum and energy of the muon particles, which pass through all the inner detectors without any interaction. The barrel region measures the muon tracks using three cylindrical layers around the beam axis. The endcap chambers have three layers perpendicular to the beam axis on each side. The information generated by the muon detector is also used for the trigger system[\[6\]](#page-138-6). New Small Wheel (NSW) endcaps have been integrated recently into the muon spectrometer to cope with the HL-LHC environment and for better resolution [\[9\]](#page-138-9).

<span id="page-15-0"></span>Magnet system The ATLAS experiment has four large superconducting magnets with a combined stored energy of 1.6 GJ. The ATLAS magnet system is divided into two main parts. A solenoid that is cylindrical around the beam axis and provides an axial magnetic field for the Inner Detector (ID). The solenoid magnet can be charged and discharged in just 30 minutes. A barrel toroid and two endcap toroid magnets provide deflection power for the muon detector. The barrel toroid generates a magnetic field of 0.5 T. The two endcap toroids combined produce a magnetic field of 1 T [\[6\]](#page-138-6). The whole magnet system is steered and operated in different modes by the ATLAS detector control system, as explained in Chapter [3.](#page-20-0)

### <span id="page-16-0"></span>2.2 The ATLAS ITk Pixel Detector

The complete Inner Detector of the ATLAS experiment will be upgraded and replaced by an all-silicon Inner Tracker (ITk) during the Long Shutdown (LS) 3 of the LHC. The upgrade is required to cope with harsher radiation environments of up to a fluence of  $2 \times 10^{16}$  n<sub>eq</sub>/cm<sup>2</sup> and to provide more precise measurements operating under the HL-LHC. Shown in Figure [2.4\(](#page-16-1)a) is a software-rendered overview of the complete ITk, whereas Figure [2.4\(](#page-16-1)b) shows the layout for one quadrant. The blue lines show the Strip part, whereas the red lines represent the Pixel part of the detector. In this thesis, only details regarding the Pixel part will be discussed because the MOPS chip has been developed to be used only for this part. The new Pixel detector will have five layers surrounding the beam pipe and several layers as rings known as Endcaps. Each layer is loaded with detector modules. A module comprises a sensor to detect particles and a readout chip to communicate data. The whole ITk will cover an instrumented area of  $13 \text{ m}^2$  with five billion readout channels [\[10\]](#page-138-10). The innermost layer is only at a distance of 34 mm from the beam pipe. Because of the radiation damage caused by an ionizing radiation dose of 500 Mrad, the innermost two layers will be replaced after five years of operation [\[11\]](#page-139-0). This allows for replacing damaged components due to radiation and integrating the newest available technology. The system will provide a pseudorapidity coverage of  $|\eta| < 4.0$ . 3D sensors will be used in the innermost layer with a pixel size of  $50 \times 50 \text{ µm}^2$  and  $25 \times 100 \text{ µm}^2$  whereas planar sensors will be used in the outer four layers with a pixel size of  $50 \times 50 \text{ }\mu\text{m}^2$ . The planar sensors have a thickness of 100 µm for the second layer and 150 µm for the outer three layers. The readout system will use the ITkpix chip providing  $400 \times 384$  pixels. Each chip has four readout channels at a rate of 1.28 Gbps. Approximately 10000 modules will be used in the system to host the ITkpix readout chips. The readout chips on the modules are powered in parallel [\[10\]](#page-138-10).

<span id="page-16-1"></span>

(a) Software rendered image of the new Inner Tracker [\[11\]](#page-139-0)



(b) Layout of one quadrant from the point of interaction (origin) [\[12\]](#page-139-1)

Figure 2.4: The ATLAS Inner Tracker layout

## <span id="page-17-0"></span>2.3 Serial powering scheme for the detector modules

The new Pixel Detector uses a new serial powering scheme to reduce the material inside the detector volume and decrease cable power losses. The total power loss in the cable should be less than  $30\%$  of the total required power [\[11\]](#page-139-0). Figure [2.5](#page-18-0) shows the new serial powering scheme. The concept of a serial powering chain has not been used in earlier generations of the detector. The complete chain is powered by a constant current source labeled LV. All the modules hosting the readout chips are connected in series, although the chips themselves are powered in parallel on the modules. According to the Pixel TDR [\[11\]](#page-139-0), a single chain can have a maximum of sixteen modules, which is reduced to thirteen in the subsequent system planning. Each module can have a maximum of four FrontEnd (FE) chips [\[13\]](#page-139-2), but a lower number is also used in some chains. Every FE chip is powered using two shunt-LDO regulators [\[14\]](#page-139-3)[\[15\]](#page-139-4), which provide a constant voltage supply for the chip and can also shunt extra current in the case when the chip is working in the low power mode where the consumption goes down significantly. The shunting capability of the regulator also helps in case one of the FE chips stops working and becomes a high ohmic path. The regulators from the other FE chips can shunt the extra current to the ground. The regulator can shunt the current as high as double the amount during nominal operation. To ensure reliability using the serially powered chain of modules, the modules must be protected against over-temperature and over-voltage conditions. This requires a radiation-hard independent monitoring system. A serial power chain is the smallest unit, which can be controlled individually from the power supplies. In case of over-temperature and over-voltage conditions reported by an independent monitoring system, the problematic modules can be configured so they can be bypassed to prevent loss of the complete chain.

<span id="page-18-0"></span>

Figure 2.5: Serial powered chain for the ATLAS ITk Pixel detector

# <span id="page-20-0"></span>Chapter 3

# Detector Control System (DCS) for the ATLAS ITk Pixel Detector

A sophisticated Detector Control System(DCS) for the ATLAS ITk Pixel Detector is necessary to ensure reliable data collection and safe operation of the detector itself and the safety of the operating personnel. The DCS monitors parameters like voltage, temperature, and other signals from different sub-systems to assess the experiment's state.

# <span id="page-20-1"></span>3.1 Requirements for the new DCS system

The system is required not only for nominal operation but also during the commissioning of the new subsystems. The DCS must have independent power and communication links/protocols to ensure that it is available at all times. This allows monitoring, control, and safety of the detector independent of its operational mode or whether the front-end detector modules can communicate.

# <span id="page-20-2"></span>3.2 Proposed DCS concept of the Pixel Detector

The Technical design report (TDR) of the ATLAS ITk Pixel Detector [\[11\]](#page-139-0) presents a detailed concept of the DCS. There are three different independent paths for the DCS. They differ in terms of precision, availability, granularity, and reliability. The block Figure [3.1](#page-21-0) shows the revised design.

#### <span id="page-20-3"></span>3.2.1 Safety path

The safety path consists of the interlock system and acts as the last line of defense against any critical detector malfunction. The interlock system provides less precision but the highest reliability and availability. The system must be available and ready to act at all times. Figure [3.2](#page-22-1) shows the topology of the Interlock system. The input from the temperature sensors is processed by two independent instances inside the interlock crate. The instance FPGA Interlock Matrix processes the information from the temperature sensors using a hardwired logic in a binary way. If the temperature rises above a certain threshold, the system will directly act on the power supplies or any interlock-controlled device. This means that the system is either in a safe

<span id="page-21-0"></span>

Figure 3.1: Overview of the ITk Pixel Detector DCS [\[11\]](#page-139-0)

state or not. The same input signal from the temperature sensors is also sampled by an ADC and fed to the Monitoring-FPGA for monitoring and debugging the interlock system. Each module in the serial powering chain hosts an NTC resistor to measure the module's temperature. The interlock system uses one of the several NTC resistors available in a single serial powering chain. This allows the interlock system to have a granularity per serial power chain. The exact location of the Interlock NTC resistors in a serial power chain is different for sub-systems. A simple two-wire readout topology reads out NTC resistors, and all the signals are driven directly to the counting room. In the counting room, a 3U 19-inch crate called Local Interlock Safety System (LISSY) [\[16\]](#page-139-5) is responsible for receiving and processing all the signals coming from the interlock NTC resistors, as well as it hosts the interlock-FPGA and the Monitoring-FPGA The LISSY are placed inside the counting room to make them safe from radiation effects. The interlock system not only acts on the temperature signals coming from the NTC resistors but also provides the possibility to act on external signals. These external signals can be, e.g., from the cooling plant or the beam interface system for a safer operation. The system acts directly on the power supplies using hardwired logic independent of any software [\[16\]](#page-139-5)[\[11\]](#page-139-0).

<span id="page-22-1"></span>

Figure 3.2: Concept of the interlock system used in the Safety path of the ITk DCS [\[16\]](#page-139-5)

#### <span id="page-22-0"></span>3.2.2 Control & feedback path

Figure [3.1](#page-21-0) shows the control and feedback path. The control and feedback path of the DCS provides an operator with an interface to monitor the actual state of the detector and then act accordingly to control sub-detector parts. The control here mainly refers to controlling power to the sub-systems to ensure safe operation and reliable data taking. The power supplies directly provide the control and feedback per SP chain. However, the feedback per detector module is provided by the MOPS chip in the form of the temperature and the voltage of individual detector modules. The MOPS chip is placed on the end of the sub-structure. The control and feedback is a highly reliable system and must also be available when the detector is not running. This sub-system will be used during the commissioning to calibrate and debug services to the detector modules. Another significant feature of this sub-system of the DCS is that it has its own powering and communication lines independent of the serial power chains of the ITk Pixel Detector. This ensures the monitoring feature is available even when the detector is not in regular operation. The interconnection topology between the Monitoring of Pixel System (MOPS) chip and a module in the serial powering chain shown in Figure [3.1](#page-21-0) is elaborated in Figure [3.3.](#page-23-0) The connection between the MOPS chip and the local control station is shown in Figure [3.4.](#page-23-1) The MOPSHUB is an FPGA-based system aggregating data from the MOPS chip and sending it to the Embedded Monitoring and Control Interface (EMCI)[\[17\]](#page-139-6). The EMCI then sends the data over an optical link to the local control station. The Embedded Monitoring Processor (EMP) [\[17\]](#page-139-6) is responsible for receiving the DCS data on the other end provide it to the DCS server. The control and feedback path requires a refresh rate of 5 seconds [\[18\]](#page-139-7). By providing 32 ADC channels, each single MOPS chip can monitor the voltage and the temperature of up to 16 modules in a single serial power chain.

<span id="page-23-0"></span>

Figure 3.3: MOPS connection with modules in the serial powering chain [\[19\]](#page-139-8)

<span id="page-23-1"></span>

Figure 3.4: Control/monitoring path of the ITk DCS [\[20\]](#page-139-9)

#### <span id="page-24-0"></span>3.2.3 Diagnostics path

The diagnostics path of the DCS allows for fine-tuning the detector's performance by providing additional monitoring and debug information. Detailed information is collected from each front-end chip and transferred to the central DCS server over the optical links connected to the Opto-box shown in Figure [3.1.](#page-21-0) The communication channels and data streams of the detector modules are used for this purpose, which means that this additional information is only available when the detector is operational.

### <span id="page-24-1"></span>3.3 Current monitoring technology and its limitation

In the current system, each module has its own sense lines for voltage and temperature monitoring. All the sense lines are routed outside the detector volume to a point far from the modules, up to 70 m in length. The sense lines are fed into the Embedded Local Monitor Board (ELMB). The ELMB is used at CERN for different experiments and facilities. It is also used in the current ATLAS experiment to provide monitoring data for different sub-detectors [\[6\]](#page-138-6). This is a general-purpose board of size  $50 \times 50$ mm<sup>2</sup> that can be theoretically placed directly on the FE modules or patch panels used for IO signals, but in reality, the board is too large for the detector modules and also lacks sufficient radiation hardness. The ELMB can withstand a maximum TID of up to 8 krad, provides 64 16-bit analog ADC input channels and several other general-purpose digital/analog IO channels, and has a CAN transceiver to communicate data using the standard 5 V physical layer[\[21\]](#page-140-0). Although more radiation-hard than the most commercially available monitoring boards, the ELMB had to be placed outside the Calorimeter of the ATLAS experiment to protect from harsher radiation. The radiation hardness required for the innermost layer of the new Pixel detector is 500 Mrad. This makes ELMB very unsuitable for monitoring granularity per module per serial power chain because to have such a system, an enormous amount of cables/ material must go inside the detector. The ELMB is better suited for environmental monitoring where the radiation levels are below the maximum tolerated dose level of the board. A newer version of the ELMB has also been developed and produced by a CERN group, which is more radiation tolerant but still does not meet the radiation requirements of the new Pixel detector.

# <span id="page-24-2"></span>3.4 Available commercial solutions

Due to its robustness and reliability, the CAN protocol has gained popularity for space applications in recent years. Under the umbrella of the European Space Agency, the European Cooperation for Space Standardization formulated a new standard that specifies the requirements on top of the existing CAN protocol for space applications [\[22\]](#page-140-1). For the application layer, the standard optionally specifies the CANopen standard and its limited implementation in the hardware to build CAN-compliant controllers that can be used in space. A minimal implementation of the CANopen is also defined where the Object Dictionary of a CANopen device does not have to be dynamic, but it can be hardwired, and many communication objects can be dropped. Most commercial vendors now use this standard to build CAN controllers for space applications; however, certain limitations prevent them from being used in the DCS.

- Although the CAN-controller-based microchips built for space applications are radiation-hard, their radiation tolerance is as little as nothing for the ATLAS ITk DCS application. Most commercial controllers are radiation hard up to an Ionizing dose of  $\approx 100 \text{ krad}$ , e.g., Intersil's ISL7202xSEH which is tolerant only up to 75 krad at a dose rate of 0.01 rad/s [\[23\]](#page-140-2). The TID requirement for the innermost layer of the ATLAS ITk detector is 500 Mrad.
- As this is an extension of the original CAN protocol for space applications, it does not change anything in the original protocol. This means all the controllers must use a standard CAN physical layer, which is impossible with available radiation hard libraries for the targeted IC process.
- Most commercially available ICs do not offer all-in-one solutions. They need extra peripherals, e.g., reference voltage generators, regulators, oscillators, etc., to operate. These external peripherals are also unavailable for required radiation tolerance and require a larger space.
- Size and power are also important factors because no active cooling is foreseen for a component used for monitoring inside the detector volume. This requires very low power consumption for the chip used for monitoring.

# <span id="page-26-0"></span>3.5 Motivation to develop a new monitoring chip

- As mentioned in the earlier sections, the current monitoring systems from CERN and other particle physics experiments do not meet the new Pixel detector's versatility, space, power, and radiation requirements. Also, the available commercial solutions lack the necessary radiation tolerance well, and they are too general and large with added complexity to integrate into the Pixel detector.
- The tracking performance of the ITk is influenced significantly by the amount of material inside the detector volume. Some of the optimizations can be achieved by optimizing the inclination angle of the modules by doing a simulation of the tracking using the GEANT4 [\[24\]](#page-140-3) framework to see how particles move through different materials. Less material inside the ITk than the current ID reduces the probability of electron charge misidentification by at least a factor of 3 [\[11\]](#page-139-0). Developing a new custom-made chip helps reduce the amount of material that goes inside the detector by saving on externally required peripheries and services and reducing the size using only essential functionality.
- Quick OPC-UA Server Generation Framework(quasar) is widely used at CERN to generate and easily manage Open Platform Communications-Unified Architecture (OPC-UA) servers. The existing DCS of the ATLAS experiment makes use of this ecosystem based on the WinCC OA SCADA platform [\[25\]](#page-140-4). The ELMB boards, which the current DCS uses, communicate with the OPC server using the CANopen protocol [\[6\]](#page-138-6). Developing a new chip using the CAN/- CANopen communication protocols will allow easy integration into the current system. It will save a lot of resources required to develop a new infrastructure from scratch.
- In addition to radiation hardness, commercial CAN controllers that implement CANopen in hardware also lack flexibility. Due to Intellectual Property (IP) rights, it is not possible to modify the design to meet the DCS-specific requirements, e.g., Triple Modular Redundancy (TMR) to harden the design against SEE or to add more custom functionality on top of the existing CAN/CANopen protocols.
- A CANopen-based general-purpose monitoring chip can be extended for more functionality in the future and used in other physics experiments or in any high-radiation environments.

# <span id="page-28-0"></span>Chapter 4

# Integrated circuit development and radiation tolerance

# <span id="page-28-1"></span>4.1 Development of Integrated Circuits (ICs)

Application Specific Integrated Circuits (ASICs) are highly customized integrated electronic circuits that are developed to provide a very specific functionality. These integrated circuits are then mostly packaged and produced in large numbers, to be found in almost every electronic device these days. ASICs are categorized into two main domains: analog and digital circuits. Most of the ICs these days include both domains, so they are labeled as Mixed-signal circuits. Primary examples of mixedsignal circuits are the Digital-to-Analog (DAC) and Analog-to-Digital (ADC) data converters. Figure [4.1](#page-29-0) shows some general steps involved in the design of a CMOS IC. Some more sub-processes are involved or may differ depending upon the type of IC.

ASICs can be further categorized into three main types.

- Full-custom ASICs
- Semi-custom ASICs
- Programmable ASICs

Full-custom design ASICs require a lot of time and effort for the development and verification of the desired functionality as they are highly customized, and most of the logic building blocks, analog circuits, and layouts are done from scratch. Power consumption, size, and speed are the main driving factors in designing full custom ASICs. In addition, tolerance to ionizing radiation and Single-Event Effects (SEEs) are also very important parameters for the full custom ASICs used in particle physics experiments and the space industry.

Semi-custom ASICs can be produced a lot faster than full-custom ASICs because a lot of building blocks already exist. A developer does not have to start layouts and definitions from the transistor level. The IC manufacturers provide a lot of standard cell libraries for commonly used logic blocks, e.g., logic gates, Random Access Memories (RAMs), multiplexers, flip-flops, etc. Many custom-made ASICs produced these days belong to this type.

Programmable ASICs are the most flexible, fastest, and economical variant of all

<span id="page-29-0"></span>

Figure 4.1: ASIC design flow

three different types to design and produce. They are usually categorized into Programmable Logic Devices(PLDs) and Field Programmable Gate Arrays(FPGAs). These devices are built with configurable logic blocks that are not hardwired. These devices can be programmed and configured after production using design tools provided by the IC vendors. As the logic is programmed by defining Look-up-tables (LUTs), logical wires, and logic gates, the devices can normally be programmed more than once. This allows easy and very cost-effective upgrading.

#### <span id="page-30-0"></span>4.1.1 Analog IC design

Although digital information processing has increased enormously over time, almost all the signals existing in nature are continuous/analog signals. These analog signals are often sampled to get rid of noise, digitized to increase processing speed, and then driven back to the analog world. This is the reason why many ICs are mixedsignal, where both analog and digital information is processed, e.g., transceivers for wired/wireless communication, audio/video processing, sensors, actuators, etc. A lot of analog signals are so small and noisy that they require amplification and filtration before they can be processed by a complex digital logic on the chip. In the case of mixed-signal ICs, the signals are then converted using an ADC to be processed by some Digital Signal Processor (DSP) on the chip. In the end, the digital signal is once again converted back to an analog signal by using a DAC and then provided to a driver circuit to drive an analog medium.

Single-stage amplifiers are a type of circuit topology where a single transistor is used for signal amplification. The circuit requires external biasing components to define the operating point of a transistor. Single transistor amplifiers can be biased in inverting, non-inverting, or follower configurations where each configuration has its own merits/demerits defined in terms of voltage/current gain, input/output impedance, signal inversion, frequency response, etc. Figure [4.2\(](#page-31-1)a) shows an NMOS transistor connected in a common source configuration. Some more circuitry is required to couple the input and output without disturbing the DC operating point of the amplifier.

A differential amplifier is one of the most important basic building blocks of an Analog IC. Shown in Figure [4.2\(](#page-31-1)b), the circuit amplifies a differential signal, whereas it attenuates the common-mode signal. One of the very important parameters of a differential amplifier is the matching of the two transistors to get the maximum performance. Matching is ensured not only by using transistors of the same size but also by the orientation and nearby structures in the layout, which influence how well the two transistors are matched after fabrication. The differential output is given by:

$$
V_{OD} = V_{O1} - V_{O2} \tag{4.1}
$$

Some of the basic building blocks of analog integrated circuits are mentioned below:

<span id="page-31-1"></span>

(a) A single transistor amplifier NMOS in common source configuration

(b) A Differential amplifier using NMOS transistors

(c) An NMOS-based current mirror

Figure 4.2: Basic circuits to build Analog ICs

- Single-stage amplifiers
- Differential amplifiers
- Current mirror

Figure [4.2\(](#page-31-1)c) shows a current mirror. The current mirrors are used in building analog circuits to work as a precise current source, which is not affected by load conditions. A good current mirror would have a very high output resistance so it is not disturbed by varying load, a very low input impedance, good frequency response, and, very importantly, good precision to copy the reference current. The output current is given by:

$$
I_{OUT} = I_{REF} \frac{K_{nB}(1 + \lambda V_{DSB})}{K_{nA}(1 + \lambda V_{DSA})}
$$
\n
$$
(4.2)
$$

Where  $K_n = \mu_n C_{ox} \frac{W}{L}$ L

$$
I_{OUT} = I_{REF} \frac{\frac{W_B}{L_B} (1 + \lambda V_{DSB})}{\frac{W_A}{L_A} (1 + \lambda V_{DSA})}
$$
\n
$$
(4.3)
$$

#### <span id="page-31-0"></span>4.1.2 Digital IC design

Logic gates, e.g., AND, OR, NAND, etc., are the core elements of a digital IC. Connected together in a complex way, they process information by using only binary signals, where a signal is defined as low or high. In addition to logic gates, digital ICs also use other basic building blocks e.g, flip-flops to store information, multiplexers to reduce resources and channel information efficiently, and Arithmetic Logic Units (ALUs) to do arithmetic tasks, etc.

Figure [4.1\(](#page-29-0)b) shows the main steps involved in the design of a digital IC. The very first phase is the behavioral description. Hardware Description Language (HDL), e.g., VHDL or Verilog, is used to define hardware elements, and afterward, verification is done to confirm the behavior. During the synthesis phase, standard cell libraries from the desired IC process are used, timing constraints are defined, and some generic optimization is done. In the implementation phase, floor-planing, clock tree synthesis, design placement, and verification are done.

# <span id="page-32-0"></span>4.2 Effect of radiation on Integrated Circuits

Electronic circuits are prone to damage caused by radiation. The undesirable effects can be categorized into two broad categories, i.e., cumulative effects and single-event effects (SEE). Depending on the total dose, dose rate, type of radiation, and the target IC type, the damage can cause an IC to stop working completely or shift from the optimal operational point and lose precision over time. The damage caused may or may not be recoverable by annealing. These are very vast topics on their own and not the main scope of this thesis. In this thesis, the radiation effects are only briefly described to characterize and qualify the MOPS chip to meet the defined specifications.

#### <span id="page-32-1"></span>4.2.1 Cumulative dose effects

When electronics, or more specifically integrated circuits, are exposed to radiation, the damage caused by radiation has a cumulative effect. The longer an electronic component is exposed to radiation, the more damage it inflicts on the component. Cumulative radiation damage is further categorized into two different categories. A Total Ionizing Dose (TID) effect is due to Ionizing radiation, and the displacement damage is caused by Non-Ionizing radiation.

Degradation due to Total Ionizing Dose (TID) Ionization is a process where an electron sitting in the valence band absorbs enough energy to jump into the conduction band. This creates an electron-hole pair. If a material is under the influence of an electric field, then this newly created electron-hole pair may not recombine. The movement of these extra charge carriers also introduces a parasitic current in the device. Linear Energy Transfer (LET) is used to define the amount of energy absorbed by the material during this ionization process. How much energy gets transferred to the material by the highly charged particles depends upon the energy, flux, angle of incidence, and the material itself. Because of lower mobility for charge carriers in the

<span id="page-33-0"></span>

Figure 4.3: Radiation Induced Narrow Channel Effects (RINCE) [\[26\]](#page-140-5)

insulators, it is difficult for the electrons and holes to leave the region. Compared to the holes, the higher mobility of electrons lets them escape from the silicon dioxide, whereas the holes remain trapped. This gradual accumulation of holes creates a net positive charge in the region, which is responsible for the change of threshold voltage, noise, leakage current, etc. [\[27\]](#page-140-6). Commercial CMOS electronics can typically withstand up to 5 krad of dose [\[23\]](#page-140-2). Two of the main reasons for TID effects in electronics are the trapped positive charges in insulating parts of a MOSFET, and the second one is the creation of defects at the interface between silicon and silicon dioxide due to the same trapped charges. With the decreasing thickness of gate oxide in the newer IC technologies, IC technology is inherently becoming more and more tolerant to radiation, but some other effects have started to appear. The defects in the spacers and Shallow Trench Isolation (STI) used to isolate transistors contribute significantly towards degradation due to radiation. Positive charge trapped in the spacers can change doping levels and have an impact on the effective resistance of the channel. This is especially true for devices with smaller sizes. The degradation due to this phenomenon is called the Radiation-Induced Short Channel Effect (RISCE) [\[28\]](#page-140-7). As shown in Figure [4.3,](#page-33-0) The term Radiation Induced Narrow Channel Effects (RINCE) was first coined in [\[29\]](#page-140-8). The electric field due to charges trapped in STI affects the current flowing through the channel. Devices with narrow channels/smaller widths are affected more than the devices with wider channels. The charges trapped in STIs are also responsible for the leakage current in the device because of the creation of the parasitic channels. TID effects are studied when an electronic component is under nominal bias conditions. This is due to the reason that an electron-hole pair created by the ionization process has a lower probability of recombining under an applied electric field. TID is defined by the SI unit Gy as well as using the old unit rad where 1 Gy is equal to 100 rad [\[30\]](#page-140-9). TID-induced degradation in electronics for very large doses is dependent on the dose rate, transistor size, biasing voltage, and operational temperature [\[28\]](#page-140-7). For the TSMC 65 nm process, the studies have shown that more damage has been done at low dose rates of 35 krad/h than at much higher dose rates of higher than 2 Mrad/h. The dose rate of 35 krad/h is comparable to what is expected for some of the inner detectors at the LHC; however, it might still be higher for the future ITk Pixel detector [\[31\]](#page-141-0).

<span id="page-34-1"></span>**Displacement damage** Displacement damage occurs to an electronic device when it is exposed to non-ionizing radiation. When high-energy particles hit the material's lattice, they can displace an atom, which can displace even more atoms in the lattice. This causes defects that change a material's electrical properties, doping concentration, etc. In CMOS technology, the TID effects constitute the major damage caused by radiation. No displacement damage studies have been done during the course of this thesis.



Figure 4.4: Single-Event Effect (SEE)

#### <span id="page-34-0"></span>4.2.2 Single-event Effects (SEE)

Single-event effects (SEE) in a semiconductor chip were first reported in [\[32\]](#page-141-1). In comparison to TID effects, SEEs are caused by a single high-energy particle hitting a device, which creates an ionization track similar to that shown in Figure [4.4.](#page-34-1) Because of the electric field, the charges generated near reverse biased  $p/n$  junctions/depleted channels are quickly collected by the drain and source regions. This is responsible for the generation of a current spike. Charge collection in the depleted region happens very fast. However, a slow charge collection also happens outside and far from the depleted region. This is shown in Figure [4.4](#page-34-1) as a Funnel until it reaches the bulk, where most of the created electron-hole pairs start to recombine. Any single ionizing particle randomly hitting a device may not generate a SEE. Two conditions are required for a SEE to happen: the highly energized particle must hit a specific region in a device defined as Sensitive volume, and the charge deposited by a particle hit

must also be larger than the required Critical charge to create an upset. SEEs are not accumulative, and the damage done by them can be a mere corruption of some data or a complete malfunction or bad configuration of the device. In severe cases, an SEE can also damage a device permanently. SEEs can be categorized into two broad categories, which are further divided into several different types of effects [\[30\]](#page-140-9). Here, only a few effects are discussed:

- Soft errors
	- Transients: In this case, a transient happening on some points gets propagated in the circuit as false information, eventually corrupting information, or in the case of Analog design, they can disturb the operation point of a circuit.
	- Static errors: here, information stored in a single memory cell can change.
- Hard errors
	- These types of errors can damage a device permanently if not corrected quickly.

Single-event Transients (SET) A Single-event Transient (SET) is a glitch or voltage spike that occurs when a highly energetic subatomic particle hits an electronic device. The sudden voltage spike generated due to the hit can disturb the operating point of a node in an analog circuit, or it can propagate through the combinational logic of a digital design within one clock cycle, creating logical errors. A SET can become very serious, e.g., if it happens on an asynchronous reset signal or during the critical time when a combinational input is sampled and stored in a flip-flop. In circuits with very high-speed clocks, SET can also result in an SEU [\[33\]](#page-141-2).



Figure 4.5: SEU sensitivity of an electronic device [\[30\]](#page-140-9)
Single-event Upsets (SEU) A Single-event Upset (SEU) is an event where a bit is stored in a storage element, e.g., flip-flop, toggles from low to high, or vice versa. SEU can change stored data or, more importantly, can cause corruption in the configuration memory of a device. To estimate the SEU rate and characterize a device, a crosssection plot is calculated at different LETs in case of irradiation using heavy ions [\[34\]](#page-141-0). As Shown in Figure [4.5,](#page-35-0) a SEU does not happen at energies lower than the threshold LET. The plateau of the curve indicates the total area of a device sensitive to SEUs. For any given LET, the area can be divided by the total number of sensitive nodes in a device to get an estimate of the sensitive area per node. SEU cross-section is defined as:

$$
\sigma = \frac{N_{upsets}}{\Phi \cdot N_{nodes}} \tag{4.4}
$$

Where  $N_{upsets}$  are the total number of SEUs that happen, and  $N_{nodes}$  is the total number of storage cells or sensitive nodes in a device. Φ represents time-integrated particle flux or total fluence in  $cm^{-2} s^{-1}$ . Previous studies have shown that an increase in the operational clock frequency on a chip will induce more SETs in the combinational logic, and a low supply voltage can make flip-flops more vulnerable to SEUs. Therefore, modern CMOS processes working on low voltage and high frequency are more prone to SEUs [\[35\]](#page-141-1). SEU cross-section calculation for different logic circuit implementations using the TSMC 65 nm process, different radiation types, and the factors affecting it have been investigated in [\[36\]](#page-141-2)[\[37\]](#page-141-3).

<span id="page-36-0"></span>Cross-section obtained by irradiating an electronic device using heavy ions can be translated to the expected rate in the LHC environment using the computational method defined in [\[38\]](#page-141-4).



Figure 4.6: Parasitic BJTs in a CMOS device causing a Single Event Latch-up (SEL) [\[39\]](#page-141-5)

Single-event Latch-ups (SEL) Single-event Latch-up (SEL) is a type of hard error. If not corrected swiftly, an SEL damages a device permanently. In a CMOS process, an SEL happens due to the opening of a low impedance path between the supply

#### Chapter 4 Integrated circuit development and radiation tolerance

voltage and the ground potential due to parasitic PNP and NPN transistors shown in Figure [4.6.](#page-36-0) In case of an over-voltage / thermal stress or due to an ionizing particle hitting the device, a positive feedback path is created where the collector current of the first parasitic transistor feeds the base current of the second one and vice versa. An abnormally large amount of current is drawn due to this path creation. To save the device from permanent damage, the power supply must be cut promptly.

<span id="page-37-0"></span>

Figure 4.7: MOSFET layout techniques to reduce leakage current between source and drain [\[40\]](#page-141-6)

## 4.3 TID tolerance

Radiation tolerance for electronic devices can be achieved by either using the approach Radiation Hardening by Process (RHBP), Radiation Hardening by Design (RHBD), or a combination of both. Since RHBP is very expensive, RHBD is normally used in modern CMOS ICs. The following text explains some of the important techniques. As explained earlier in this chapter, the main cause of the TID effects in a CMOS process is related to oxides used in the gate or STI, etc. Thin gate oxide transistors are very tolerant to TID effects and for thicknesses less than 10 nm, the radiation effect due to gate oxide becomes insignificant [\[41\]](#page-142-0) [\[42\]](#page-142-1). Scaling down the CMOS process also reduces the gate thickness, which makes the modern CMOS technology inherently radiation-hard.

Another problem with TID is the leakage current due to trapped charges in the STIs, which are responsible for opening the parasitic channels between the source and the drain. The leakage current due to parasitic channels increases with the increasing total dose until the point where normal operation of the circuit stops. A solution to this problem is to enclose either the source or drain of a transistor using thin gate oxide, as shown in Figure [4.7.](#page-37-0) The most commonly used Enclosed Layout Transistor (ELT) implementation and its effectiveness are further discussed in [\[43\]](#page-142-2). The leakage current between neighboring transistors can be eliminated by using heavily doped guard rings, e.g., a  $p+$  guard ring between  $n+$  diffusion regions. Due to the custom nature of analog IC design, the TID-induced damage, which is dependent on transistor sizing, e.g., RINCE and RISCE, can be reduced by choosing wider and longer channel devices. The design parameters used to increase the radiation tolerance of devices in the analog design are difficult to control in the digital IC design because standard cell libraries available for commercial IC processes mostly use the minimum possible length and width for the transistors. Also, it requires a great effort to produce numerous standard cells using ELT and guard rings.

## 4.4 SEE mitigation

Because the SEEs are caused by single high-energy ionizing particles, to make circuits tolerant, the application environment of an electronic device plays an important role in defining a mitigation strategy. Devices used in the LHC experiments experience particles and their energies differently from the devices used for space applications. As explained earlier in this chapter, an ionizing particle must deposit enough charge in a sensitive volume of a node to create undesired effects. Adding capacitance to the sensitive node can provide a very simple solution, but it has the drawback of more power consumption and loss of metallization options at the top level.

To protect against SETs, de-glitching circuits at the cost of sacrificing speed can be used as well as redundancy in combination with voters can be used in the combinational logic to prevent SETs from propagating in the logic.

Protection against SEUs can be achieved by duplicating information stored at the node level by using architectures like Heavy Ion Tolerant (HIT) proposed in [\[44\]](#page-142-3) and Dual Interlock Cell (DICE) in [\[45\]](#page-142-4). However, the effectiveness of these architectures at the node level is not outstanding for modern CMOS processes due to the fact that smaller transistor sizes in a modern CMOS process imply larger density. The two sensitive nodes of a DICE must be placed far apart in the layout to ensure that they are not hit at the same time. W.r.t. a standard cell, DICE could achieve only a tenfold improvement in a 90 nm process [\[40\]](#page-141-6). Another approach is to use redundancy for the stored information. Triple Modular Redundancy (TMR) is widely used in ICs to protect against SEUs. As shown in Figure [4.8,](#page-39-0) every storing element, e.g., a flip-flop, is triplicated, where each storing element contains a copy of the same data. A voter is used in front to decide the correct value by majority voting. To prevent multi-bit upsets from happening, the triplicated cells must be placed far apart from each other. In the circuits operating at high frequency, the SETs can easily propagate into the storing elements. There are two ways to mitigate this: using Temporal sampling, where a clock skew is used for three triplicated cells to filter out glitches, or by triplication in the combinational logic. Using Temporal sampling can make it very difficult to handle correct timing in the design, whereas triplication in the combinational logic can significantly increase the required area. Error detection and correction techniques can also be used to cope with SEUs, e.g., hamming encoding.

<span id="page-39-0"></span>

Figure 4.8: Full triplication of a digital logic

# <span id="page-42-0"></span>Chapter 5

# First test prototype

# 5.1 Purpose and components of the chip

A first test chip was submitted to study and verify the behavior of the analog circuits required for the actual MOPS chip.

- Three regulators with a regulated output voltage of 1.2 V, 2.4 V, and 3.6 V are powered using 5 V power supply. The 1.2 V regulator was intended for the digital part of the chip, while the other two were required by the standard CAN physical layer
- Standard CAN transceiver using a supply voltage of  $5V$
- Serial Control Bus (SCB) physical layer
- 3000-bit shift registers to study SEU cross-section

The chip was built using the TSMC 65 nm process, which allows a maximum of 1.2 V for the transistors. The high-voltage circuits were built using transistors in a cascoded topology to sustain a voltage higher than what is specified by the fabrication technology. For the analog part, the regulators were developed by Michael Karagounis (FH Dortmund), and the other components on the chip were a joint effort from the people from FH Dortmund and the University of Wuppertal. The Serial Control Bus (SCB), a physical layer, was also used on the chip to test communication with the PSPP chip [\[46\]](#page-142-5).

## 5.2 3000 bits shift register for SEU studies

The digital part of the test chip included only a 3000-bit long shift register logic to study SEUs. This is the same logic that was also implemented in the Pixel Serial Powering & Protection (PSPP) chip [\[46\]](#page-142-5) and was integrated into the MOPS chip as well for SEU studies. Two instances of the 3000-bit long shift register are included in the chip. One instance is the simple shift register, while the other uses full triplication. A simplified schematic of the shift register logic is shown in Figure [5.1.](#page-43-0)

<span id="page-43-0"></span>

Figure 5.1: TMR shift register schematic [\[46\]](#page-142-5)

## 5.3 Problems seen on the chip

A mistake was made regarding the IO pad selection for the high-voltage IO signals from the regulators and the CAN physical layer. The pads used in the chip for high voltage signals allow a maximum signal of 2V. If the voltage exceeds 2V, then the ESD protection circuit gets activated, and the high voltage signal is shorted to either supply voltage or ground, depending on the polarity. Because of this problem, testing the high-voltage circuits was no longer possible. A special procedure called Focused Ionizing Beam (FIB) was used to cut the ESD structures. The intention was to eliminate the ESD protection circuit, although it was not desired in normal circumstances. As the FIB procedure cannot guarantee that only specific metal layers are destroyed, the procedure done to chip created short circuits. It was later decided in the revised baseline that high voltage regulators and the CAN Physical layer must be omitted in the chip to avoid technology limitations and mitigate radiation damage, which is significantly higher at high voltages.

## 5.4 Outcome and lessons learned

Although the results of the analog circuitry were not promising, they still helped to understand some design problems, technology limitations, and shaping decisions that influenced the design and development of the succeeding versions of the MOPS chip. The on-chip digital part included only the shift register for the SEU studies. These chips were then used for the SEU studies carried out in Leuven. The results are explained in the chapter [9.](#page-114-0)

# Chapter 6

# Monitoring of Pixel System (MOPS) chip

The Monitoring of Pixel System (MOPS) chip is an Application Specific Integrated Circuit (ASIC) designed to be used in the DCS to independently monitor the voltage and the temperature across the detector modules. The detector modules constitute the front-end electronics responsible for tracking and data read-out. Chapter [3](#page-20-0) explains the concept of the DCS for the Pixel detector, and Figure [3.3](#page-23-0) shows the topology of the MOPS chip's connection in a serial powering chain. The chip has been developed using the TSMC 65 nm CMOS process. CERN has put a tremendous amount of effort into studying the radiation tolerance of the technology in detail for future upgrades. The TSMC 65 nm CMOS process itself is inherently radiation-hard, but to make it more radiation-hard to be able to tolerate radiation levels for different experiments, CERN provides additional custom libraries Hardened by Design (HBD) for different fundamental building blocks and also provides simulation corners to characterize circuits for TID.

## 6.1 Requirements for the chip

Several requirements for the MOPS chip are defined in the specification documents [\[18\]](#page-139-0) and [\[47\]](#page-142-6) related to monitoring precision, radiation hardness, operational temperature, size, packaging, etc.

Voltage monitoring The chip must be able to monitor a maximum of 16 modules in a single serial powering chain [\[11\]](#page-139-1). During regular operation, every detector module has a nominal voltage drop of  $2V$ . This makes a maximum of  $32V$  in a single chain. The MOPS chip must provide a relative precision of 15 mV for the voltage measurement within its operational temperature range.

Temperature monitoring Every detector module in the serial powering chain hosts a Negative Temperature Coefficient (NTC) resistor to measure the temperature. The reference voltage for this resistor must be provided by the MOPS chip. The chip should provide a relative temperature monitoring precision of 1 K.

Radiation hardness Radiation hardness is one of the most essential requirements for the MOPS chip. The chip will also be operated in the innermost layer of the Atlas ITk Pixel Detector, so it has to qualify for radiation levels of up to 500 Mrad of TID.

**Operational temperature** The operational temperature range for the chip is  $-40^{\circ}$ C to 60 °C.

SEE tolerance The chip must be tolerant of SEEs. An SEU in the monitoring data can be tolerated but can be fatal and unacceptable in the configuration memory of the chip. Frequent SEUs in the configuration memory can cause the chip to require more power cycles to provide reliable monitoring data. This introduces transients in the system, so the amount of transients must be reduced as much as possible. In addition to that, the chip must also be tolerant towards SET and SEL, as explained in Chapter [4.](#page-28-0)

Independent powering regulator The chip must have an on-chip voltage regulator to ensure a stable power supply to the chip and to provide a reference voltage for the NTC resistor to monitor the temperature reliably. The Voltage regulator must be able to compensate for long power lines, which can be up to approximately 70 m in length. The current consumption of the chip varies with bus activity, which also causes varying voltage drop on the long power lines. The regulator must be able to regulate for a large input supply voltage range.

Communication protocol A robust and reliable communication protocol must be chosen to communicate over long distances. The number of service lines required by the communication protocol must be as small as possible. Therefore, the chip implements CAN and CANopen protocols, which are explained in sections [6.3.1](#page-48-0) and [6.3.2.](#page-51-0) The chip must also be able to drive the communication bus of 70 meters in length with included filter capacitors of 10 µF.

**On-chip oscillator** If there is no clock available on the communication lines or can not be extracted from the data, a local on-chip oscillator is required to generate frequency for the digital part of the chip. The oscillator should provide a possibility to be trimmed remotely after installation. Trimming is necessary to compensate for process variation and radiation and for variation in operational conditions.

## 6.2 Chip versions

The first test prototype of the chip is called DCS-controllerv1. At the time of submission of the first version, the digital part of the chip was still not mature enough to be fabricated, so it was programmed on an FPGA [\[48\]](#page-142-7). The digital part also included the logic to communicate with the PSPP chip [\[46\]](#page-142-5), which was defined in the original ITk Pixel Detector TDR [\[11\]](#page-139-1) but was later dropped in the revised baseline of the Pixel project. The Analog part of the chip included the voltage regulators and the standard CAN physical layer. More details about this first test chip are provided in Chapter [5.](#page-42-0)

This chapter discusses the two complete versions of the MOPS chip, the MOPSv1 and the MOPSv2. The MOPSv2 is essentially an improved version of the MOPSv1 with some added functionality, e.g., automated oscillator trimming more watchdogs to increase SEU tolerance and malfunction mitigation. The watchdogs allow for resetting the core state machines and frequent configuration reloading to prevent a high number of power cycles.

# 6.3 MOPS communication protocol

The chip makes use of two well-defined communication protocols: the Controller Area Network (CAN) [\[49\]](#page-142-8) and the CANopen [\[50\]](#page-142-9) standard. Figure [6.1](#page-49-0) shows the Open Systems Interconnection (OSI) model. The CAN protocol is only defined for the physical and the data link layers. To handle communication effectively between different nodes on a CAN bus, the CANopen is one of several protocols defined for the rest of the upper layers. The CANopen mainly covers the transport and the application layers because the definitions in others are not required in a CAN network.

The MOPS chip implements more functionality on top of both CAN/CANopen base standards and does some customization. Below mentioned are some important differences:

- Although the digital part of the chip's CAN node complies with standard ISO 11898-1:2015 [\[49\]](#page-142-8), the physical layer uses different analog signal levels. Therefore, the physical layer signaling is incompatible with the standard/commercial controllers without translating the logic/voltage levels.
- The on-chip oscillator can be done at the start-up or after the remote reset command. During the calibration phase, the chip listens to the bus like any other CAN node but does not actively participate in the communication.
- The chip is not fully CANopen compliant. It only implements a minimal set of SDO and NMT communication objects.
- The minimal CANopen implementation in the hardware strategy used on the MOPS chip is similar in approach but completely different from what is defined in [\[22\]](#page-140-0).

## <span id="page-48-0"></span>6.3.1 CAN bus communication protocol

As shown in Figure [6.2,](#page-49-1) CAN is a serial communication bus where all the nodes are connected to a single bus terminated by a termination resistor at both ends. A CAN bus requires only four lines in total. Two lines are required for powering, and the other two are used to communicate using differential signaling. The CAN protocol

<span id="page-49-0"></span>

Figure 6.1: The layers defined in the Open Systems Interconnection (OSI) model [\[51\]](#page-143-0)

states no limit for the maximum number of nodes allowed on a single bus; however, the bus load limits the number. There is no single master on the bus, so every node is allowed to send messages at any time. Real-time non-destructive arbitration happens whenever more than one node tries to send a message simultaneously. A frame with the highest priority gets access to the bus. The priority of any frame is solely defined by the 11-bit identifier sent at the beginning.

<span id="page-49-1"></span>

Figure 6.2: Controller Area Network (CAN)

Some features of the protocol are:

- Emphasis is placed on the priority of the messages where all nodes have unrestricted access to the bus
- Message arbitration is non-destructive
- All nodes can listen to all messages on the bus and decide for themselves whether the message is important for them or not
- Data integrity is ensured by robust error detection features
- Differential signaling to provide noise immunity

<span id="page-50-0"></span>

| <b>SOF</b> | Arbitration                         | Control                                 | Data    | CRC     | EOF   |
|------------|-------------------------------------|-----------------------------------------|---------|---------|-------|
| 1 bit      | 11 bits or<br>29 bits<br>(extended) | 4 DLC and few<br>other specific<br>bits | 8 bytes | 15 bits | 1 bit |

Figure 6.3: CAN classic frame format

- Data can be requested remotely between different nodes
- Automatic re-transmission of the messages that lost arbitration

The format of a classic CAN message frame can be seen in Figure [6.3.](#page-50-0)

• SOF/EOF

These fields define the start of a frame and the end of a frame.

• Identifier / Arbitration field

The 11-bit identifier uniquely identifies a message on the CAN bus. This is used to do arbitration where a CAN frame with the highest priority gets access to the bus. If more than one node tries to send a message on the bus simultaneously, then the message with the highest priority will be delivered. It also serves the purpose of message filtration, where every node on the bus can listen to any message but decides for itself whether it is important or not.

• Control field

The control bits section of a CAN message defines the length of data in a message and some other features, e.g., whether a message is a remote request or a standard message that includes data, length of identifier, and some reserved bits, etc.

• Data field

This field of a CAN message holds the actual data. In the classical frame format shown in Figure [6.3,](#page-50-0) the data field can have no data or a maximum of up to 8 bytes.

• Cyclic redundancy check (CRC) / error correction field

This part of the frame holds the CRC checksum bits and the acknowledgment bit to prove the integrity of the message.

#### 6.3.2 CANopen protocol

The CANopen is an application-level protocol that can be used in conjunction with the CAN protocol. Figure [6.4](#page-51-0) shows the model for each MOPS chip on a network. CANopen is a device and manufacturer-independent communication protocol defined

<span id="page-51-0"></span>

Figure 6.4: CANopen device model of the MOPS chip

<span id="page-51-1"></span>

Figure 6.5: COB-ID in CANopen

for the application layer in the Open Systems Interconnection (OSI) model. As shown in Figure [6.1](#page-49-0) the CANopen covers the network to application layers of the OSI model. Due to the architecture of the CAN protocol, no definition is required for the presentation, the session, and the network layer of the OSI model. The CANopen mainly implements the transport and the application layer. The protocol uses the CAN identifier field defined in Figure [6.3](#page-50-0) to distinguish between different communication objects and address nodes connected to the network as shown in Figure [6.5,](#page-51-1) seven of the eleven identifier bits are used to address different nodes on the bus. This implies a maximum of 127 CAN nodes with distinct identifiers can be connected to a single bus in a CANopen network. The CAN protocol does not define a maximum number of CAN nodes. Theoretically, more than 127 nodes can be connected to a CANopen network, but they can not be addressed uniquely. The nodes without unique IDs can still read and broadcast messages on the bus. The protocol offers the possibility of synchronization on a CAN network. CANopen uses predefined communication objects to transfer information between different nodes. The predefined communication objects are categorized into the following types:

- Network Management (NMT) to manage/analyze the state of the network
- Process Data Objects (PDO) to transfer real-time process data
- Service Data objects (SDO) to configure nodes on a network
- Synchronization objects (SYNC) to synchronize timing and coordination between the different nodes
- Emergency objects (EMCY) to report malfunctioning by the nodes
- Time stamp objects (TIME) for time-critical applications

Due to its complexity and size, a complete implementation of the CANopen in hardware requires large storage on individual nodes and also adds complexity in the logical part. The CANopen is almost always defined at the software level for the standard CAN controllers.

In the MOPS chip, CANopen is embedded into the hardwired digital logic. The chip provides no dedicated logic to store the definition of the communication objects. For the following reasons, a modified, reduced subset of CANopen is implemented using Network Management (NMT) and Service Data Objects (SDO).

- The chip must be radiation hard up to an ionizing dose of 500 Mrad. Increasing storage elements on the chip requires more resources and complex logic to protect against SEEs.
- The final chip size can only be  $2 \times 2$  mm<sup>2</sup>. The size is limited by the cost, space, and material budget available in the final system. A complete CANopen implementation requires a significant increase in the fabricated chip size, which increases the cost with no added benefit; instead, the chip will become more vulnerable to the radiation effects.
- The DCS refresh rate requirement of 5 s [\[18\]](#page-139-0) is very relaxed. Therefore, there is no need to implement PDOs to expedite the monitoring data.
- The system topology requires the chip to send the monitoring data only upon request and stay silent otherwise.
- Expedited transfer offered by the Service Data Objects (SDOs) with a bus speed of 125 kbps is sufficient to meet the requirement for the data rate, data length, and refresh rate of the whole system.

### <span id="page-52-0"></span>Network Management Object (NMT)

Three items are implemented in the chip for this type of communication object. One of the nodes connected to the network acts as a logical master. The master node doesn't need to have a COB-ID.

- Sign-in message
- Node guarding
- Remote reset

Sign-in message As the name suggests, whenever a chip is connected to the bus and powered on, it sends a sign-in message over the bus. Each node on the bus generates a single sign-in message when the power is supplied. The structure of this sign-in message is the same as the node guarding. It is the same node guarding message but is sent only once during the start-up by the remote node despite no request from the master node. Following the CAN protocol, a node keeps repeating a sign-in message until it is acknowledged. Figure [6.6](#page-53-0) shows the structure of a sign-in message.

Node guarding The Node guarding is another NMT object implemented in the MOPS chip. The Node guarding message allows a master node in the CANpoen network to check whether a node is still alive and operational. Again, this protocol part is simplified to implement what is absolutely necessary.

Whenever there is a message on the bus with the identifier  $\sqrt{0x700} + \text{node-id}$ , the respective node will respond with a single byte of data where the MSB should toggle for every new request. If the master node finds no response from the slave node or the MSB of the response message does not toggle for every new request, there is something wrong with that node. Either it is not working, or the configuration values in the registers have been changed. A reset request will then be required to return the node to the operational state. All other status messages are not implemented. Figure [6.6](#page-53-0) shows the format of the Node guarding.

<span id="page-53-0"></span>

Figure 6.6: Node guarding/sign-in message in the MOPS chip

Remote reset request At any time, a MOPS chip can be in one of the three states: non-operational, operational, and configuration. When the chip is working normally, it is in the operational state. The chip is in the configuration state when the chip is doing automated trimming upon power-up or on demand using a remote reset command. Whenever a chip must be reset remotely, an empty message with an ID '0' is sent over the CAN bus. The CANopen standard defines this as node-dependent, but to simplify our application, every node will reset itself whenever an ID '0' is detected on the bus. After the reset, default configuration values, e.g., timing, etc, for the CAN node are reloaded from hardwired combinational blocks, and all the state machines will be returned to a known state.

#### Service Data Objects (SDO)

The Service Data Objects (SDO) provide access to all the entries of an object dictionary in a CANopen device. The SDO communication is available during preoperational mode. It is used to configure a CANopen device, but it is also possible to read out a particular entry using the SDO read protocol.

The CANopen protocol always confirms the read/write operation when using the SDO communication objects. The chip implements service data objects of the CANopen standard explained in section [6.3.2,](#page-51-0) but only the expedited transfer mode is implemented. The Object dictionary is defined so its entries do not exceed four bytes. All the object dictionary entries can be accessed using expedited data transfer of SDOs, which allows a maximum of four bytes of data to be sent in a single CAN message. In comparison, the four remaining bytes are used for other task details, i.e., index, sub-index, and SDO operation details.

Different types of SDO frames that are used to communicate with the MOPS chip are shown in Figure [6.7](#page-54-0) to Figure [6.11.](#page-55-0)

The following terms are mentioned in the figures.

scs: Command specifier

 $N: n = 4$  - Data size

T: Toggle bit (unused)

E: Expedited transfer is used

S: Data size is indicated

<span id="page-54-0"></span>Table [6.1](#page-55-1) shows the message encoded with each abort code. All values are in hexadecimal.

| $600 + node$ id | Cmd= $40h$ | Index    | Subindex |           |
|-----------------|------------|----------|----------|-----------|
| COB-ID          | Byte 0     | Byte 1&2 | Byte 3   | Byte 4  7 |

Figure 6.7: SDO read request to the MOPS chip

| $580 + node$ id | $Scs = 010b$<br>Bits 0.2 | $T=0$<br>Bit 3 | N<br>Bit<br>4.5 | $E=1$<br>Bit 6 | $S=1$<br>Bit 7      | Index  | Subindex                  | Data |
|-----------------|--------------------------|----------------|-----------------|----------------|---------------------|--------|---------------------------|------|
|                 |                          |                |                 |                |                     |        |                           |      |
| COB ID          | Byte 0                   |                |                 |                | <b>Byte 1&amp;2</b> | Byte 3 | $\mathbf{z}$<br>Byte 4  7 |      |

Figure 6.8: MOPS response to the SDO read request

<span id="page-55-1"></span>

|           | Abort code Description                                       |
|-----------|--------------------------------------------------------------|
| 0504 0000 | SDO protocol timed out.                                      |
| 0504 0001 | The client/server command specifier is not valid or unknown. |
| 0601 0000 | Unsupported access to an object.                             |
| 0601 0001 | Attempt to read a write-only object.                         |
| 0601 0002 | Attempt to write a read-only object.                         |
| 0602 0000 | The object does not exist in the object dictionary.          |
| 0606 0000 | access failed due to a hardware error.                       |
| 0606 0007 | timeout of communication occurred.                           |
| 0609 0011 | Sub-index does not exist.                                    |
| 0800 0000 | General error                                                |

Table 6.1: SDO abort codes

| $600 + node$ id | $Scs = 001b$<br>Bits 0.2 | $T=0$<br>Bit 3 | N<br>Bit<br>4.5 | $E=1$<br>Bit 6 | $S=1$<br>Bit 7 | Index               | Subindex | Data           |
|-----------------|--------------------------|----------------|-----------------|----------------|----------------|---------------------|----------|----------------|
|                 |                          |                |                 |                |                |                     |          |                |
| COB ID          |                          |                | Byte 0          |                |                | <b>Byte 1&amp;2</b> | Byte 3   | Byte $4 \dots$ |

Figure 6.9: SDO write request to the MOPS

| 580 + node id | Cmd= $60h$ | Index               | Subindex |           |
|---------------|------------|---------------------|----------|-----------|
|               |            |                     |          |           |
| COB ID        | Byte 0     | <b>Byte 1&amp;2</b> | Byte 3   | Byte 4  7 |

<span id="page-55-0"></span>Figure 6.10: Reponse of the MOPS chip to an SDO write request



Figure 6.11: SDO abort message in case of an error

## 6.4 Digital block

<span id="page-56-0"></span>

Figure 6.12: Abstract view of the digital logic on-chip

Alexander Walsemann did the physical implementation part of the digital design from Fachhochschule Dortmund. The chip's digital part is designed to keep it as small as possible, i.e., to implement only constructs that are absolutely necessary for the DCS application. This also reduces the complexity of the design. The logic, however, allows easier extension to include more constructs in the future if required. The digital part of the MOPS chip provides everything related to the CAN and CANopen protocol implementation, ADC interfacing, Automated trimming control unit, etc. The Moore Finite State Machine (FSM) concept is used everywhere in the logic for the state machines, which means the output is defined by the current state only. Mentor Graphics's HDL designer [\[52\]](#page-143-1) and Questasim [\[53\]](#page-143-2) have been used extensively to define the state machines graphically and to verify the design, respectively. Verilog 2005 [\[54\]](#page-143-3) has been used for the complete design, while Systemverilog [\[55\]](#page-143-4) has been used for verification purposes.

Figure [6.12](#page-56-0) shows an abstract view of the complete digital logic.

• The hierarchical topmost logic that controls the complete behavior of the chip is the Top-state machine. A simplified abstract view is shown in Figure [6.15.](#page-59-0)

It defines the top-level chip operation, arbitrates, delegates tasks, and defines priorities for operation among other state machines and digital blocks.

- The Object Dictionary and the Object Dictionary control blocks are a combination of state machine, sequential, and combinational blocks to realize the CANopen in hardwired logic. Shown in Figure [6.16](#page-61-0) and Figure [6.17](#page-62-0) is an overview of the sub-blocks. The implemented CANopen logic is flexible and allows future integration of other CANopen constructs to enhance functionality.
- The ADC interface logic takes care of the timing and signals to ensure the correct read-out of the ADC data.
- There are two watchdog timers on the chip, which ensure safer operation and mitigate SEU effects to avoid frequent power cycles or data corruption. They are triggered by malfunctioning or inactivity on the chip.
- The CAN interface unit is responsible for initializing, configuring, and controlling the CAN node. There are two buffers to hold the incoming and outgoing messages.

More details about the blocks are described in the following sub-sections.



## <span id="page-57-0"></span>6.4.1 Initialization scheme

Figure 6.13: Initialization scheme at the top-level logic

Figure [6.13](#page-57-0) shows the initialization scheme at the top-level Finite State Machine (FSM). The chip is initialized after the power-up to have the correct information related to the CAN bus timing and other registers, along with important information for proper functionality. Although the CAN node part of the chip allows different configurations of the bus timing by providing configuration registers, the top-level logic of the chip loads these configuration registers during power-up using hardwired values defined in a combinational block shown as Configuration in Figure [6.12.](#page-56-0) The reason for loading configuration registers using hardwired values from another block is to initialize the chip itself without any external signals/communication protocol. If required, this also provides the flexibility of changing the timing values quickly at the top level in later versions of the chip. In addition to that, hardwired values make the chip more tolerant to the SEUs by refreshing the configuration periodically. The chip does frequency trimming either manually by reading values defined on the external pads or automatically by reading incoming CAN messages. If the automated frequency trimming procedure is enabled, the chip loads configuration by decoding the incoming CAN message but only sends a sign-in after the trimming. If the feature is enabled but no message is sent over the bus within  $\approx 4$  to 6s, depending on the default frequency, the watchdog timer forces the chip to use the default frequency on-chip and then generates a sign-in message. The simulation in Figure [6.14](#page-58-0) shows

<span id="page-58-0"></span>

Figure 6.14: Sign-in after the automated trimming procedure

when Automated trimming is enabled during chip powering. Signals in the red blocks show transition on the CAN bus and corresponding data. In the automated trimming

configuration mode, the chip requires frequent transitions on the bus to detect the mean frequency. This can be realized by sending alternating bits on the CAN bus. The blue block shows a trimmed frequency of 10 MHz on the bus. An active high 'ready\_osc' signal in the cyan-colored block indicates when the chip is ready, and 'txmops' represents the signal where the MOPS chip drives the bus to send the signin message. The sign-in message from the MOPS gets decoded on the bus with an ID of '701'.

#### <span id="page-59-0"></span>6.4.2 Top level state machine



Figure 6.15: Top-level FSM to define the operation of the MOPS chip

A simplified abstract view of the top-level FSM, which controls the behavior of the MOPS digital part, is shown in Figure [6.15.](#page-59-0) It defines the overall operation of the chip, delegates tasks to the other state machines in the logic, and ensures that the chip is always in listening mode, even during the sampling of an ADC channel. It bridges the CAN bus logic, the CANopen logic, and the ADC on-chip. After the initialization phase shown in Figure [6.13](#page-57-0) ends, the chip goes into the 'Idle' state. The chip stays in this state until an interrupt signal tells the FSM to read or send a new message. Whenever a new message is ready to be read/written to the CAN bus or the CANopen side, the FSM delegates these tasks to corresponding logic blocks and returns to the 'Idle' state. Both watchdogs on the chip can bring the top FSM back to the 'Idle' state.

## 6.4.3 CAN node interface

The CAN node has a register-based interface to store timing and masking configuration specific to the bus. The top-level logic has three blocks to interface with the CAN node.

- An interface FSM to control operation and handshake signals in a synchronous manner
- An interface block that has both sequential and combinational logic to provide data buses to read/write CAN node as well as to get configuration data from the initialization block
- The third block is purely combinational to prioritize the incoming messages

The CAN node takes care of the ID arbitration over the bus, which is insufficient. This only defines which message has the highest priority at any given time, so it gets access to the bus first. In the case of the MOPS, the processing time of a single message on the chip is approximately 1.3 ms because of the slow ADC clock. If a new request arrives at the chip when it is already handling the previous request, then the chip must be able to handle the priority based on the COBID shown in Figure [6.5.](#page-51-1) The SDOs get priority over PDOs, and especially in the case of remote reset requests, the chip will stop doing everything to listen to the bus and automatically trim the oscillator.

### 6.4.4 On-chip CANopen implementation

Implementing the CANopen protocol requires an Object Dictionary (OD) definition. This is common for the server and all the CANopen nodes on the network. The object dictionary defined for the MOPS chip specifically meets the requirements of DCS monitoring. The requirement was to make the design small and as simple as possible to remain within the space and cost budget defined for the chip. To meet this goal, unnecessary generalized OD entries are not implemented. As explained in section [6.3.2,](#page-51-0) the CANopen standard uses predefined communication objects to communicate over the CAN network, but not all are implemented in the chip. The CANopen digital block is divided into two sub-blocks. One block defines the complete OD, while the other sub-block implements the interface between OD and the rest of the digital logic. As shown in Figure [6.16,](#page-61-0) the interface block is further divided into

four sub-blocks that are explained below:

- FSM
- Message type decoder
- Failure response
- <span id="page-61-0"></span>• Object Dictionary interface



Figure 6.16: CANopen realization in the digital logic

The FSM is responsible for handling tasks related to CANopen communication objects. It controls the signals to instruct other blocks to decode and categorize incoming messages from the received buffer into different communication objects, and it also prepares responses to be written to the output buffer. After receiving the decoded information from other blocks, the FSM defines the handling procedures for different communication objects. It defines SDO expedited transfer and NMT objects in detail. Although PDO, SYNC, and EMCY are the communication objects that are not used in the chip, they are still defined in a limited way, so they can be easily integrated if required in the future.

The 'message type decoder' is a combinational block identifying a newly arrived message in the received buffer. This block generates signals for the FSM to indicate the type of the communication object and provides an index/sub-index to the interface block.

The 'SDO failure response' block is also a purely combinational block that generates failure information in case of an SDO communication failure. A complete list of codes is shown in Table [6.1.](#page-55-1)

The block 'object dictionary interface' has read and write access to both the received and transmitted message buffers. This block also has the interface to communicate with the OD. The commands to read or write the OD or the data buffers come directly from the FSM. The OD interface block consists of both sequential and combinational modules.

#### 6.4.5 Object dictionary

The Object Dictionary defined in the MOPS chip consists of three blocks, as shown in Figure [6.17.](#page-62-0) The logic is separated into three units to define entries, access control, and data storage separately. The block that defines all the OD entries for the chip is a purely combinational and very large multiplexer. It gets input signals from the interfacing blocks to address all the entries defined in the large multiplexer. All the constant values are defined as hardwired values, while the writable values are read directly from the block 'stored OD entries'. For each entry in the OD, the large multiplexer also sets appropriate asynchronous access signals, which are then passed to the block 'Access control'.

The 'Access control' block generates synchronous control signals to allow writing new information to the writable registers defined in the OD.

<span id="page-62-0"></span>The block 'stored OD entries' holds all the sequential registers defined in the OD. It reads the content of the register passed directly by the multiplexer block and acts on it using the access control signals defined by the 'Access control' block. Values are driven back as feedback to the multiplexer block.



Figure 6.17: Object dictionary to define communication objects

### 6.4.6 ADC Interface

The MOPS chip does not store ADC values except the current value in the transmit buffer register. The current value is stored for a very short time before transmitting it over the CAN bus. The chip uses SDO expedited transfer to transmit monitoring values, so at any given time, a single CAN message can hold information for only one monitored value. The OD defines separate entries/indexes for each monitoring channel. The ADC on-chip works using a 10 kHz signal which is scaled down from the main clock of 10 MHz. Only one of the twelve ADC bits is sampled during a single clock cycle. This makes the ADC very slow compared to the rest of the digital logic and CAN communication; therefore, storage of monitoring data is not required. This reduces the size of the digital logic and the probability of monitoring data corruption due to SEUs. The ADC interface has an FSM to respect signal timing and a combinational block to translate OD indexes to select a respective ADC channel. 35 out of 40 channels are used on the MOPS chip. Instead of using a multiplexer, the ADC

on-chip uses 40 separate lines to activate one of the channels. After a new message arrives in the received buffer register, the index and sub-index of the OD are directly defined and mapped to the respective ADC channel. To ensure correct read-out, a channel is selected many clock cycles before the start of the conversion signal.

<span id="page-63-0"></span>

#### 6.4.7 Watchdog timers

Figure 6.18: Watchdog timers available on the chip

Figure [6.18](#page-63-0) explains the logic of the watchdogs available on the MOPS chip. Two watchdogs are available on the chip: the watchdog timer and the configuration reloader. The times mentioned in the figure are w.r.t. the on-chip frequency of 10 MHz. An important point about the watchdogs is that none of them takes the chip into the mode that requires Automated trimming once again. This is to avoid the scenario where the MOPSHUB [\[20\]](#page-139-2) does not know that the chip has gone into the trimming mode. Both watchdogs do not reset configurable storage in the OD, which also holds values for the oscillator trimming and trimming of the DAC network inside the ADC. Upon activation, the watchdog timer triggers a sign-in message, but the configuration reloader reloads without indication.

The watchdog timer mainly looks for any malfunction in the internal logic of the

<span id="page-64-0"></span>

Figure 6.19: Watchdog timer on the MOPS chip to reset the state machines

chip by continuously monitoring all the state machines at the top level. If any of the top-level state machines remain in a state other than 'Idle' for more than 5 s, then the watchdog timer triggers a soft reset. Depending on the frequency, the watchdog timer releases a soft reset signal for all the state machines after 5 s given the condition that the on-chip frequency is 10 MHz. The timeout counter changes accordingly if the frequency is other than 10 MHz on the chip. After the reset, the chip initializes itself and sends a sign-in message. The watchdog timer makes sure that all the state machines never remain stuck in a locked state due to a bug or an SEU. Shown in Figure [6.19,](#page-64-0) the red block shows four 'entimeout' signals generated from all four state machines at the top-level logic to indicate their current state and the green box shows the current frequency of the chip. There is a logic 'OR' between all four signals. If any of the four signals is 'True', the output will become 'True'. Here, the 'entimeout2' is True while the other three are 'False'. Around  $\approx 4.125$  s, the 'timeoutrst' signal in the blue block is generated to trigger a reset for all the state machines. Figure [6.19](#page-64-0) shows the scenario where no trimming configuration was provided to the chip after a power-up. The watchdog timer prevents the chip from waiting forever for the automated trimming. The configuration for the chip is loaded in the same way as shown in Figure [6.21\(](#page-66-0)explained in the next paragraph).

The configuration reloader is a second watchdog available on the chip. It checks for conditions opposite to the watchdog timer where a state machine remains in the 'Idle' state for too long. This indicates no activity happening in the digital part of the chip.

This is normal but can also be due to the lack of CAN communication on the CAN bus. An SEU can change the timing configuration on the CAN node, which results in errors on the bus, ultimately leading to the Bus-off state where the CAN node remains silent and does not participate actively. The configuration reloader reloads the CAN node configuration and resets all the state machines after every 250 s of no activity in the digital part. This process does not interrupt the CAN message reception because the reload time is less than one bit wide on the CAN bus and is completely independent of the process happening on the CAN node to decode bus communication. As shown in Figure [6.20,](#page-65-0) the red box shows the important signals from the register-based interface of the CAN node. The signal 'addr' represents the address of the registers holding configuration, 'data  $\int$ init/config' are values provided to the registers, whereas 'write' is the control signal. The green box shows the four 'reload' signals fed by the four state machines to indicate the current state. There is an 'AND' logic between the four signals. The 'reloadconf' marked in the blue box goes high around 267 ms of simulation time. The exact time is dependent upon the on-chip frequency. A zoomed-in simulation of the actual configuration reloading is shown in Figure [6.21.](#page-66-0) The red block shows the write signal, address, and data written to the CAN node. The counter shown in the blue box starts counting again after the reloading is finished.

<span id="page-65-0"></span>

Figure 6.20: Configuration reloader activation due to no activity on the bus

<span id="page-66-0"></span>

Figure 6.21: Configuration reloader signals to reload the CAN node configuration and resets state machines

#### 6.4.8 Automated Oscillator trimming

The Automated trimming is only done after the power-up and after the remote reset request explained in section [6.3.2.](#page-52-0) Non other than the above two conditions can make the chip go into automated trimming mode. After a remote reset request, automated trimming is done to mitigate power cycling due to an SEU or a frequency shift due to temperature/radiation effects. Figure [6.22](#page-68-0) explains the logic flow for two scenarios when the automated trimming is triggered. After the power-up, the chip enters the configuration mode if automated trimming is enabled. After finishing the trimming, the chip initializes, sends a sign-in message, and then remains idle. A second scenario is when a chip receives a message that does not match its node ID. If the ID of the message is zero, the chip understands it as a remote reset command, irrespective of the data in the frame. This remote reset brings the chip into configuration mode. A prerequisite for the remote reset is that the deviation in the on-chip frequency must be within  $3\%$  of the mean frequency of 10 MHz. If the frequency shift happens on the chip, then this can be very easily detected by the MOPSHUB [\[20\]](#page-139-2) by looking at error counters on the bus. In case of large error counters, the MOPSHUB can initiate a remote reset request autonomously and independent of the central DCS. This can trim the oscillator on-chip, eventually saving an extra power cycle. The remote reset command has proven helpful during the irradiation and temperature tests. Details are described in Chapters [8](#page-102-0) and [9.](#page-114-0) The control block for the Auto trimming feature is designed by Michael Karagounis (FH Dortmund), whereas the top-level and OD integration are done within the scope of this thesis.

The automated oscillator trimming procedure makes use of the information provided by the bit timing block in the CAN protocol unit. As shown in Figure [6.23,](#page-68-1) the CAN standard splits bits into several-bit time quanta. These bit time quanta are grouped into segments. The falling edge of a new dominant bit is expected to happen on the CAN bus during the synchronization segment. If the falling edge arrives in phase segment 2, then it is considered early, and a negative phase error is assigned. If the falling edge occurs in the propagation segment, it is considered delayed, and a positive phase error is attributed. In the CAN bit timing unit, this phase error is used to enlarge or shorten the bit-time accordingly to ensure phase synchronization between the sender and receiver.

As shown in Figure [6.24,](#page-69-0) in the MOPS chip, the phase error e extracted by the CAN bit timing unit is forwarded to the control block of the automated trimming system, which is implemented as a digital Proportional–integral (PI) compensator. The phase error  $e$  is multiplied by a configurable constant  $K_P$  and, in addition, integrated into a special accumulation register and multiplied afterward by the configurable constant  $K_I$ . The result of this operation is forwarded as a 6-bit trimming code to the relaxation oscillator. The oscillator output clocks the whole MOPS chip and, in addition, is scaled down by a prescaler to the frequency, which gives rise to the required CAN bus data rate. The pre-scaler clocks the CAN bit-timing unit, which defines the correct time point to sample the CAN bus or transmit a new bit on the CAN bus.

<span id="page-68-0"></span>

Figure 6.22: Automated trimming procedure of the MOPS chip

<span id="page-68-1"></span>

Figure 6.23: Bit Timing defined in the CAN protocol

<span id="page-69-0"></span>

Figure 6.24: Block diagram of the automated frequency trimming system

When a pad signal configures automated trimming, the procedure starts after poweron-reset and lasts for fifteen CAN frames. After fifteen CAN frames, the trimming procedure is switched off. The trimming procedure can be restarted using the remote reset command by sending a message with ID '0' over the bus [\[56\]](#page-143-5).

Figure [6.25](#page-70-0) shows the trimming of the oscillator after power-up. The green box shows six trimming bits, while the blue box shows the actual frequency. Here, a digital model of the oscillator is used for simulation purposes. The 'rx' signal in the red box shows CAN bus transitions while the 'bus\_data' shows the actual message in hexadecimal. Figure [6.26](#page-70-1) shows the automated trimming upon remote reset. The first instance of trimming is done after power-up and then upon remote reset command. The cyan-colored box shows the 'read\_osc' signal going high after fifteen CAN frames on the bus to indicate the end of trimming.

### 6.4.9 Controller Area Network (CAN) protocol unit

The CAN protocol unit of the MOPS chip is called Canakari. Michael Karagounis developed it during his studies at the FH Köln [\[57\]](#page-143-6). The design was later improved over the years within the scope of the Bachelor's/Master's thesis. The digital logic was written in VHDL but later translated into Verilog by Philip Ledüc during his studies at FH Dortmund. The reason for translating the design into Verilog was to achieve Triple Modular Redundancy (TMR) implementation as the TMRG tool [\[58\]](#page-143-7) available from CERN only takes Verilog designs as input. Alexander Walsemann and Philip Ledüc from FH Dortmund contributed to verifying the CAN node part in the MOPS chip.

<span id="page-70-0"></span>

| clk                    |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|------------------------|--------------------------------|------------------|-----------------------------------------------------|--|----------------|---------------------|-------------------|----------------|----------|-------------------|-----------------------|--|
| $rst_n$                |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| rx                     |                                |                  | <b>DUTING CONTINUES ON A SERVICE OF LARGE STATE</b> |  | TOOOOOLO       |                     |                   |                |          | UUUUULI           | _UUULUUUUUUUUUUUUUUUU |  |
| ftrim osc (100000      |                                | mmmm             |                                                     |  | $W^{111}$      |                     | men nemmenmmen    |                | 111101   | 711               | 0 T T T T T           |  |
| [5]                    |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| $[4]_1$                |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| $[3]$ <sub>L</sub>     |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| [2]                    |                                | 电                |                                                     |  |                | ח ח חר              | <u>nn nin nin</u> |                |          |                   |                       |  |
| $[1]_1$                |                                |                  | ПΠ                                                  |  |                |                     |                   | ШU             |          |                   |                       |  |
| $[0]_1$                |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        | freq (12.1951                  | menn             | $10-$                                               |  | 1110<br>11.    |                     |                   | <b>ETEL 10</b> |          |                   | 11 10                 |  |
|                        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| customcanid (000       |                                | 555              |                                                     |  | 555  <br>$0 -$ |                     |                   |                | $0$ 1555 |                   |                       |  |
|                        | data (0000                     | aaaaaaaaaaaaaaaa |                                                     |  | 0. .           | aaaaaaaaaaaaaaaa    |                   |                | 0. .     | Taaaaaaaaaaaaaaaa |                       |  |
|                        | bus data (00000000000000000000 |                  |                                                     |  |                | 555aaaaaaaaaaaaaaaa |                   |                |          |                   |                       |  |
| ready osc <sub>1</sub> |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| txmops $\Gamma$        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        | sign_in 00000000000000000000   |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
| indic_sign_in          |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |
|                        |                                |                  |                                                     |  |                |                     |                   |                |          |                   |                       |  |

Figure 6.25: Automated oscillator trimming after power is supplied to the chip

<span id="page-70-1"></span>

Figure 6.26: Automated trimming after a remote reset command

<span id="page-71-0"></span>

Figure 6.27: CAN message handling by the MOPS chip

### 6.4.10 Data buffers

There are two data buffers in the chip to send and receive data. Both buffers are 75-bit wide to store the CAN frame identifier and actual data without the protocol overhead. Like the other digital logic, data buffers are also completely triplicated to protect against SEUs. Figure [6.27](#page-71-0) illustrates the chip's handling of incoming messages. If there is no ID match, the frame is checked for the remote reset command. If the ID of the node matches, the chip compares the message that is currently worked upon with the incoming message to decide priority. At any given time, the chip works on a single message with the highest priority.

## 6.5 Design techniques for a robust MOPS digital logic

The MOPS chip was developed entirely using industry-standard tools and known methods to implement a digital IC. However, the development of the complete logic at every stage of the development process is inspired by the fact that it should be very tolerant of SEUs and other failures. The SEU tolerance here refers more to the system-level reliability and operational stability than the mere corruption of some monitored values. Another important point is to make the logic small and as simple as possible to keep it economical regarding size, cost, and power consumption. Below are some important points about the MOPS digital part to make it robust.

• The complete digital design has been triplicated to make the chip tolerant to SEUs [\[58\]](#page-143-7). Figure [4.8](#page-39-0) shows the actual implementation. Every combinational
signal is triplicated and then fed to the sequential logic. There are three voters instead of just one to reduce further the probability of a Single Event Transient (SET) happening at the output of a single voter. In addition, the triplicated cells are placed with at least 15 µm [\[36\]](#page-141-0) apart to defend against multiple-bit upset where data of more than one storage cell gets corrupted, and there is no longer a possibility to correct it.

- Two watchdogs continuously monitor all four state machines on the chip. Their operation in detail is explained in Section [6.4.7.](#page-63-0) They protect against both cases where either the top-level logic gets locked due to an SEU/error or the CAN node configuration changes. Only the watchdog timer can send a sign-in message, but this can happen earliest after 5 s, the maximum time frame for all the DCS monitoring values to get updated. In case of an unknown failure, the watchdog timers can not spam the bus with unnecessary sign-in requests.
- A change in the frequency due to radiation/temperature changes can be detected by the MOPSHUB. The error registered by nodes on a CAN bus depicts the state of the communication. Due to a frequency change, many errors appear on the bus. A remote reset request from the MOPSHUB can be used to trim the chips once again without a power cycle. The automated trimming feature in the MOPS chip is a completely customized solution not defined by the CAN or CANopen protocol.
- Initialization of the digital logic is partly hardwired. Although the CAN node configuration must be written to the registers, it is defined using hardwired values in a combinational logic. These values are loaded during power-up, timeouts, or on a custom remote reset command. The chip neither requires a flash memory nor depends on any external interface/protocol to initialize.
- The design is kept modular for the CANopen implementation, the CAN node, OD, and the ADC to allow for easy integration of other constructs in the future. The functionality can easily be extended in the future.
- The digital design of the chip and the physical layer to communicate over the bus are independent of each other. This allows the user to choose an external standardized physical layer, and a commercial CAN controller can use the lowvoltage MOPS physical layer.

## 6.6 Improvements in the digital design and new features in the MOPSv2

The digital logic of the MOPSv1 chip had some functional problems. The problems in the MOPSv1 were not large enough to hinder the core functionality.

• The sign-in message is described in Section [6.3.2.](#page-52-0) Shown in Figure [6.6,](#page-53-0) the MSB of the sign-in message must be 0, and it toggles every time a new node guarding request is sent. The MOPS chip sends a sign-in message with the correct MSB, which is 0 and keeps sending this on the bus until it receives an acknowledgment. It is observed during the testing that if MOPS doesn't get an acknowledgment, then after a couple of sign-in messages on the bus, the bit toggles automatically, which should not be the case.

- There was a problem with the remote reset request functionality of the MOPSv1 chip. When an ID '0' is sent over the bus, then it should cause every MOPS chip connected to the bus to reset its state machines. After working on the reset request, the chip should respond with a node guarding response explained in Section [6.3.2,](#page-52-0) but the MOPSv1 chip sends back the last message stored in the buffer register. If the last message sent out by the MOPS chip was a node guarding message, then the MSB toggles for every new reset request.
- The watchdog timer on the chip acts only on the top-level state machine instead of all four state machines in the design.

In addition to all the bug fixes required for the MOPSv1, the chip also introduces some new features, which are listed below:

- Automated trimming of the Oscillator
- Refresh configuration registers automatically if the MOPS chip remains idle for more than 250 ms
- In case of a frequency deviation, there is a remote reset request to trim the oscillator without a power cycle

## 6.7 Digital logic verification and full-netlist simulation

As shown in Figure [6.28,](#page-74-0) the block diagram of the testbench has been written in the SystemVerilog to verify the chip's digital logic and run a full-netlist simulation of the whole chip before fabrication. The scheme of the testbench is such that a virtual CAN bus is defined as one that connects all four main entities. .

- A driver/generator acts like a master node that generates requests for the MOPS chip and generates constrained randomized messages. This module is very generalized and needs input about the message content from another block to generate CAN messages according to the standard.
- A large GUI-based FSM is defined using the Mentor Graphics HDL designer. The FSM can be easily modified using the GUI tool to change test scenarios, content and type of messages, randomization, and timing of the frames presented on the virtual bus.
- The MOPS chip is connected to the virtual CAN bus, where it reads every frame sent by the generator. The chip decides for itself whether the message

<span id="page-74-0"></span>

Figure 6.28: Abstract view of the full chip test bench

is important or not. If a message is important to the chip, it acts on it and responds.

• A third independent CAN node is also connected on the virtual bus to decode frames from either the generator or the MOPS chip.

An ADC dummy module provides values to each ADC channel on the chip, whereas another module probes other essential signals from the chip. The testbench generates thousands of CAN messages to test the chip's response. Some messages are targeted to test particular responses or functionality of the chip. In contrast, constrained randomized messages are also used to ensure the chip does not go into a bad state. A scoreboard is defined to automatically verify the functionality without looking into each signal in the simulation. The scoreboard prints all activity on the CAN bus and categorizes the MOPS response into good or bad.

## 6.8 Analog block

The analog part of the chip includes a power-on-reset circuit to ensure the digital part is in the correct start-up state. A bandgap reference circuit and a voltage regulator provide the chip with a stable reference voltage and supply voltage, respectively. The chip also includes an oscillator to locally generate a clock signal for the digital part. There is also a CAN physical layer to convert digital logic levels to appropriate voltages over the bus.

### 6.8.1 Power-on-Reset generator

<span id="page-75-0"></span>

Figure 6.29: Generation of an on-chip temperature and process independent Poweron-reset signal

A Power-on-Reset(POR) circuit on the chip ensures that the complete digital logic starts in a known good state. It generates a reset signal when the power to the chip is applied. The reset signal is kept in an active state until the supply voltage is completely stabilized. A process and temperature tolerant Kuijk cell bandgap reference [\[59\]](#page-143-0) based POR using Dynamic-Threshold(DTMOS) transistors for better matching [\[60\]](#page-143-1) has been developed for the MOPS chip. The concept of the POR is partly derived from [\[61\]](#page-143-2). Figure [6.29](#page-75-0) shows the basic circuit topology to implement the design where a Negative Temperature Coefficient (NTC) voltage and a Positive Temperature coefficient voltage (PTC) voltage are generated. The crossing point of the PTC and NTC voltages can be used to invert the output of a comparator circuit. Since the crossing point of PTC and NTC define the release point of the reset signal, the output of the POR circuit is temperature-independent. A temperature-independent source is mostly also process-independent because most of the process parameters are dependent on temperature [\[62\]](#page-143-3). The original POR using this approach was designed in [\[51\]](#page-143-4) for the PSPP chip [\[46\]](#page-142-0) using the Global foundries 130 nm CMOS process. This is a reuse of the original circuit adapted to TSMC 65 nm CMOS process.

A current through a forward-biased base-emitter of a bipolar transistor or a PN junction diode is defined as:

$$
I_c = I_s \exp\frac{V_{BE}}{V_T} \tag{6.1}
$$

Where

 $V_T$  is the temperature voltage which is equal to  $k-\frac{T}{T}$  $\frac{1}{q}$ .

Here, k is the Boltzmann's constant and q is the elementary charge.

 $I<sub>S</sub>$  is the saturation current which is proportional to  $\mu k T n_i^2$ .

 $\mu$  is the mobility of minority carriers and  $n_i$  is the intrinsic carrier concentration. Both  $\mu$  and  $n_i$  are temperature-dependent quantities.

$$
\mu \propto \mu_0 T^m \qquad \qquad with \qquad \qquad m \approx \frac{-3}{2} \tag{6.2}
$$

while

$$
n_i^2 \propto T^3 \exp \frac{-E_g}{kT} \qquad \text{with} \qquad E_g \approx 1.12 \text{eV} \tag{6.3}
$$

Whereas the saturation current is

$$
I_s = bT^{4+m} \exp \frac{-E_g}{kT}
$$
\n
$$
(6.4)
$$

To find the variation of  $V_{BE}$  concerning temperature, we can take the partial derivative of base-emitter voltage, which is

$$
V_{BE} = V_T \ln \frac{I_C}{I_S} \tag{6.5}
$$

<span id="page-77-0"></span>The partial derivative concerning temperature will be

$$
\frac{\partial V_{BE}}{\partial T} = \frac{V_{BE} - (4+m)V_T - \frac{E_g}{q}}{T}
$$
\n(6.6)

With  $V_{BE} \approx 750mV$  and  $T = 300K$  equation [6.6](#page-77-0) gives us the negative temperature coefficient of  $\approx -1.5 \frac{mV}{K}$  $\frac{nV}{K}$ . A  $\approx +1.5 \frac{mV}{K}$  $\frac{k}{K}$  is required to cancel the effect of the NTC and make it temperature independent [\[62\]](#page-143-3).

Shown in Figure  $6.30(a)$  $6.30(a)$ , the difference of the base-emitter voltage of two transistors operating at different current densities is directly proportional to the absolute temperature, so the voltage  $V_{BE}$  calculated in equation [6.9](#page-78-0) is a PTC voltage [\[63\]](#page-144-0).

<span id="page-77-1"></span>

(a) Positive Temperature Coefficient voltage generation

(b) A basic circuit to generate a bandgap voltage reference independent of temperature

Figure 6.30: PTC voltage and a basic bandgap reference voltage generation circuits

<span id="page-77-2"></span>
$$
\Delta V_{BE} = V_{BEQ2} - V_{BEQ1} \tag{6.7}
$$

$$
\Delta V_{BE} = V_T \ln \frac{nI_{Q1}}{I_{S2}} - V_T \ln \frac{I_{Q1}}{I_{S1}}
$$
\n(6.8)

<span id="page-78-0"></span>
$$
\frac{\partial \Delta V_{BE}}{\partial T} = \frac{k}{q} \ln n \tag{6.9}
$$

Equation [6.9](#page-78-0) exhibits a positive temperature coefficient.

In case more than one transistor is used in parallel, where  $m$  is the number of transistors, Equation [6.8](#page-77-2) then can be written as:

$$
\Delta V_{BE} = V_T \ln \frac{nI_{Q1}}{I_{S2}} - V_T \ln \frac{I_{Q1}}{mI_{S1}}
$$
\n(6.10)

$$
\frac{\partial \Delta V_{BE}}{\partial T} = \frac{k}{q} \ln(nm) \tag{6.11}
$$

Assuming that a temperature-independent voltage can be described by the following equation

<span id="page-78-1"></span>
$$
V_{TC0} = \alpha_1 V_{BE} + \alpha_2 (V_t \ln n) \tag{6.12}
$$

Where  $V_t \ln n$  is the PTC voltage, which results in the difference of base-emitter voltages. Then the values of  $\alpha_1$  and  $\alpha_2$  can calculated as explained below:

At room temperature equation [6.12](#page-78-1) gives us  $\frac{\partial V_{BE}}{\partial T}$  $rac{V_{BE}}{\partial T} \approx \frac{-1.5 mV}{K}$  $\frac{1.5mV}{K}$  and  $\frac{\partial V_T}{\partial T} \approx \frac{+0.087mV}{K}$ K Supposing  $\alpha_1 = 1$ ,  $\alpha_2$  must be chosen such that  $(\alpha_2 \ln n)(0.087 \frac{mV}{K})$  $\frac{mV}{K}$ ) = 1.5 $\frac{mV}{K}$  $\frac{K}{K}$ . This requires  $\alpha_2 \ln n \approx 17.2$ 

Equation [6.12](#page-78-1) then can be written as

$$
V_{TC0} \approx V_{BE} + 17.2 V_T \approx 1.25\tag{6.13}
$$

The circuit shown in Figure [6.30\(](#page-77-1)b) can be used to add the NTC and the PTC, which are  $V_{BE}$  and 17.2 $V_T$ , respectively. For simplicity, the transistors can be replaced by diodes. As shown in Figure [6.30\(](#page-77-1)a) if the voltages  $V_{BEQ1}$  and  $V_{BEQ2}$  are equalized then the voltage across resistor  $R3$  in Figure [6.30\(](#page-77-1)b) will be

$$
V_{R3} = V_{BE1} - V_{BE2} = V_T \ln n = V_{d1} - V_{d2}
$$
\n(6.14)

An equal  $R_1$  and  $R_2$  ensure that the voltage at points A and B are approximately equal. From the circuit in Figure [6.30\(](#page-77-1)b), the  $V_{REF}$  can be obtained as [\[62\]](#page-143-3):

$$
V_{REF} = V_{d2} + \frac{V_T \ln n}{R_3} (R_3 + R_2)
$$
\n(6.15)

$$
V_{REF} = V_{d2} + (V_T \ln n)(1 + \frac{R_2}{R_3})
$$
\n(6.16)

The bandgap reference circuit shown in Figure  $6.30(b)$  $6.30(b)$  can be used to generate a temperature and process independent POR signal for the digital part. To generate a POR signal, the feedback path can be removed, and a comparator can be used instead of an opamp to switch the output signal at the crossing point of PTC and NTC voltages. Figure [6.31\(](#page-79-0)b) shows the actual implementation in the chip where DTMOS

<span id="page-79-0"></span>

(a) Power-on-reset circuit based on bandgap reference using diodes

(b) Power-on-reset circuit based on bandgap reference using DTMOSFETS

Figure 6.31: Power-on-reset generation circuit for the MOPS chip

transistors are used for better matching. Two transistors in series are placed to achieve the desired voltage level. The circuit generates an asynchronous reset signal, which is released when the supply voltage to the digital part reaches approximately 930 mV. A comparator circuit designed by Tobias Froese for the CAN physical layer has been reused:

$$
V_p > V_n \quad then \quad V_{OUTPUT} = Logic high \quad or \quad V_{DD} \tag{6.17}
$$

else

$$
V_n > V_p \quad then \quad V_{OUTPUT} = Logic \, low \quad or \quad Gnd \tag{6.18}
$$

## 6.8.2 On-chip shunt LDO regulator

An on-chip LDO regulator with a PMOS pass-device that generates a 1.2 V output voltage for load currents of up to 200 mA was designed by Michael Karagounis (FH

<span id="page-80-0"></span>

Figure 6.32: Low-Drop Out Regulator [\[14\]](#page-139-0)

Dortmund). The minimum drop-out voltage is  $200 \text{ mV}$ , and the circuit is fully cascoded and can tolerate supply voltages up to  $2V$ . It is an adapted version of the regulator integrated into the pixel chip of the RD53 collaboration with reduced passdevice width [\[14\]](#page-139-0)[\[15\]](#page-139-1). As shown in Figure [6.32,](#page-80-0) the output voltage drop across the resistor R2 is fed to the non-inverting input of the error amplifier A1. The operational amplifier A1 compares the voltage to the reference voltage of 600 mV provided by a bandgap voltage reference circuit. For stabilization, an on-chip compensation circuit is integrated, which injects a charge into the middle tap of the resistive divider whenever a load or input voltage change happens. By this means, the circuit can be stabilized with an external low Equivalent Series Resistance (ESR) capacitor of  $2.2 \,\mathrm{\upmu F}$  [\[56\]](#page-143-5).

## 6.8.3 Bandgap reference generator

The bandgap voltage reference circuit is a reuse of the RD53 pixel chip [\[64\]](#page-144-1). It has a two-stage architecture to provide over-voltage tolerance and trimming capability simultaneously. The first stage is a bandgap reference voltage circuit with fully cascoded transistor structures that can tolerate supply voltages up to 2 V. Since trimming reduces the over-voltage tolerance, this bandgap reference avoids any trimming functionality, which results in low precision due to uncompensated process variations. This circuit is used as a reference of an LDO pre-regulator, which generates an output voltage of approximately 1.2 V and needs a 100 nF external compensation capacitor



Figure 6.33: 2-stage bandgap voltage reference circuit [\[56\]](#page-143-5)

at its output. The pre-regulator supplies a high-precision bandgap voltage reference circuit, which generates a reference bias current Iref and the primary regulator reference voltage Vref by means of an external resistor. Due to its high Power Supply Rejection Ratio (PSRR), the high-precision bandgap circuit can tolerate substantial supply voltage variations generated by the low precision of the pre-regulator bandgap. The high-precision bandgap output voltage can be trimmed in a region between 1.0 V and 1.32 V by a 4-bit trimming code provided externally via trimming pads. To avoid any parasitic feedback loop from the supply voltage of the trimming pads to the regulator, because the trimming pads are supplied by the regulator itself, level shifters were introduced [\[56\]](#page-143-5).

## 6.8.4 On-chip oscillator

Since in CAN signaling, data is transferred without propagating a clock; the MOPS chip needs its own oscillator for communication and clocking of sequential digital circuits. This clock signal is generated by the relaxation oscillator designed by Michael Karagounis. The circuit diagram is shown in Figure [6.34.](#page-82-0) The reference current  $I_{ref}$ provided by the bandgap reference is steered to charge one of the two capacitors  $C_1$  or  $C_2$ . Assuming the signal Q to be low, the capacitor  $C_1$  is charged until the voltage  $V_{C1}$  across capacitor  $C_1$  reaches the voltage  $V_{ref}$ , which is also provided by the bandgap reference. The voltage  $V_{C_1}$  is monitored by a comparator and asserts a logical high output signal as soon as the reference voltage is reached. This triggers a D-FlipFlop wired to generate set-reset functionality, which in turn sets the signal Q

<span id="page-82-0"></span>

Figure 6.34: Relaxation Oscillator [\[56\]](#page-143-5)

while it resets the signal  $Q_B$ . As a result, the current flow is steered towards capacitor  $C_2$  while the capacitor  $C_1$  is discharged by activation of the parallel connected NMOS. As soon as the voltage  $V_{CS2}$  across capacitor  $C_2$  reaches the voltage  $V_{ref}$ , a second comparator generates an output pulse which resets the D-FlipFlop and restarts the procedure described above, giving rise to an oscillation with a frequency of 10 MHz. The frequency can be calculated in general by the following equation:

$$
f = \frac{I_{ref}}{2CV_{ref}}\tag{6.19}
$$

The CAN protocol includes a bit timing re-synchronization mechanism, which allows for stable communication as long as the frequency deviation between the sending and receiving CAN node is less than three percent. To compensate for process variation influencing the oscillation frequency, a trimming option has been introduced. As shown in figure [6.35,](#page-83-0) the capacitors in the relaxation oscillator, which are charged and discharged during the oscillation operation, are split into a fixed and a configurable part. The configurable part consists of six capacitors of binary weighted capacitance with an NMOS switch transistor in series, resulting in a 6-bit trimming code. This trimming code can either be applied manually by a bonding option on the package level or automatically by a digital control loop that compares the phase shift between the incoming CAN data stream and the oscillator signal. The automatic procedure also allows for compensation of radiation and temperature-induced frequency shifts [\[56\]](#page-143-5).

<span id="page-83-0"></span>

Figure 6.35: Oscillator Trimming Option [\[56\]](#page-143-5)

#### 6.8.5 Custom low voltage CAN physical layer

The chip communicates over the CAN bus using the standard CAN bus communication protocol defined in the standard [\[49\]](#page-142-1) and explained in Section [6.3.1,](#page-48-0) but the electrical signals on the bus are not standard compliant. MOPS communicates using non-standard CAN high and CAN low signals. A level shifter is required to translate and isolate signals from a standard CAN physical layer to the MOPS physical layer.

- Recessive state > logic  $1 > CAN_H = CAN_L = 600$  mV
- Dominant state > logic  $0 > CAN_H = 1.2$  V and  $CAN_L = 0$  V

A low-voltage 1.2 V CAN physical layer has been developed by Tobias Fröse (FH Dortmund) for the chip to communicate over the CAN bus. The reason for the deviation from the standard is the incompatibility between the 5 V high-voltage signals as they are defined in the CAN standard, which require MOS transistors of thicker gate-oxides and radiation hardness demands which require MOS transistors of thinner gate-oxide.

The CAN receiver circuit shown in Figure [6.36](#page-84-0) is based on the LVDS receiver design of the FE-I4 pixel chip [\[65\]](#page-144-2) and consists of two parallel-operated comparator circuits with an NMOS and a PMOS input differential stage to allow a rail-to-rail commonmode input signal range. The transistors M4-M7 and M11-M14 in the decision circuit are not equally sized to introduce a hysteresis of at least  $35 \,\mathrm{mV}$ . The second stages, made of transistors M15-M18 and M19-M22, translate the low-swing signals from the first stages into full-swing CMOS signals and merge the two signal paths. A resistive divider is added to the  $CAN_H$  input port to safely detect a recessive bus state for differential input voltages lower than at least 30 mV. To match the RC-time, which is formed by the resistors at the  $CAN_H$  network and the parasitic capacitances of the input transistors, an additional resistor  $R<sub>L</sub>$  of the same resistance is connected to the  $CAN_L$  port.

The CAN driver circuit shown in Figure [6.37](#page-84-1) consists of a PMOS and an NMOS transistor, which are closed simultaneously to generate the dominant bus state and

<span id="page-84-0"></span>

Figure 6.36: Low-voltage CAN receiver circuit [\[56\]](#page-143-5)

<span id="page-84-1"></span>

Figure 6.37: Low-Voltage CAN driver circuit

to drive  $CAN_H$  to 1.2 V and  $CAN_L$  to 0 V. Both transistors are opened in the recessive bus state, and the bus lines reach the VREC potential of about 0.6 V, which is introduced to the driver using two high-ohmic resistors. To provide the same current drive capability, the PMOS transistor has been chosen three times wider than the NMOS to compensate for the lower charge mobility in PMOS transistors. As a result, a three times larger driver strength is chosen to control the PMOS gate than the NMOS gate, which is implemented by inverter chains [\[56\]](#page-143-5).

### 6.8.6 Analog to digital converter

To sample voltage monitoring values, the MOPS chip hosts a 12-bit Successive Approximation Register (SAR) based Analog-to-Digital Converter (ADC). The circuit is provided with a reference voltage of 900 mV, which provides a theoretical resolution of 219.72 µV per ADC count. The ADC is silicon proven to be radiation hard up to an ionizing dose of 500 Mrad. The circuit has been designed and tested by the CERN RD53 collaboration [\[66\]](#page-144-3). Shown in Figure [6.38](#page-85-0) is the block diagram of the MOPS ADC. It provides 40 channels. The MOPS digital logic can address all of them, but due to the limited number of pads available on the package, only 32 are used for external inputs.

<span id="page-85-0"></span>

Figure 6.38: Block diagram of the ADC used on the MOPS chip [\[67\]](#page-144-4)

# 6.9 Analog block modifications and improvements in the MOPSv2

- It was observed during testing of the MOPSv1 chip that the regulator had a problem with a fast variation of the load current. The regulator did not react fast enough to changes in the current consumption due to communication on the CAN bus. A transistor mismatch in the compensation circuitry of the regulator contributed to this problem. This has been corrected in the MOPSv2.
- The ADC requires a reference of  $450 \,\mathrm{mV}$  but it was supplied with  $600 \,\mathrm{mV}$  instead. This does not break the functionality completely but causes a large gain error, further reducing the ADC range at large input values.
- The MOPSv1 struggles to start at low temperatures when the ramp-up time of power is not very fast [\[19\]](#page-139-2). The slowest possible ramp-up time was 30 ms at −15 °C. The exact reason for this behavior is not clear. The problems mentioned in the following three points could have contributed to the start up problem on the chip.
	- Level shifters are missing on the bandgap trimming pads
	- ESD protection configuration done on the bandgap trimming pad is incorrect.
	- Parasitic voltage on different pads due to the fact that trimming bits are supplied by an external regulator, which is incorrect. They should be applied by the internal regulator. Connecting trimming bits to the external regulator means that even when the power is not supplied to the chip, the ESD protection diodes on the pads get activated. This can cause leakage current and parasitic voltage.

The start-up problem seen in the MOPSv1 was solved in the MOPSv2. As explained in Chapter [8,](#page-102-0) none of the MOPSv2 tested for the complete operational temperature range experienced any problems with the start-up.

# 6.10 Required external components and monitoring precision

The chip requires external passive electronic elements to correctly define the operation point and modes for different sub-blocks to monitor the serial powering chain. The monitoring precision of the chip is affected by several factors, among which the most important are below:

• The tolerance and topology of the required external components, e.g., resistor network, to translate large voltages from the serial chain to the input range of the ADC.

- Precision/stability of the reference voltage provided to the NTC resistors on the modules to measure the temperature.
- Radiation dose and the operational temperature of the chip. Both of these effects change the Bandgap reference voltage on the chip. A variation of voltage X in the bandgap reference results in a change of 2X in the voltage regulator output and 1.5X in the ADC reference.
- Integral and differential non-linearity of the on-chip ADC if not calibrated.
- Stable communication on the CAN bus. Errors on the bus can hinder real-time updates of the monitored values.

To meet the requirements described in Section [6.1,](#page-46-0) tolerance of the required external components is mentioned in Table [6.2.](#page-89-0) The absolute precision of the monitored values depends on the tolerance of all the components involved in the monitoring lines; the relative precision is much more important at the system level to follow precisely any changes in the temperature and the voltage of the modules during normal operation. For the voltage monitoring, the worst-case absolute error is expected in the last module in the chain due to a large dividing factor. Shown in Figure [6.39,](#page-88-0) the resistors  $R_A$  and  $R_B$  can have tolerances in the opposite direction. A negative tolerance of just 0.1% in  $R_B$  with a nominal value of 39 kΩ equates to a real value of 38.961 kΩ while a positive tolerance of 0.1% in  $R_A$  with a nominal value of 1 kΩ equates to a real value of  $1.1 \text{ k}\Omega$ . In the final system, a serial chain can have a maximum of 13 modules, which means the  $13^{th}$  module will have 26 V during the normal operation. Using the above tolerances, the chip will read  $651.26 \,\mathrm{mV}$  instead of an ideal  $650 \,\mathrm{mV}$ . An error of just  $1.26 \text{ mV}$  translates to an error of  $50 \text{ mV}$  at the module level due to a dividing factor of 40. Another significant problem is the change of reference voltage on the chip. As described in detail in the chapters [8](#page-102-0) and [9,](#page-114-0) the change of bandgap reference voltage due to temperature/radiation can introduce an absolute error of up to 300 mV in the worst case at the chain level. This must be calibrated to ensure precise monitoring.

Similarly, for temperature monitoring, the absolute error due to the tolerance of the resistors can be calculated. Figure [6.40\(](#page-89-1)a) shows the absolute error in temperature measurements due to the tolerance of the NTC resistor as well as the biasing resistor, which is referred to as  $R_{NTCREF}$  in the Figure [6.39.](#page-88-0) The x-axis represents the temperature of the modules in a serial powering chain, whereas the y-axis shows the maximum absolute error in °C using an NTC and the bias resistor of 1 % tolerance. Figure [6.40\(](#page-89-1)b) shows the measurement resolution at different temperatures. Here, the NTC's biasing resistor value is  $130 \text{ k}\Omega$ . The value is chosen to cover the complete temperature range and provide the maximum resolution at the expected operating temperature of  $-25$  °C for the experiment.

The relative precision is affected a lot by the radiation damage to the ADC itself; bandgap reference changes over time due to radiation/temperature and the nonlinearity of the ADC. Section [7.4](#page-97-0) describes the ADC characterization results in detail, considering all these factors.

<span id="page-88-0"></span>

81

<span id="page-89-1"></span>

(a) Absolute error in the temperature mea-(b) Measurement resolution at different tempersurement due to component tolerances atures

Figure 6.40: Factors affecting absolute precision of the temperature monitoring values from the MOPS chip

| Name         | Value                    | Max. Voltage | Max.Power                | <b>Tolerance</b> | Temp. Coefficient |
|--------------|--------------------------|--------------|--------------------------|------------------|-------------------|
| $R_A$        | $51 \,\mathrm{k}\Omega$  | 32V          | $1.3 \,\mathrm{\upmu W}$ | $0.10\%$         | $25$ ppm          |
| $R_B$        | $2\,\mathrm{M}\Omega$    | 32V          | $487 \,\mathrm{\upmu W}$ | $0.10\%$         | $25$ ppm          |
| $R_{NTCREF}$ | $130\,\mathrm{k}\Omega$  | 2V           | $40 \,\mathrm{\upmu W}$  | $0.10\%$         | $10$ ppm          |
| $R_{BUS}$    | $20\,\mathrm{k}\Omega$   | 2V           | $200 \,\mathrm{\upmu W}$ | $1\%$            | $100$ ppm         |
| $R_{BUS1}$   | $20\,\mathrm{k}\Omega$   | 2V           | $200 \,\mathrm{\upmu W}$ | $1\%$            | $100$ ppm         |
| $R_{SEN}$    | $360\,\mathrm{k}\Omega$  | 2V           | $4.8 \,\mathrm{\upmu W}$ | $1\%$            | $25$ ppm          |
| $R_{SEN1}$   | $200\,\mathrm{k}\Omega$  | 2V           | $3.2 \,\mathrm{\upmu W}$ | $1\%$            | $25$ ppm          |
| $R_{TERM}$   | $100\,\Omega$            | 1.2V         | $15 \,\mathrm{mW}$       | $1\%$            | $100$ ppm         |
| $R_1$        | $24.9\,\mathrm{k}\Omega$ | 2V           | $160 \,\mathrm{\upmu W}$ | $0.10\,\%$       | $10$ ppm          |
| $R_3$        | $20\,\mathrm{k}\Omega$   | 2V           | $200 \,\mathrm{\upmu W}$ | $0.10\%$         | $10$ ppm          |
| $R_4$        | $30\,\mathrm{k}\Omega$   | 2V           | $135 \,\mathrm{\upmu W}$ | $0.10\%$         | $10$ ppm          |
| $C_1$        | $6.8 \,\mathrm{\upmu F}$ | 6V           |                          | 10%              | <b>NPO</b>        |
| $C_2$        | $100\,\mathrm{nF}$       | 6V           |                          | 10%              | NPO               |
| $C_3$        | $2.2 \,\mathrm{\upmu F}$ | 6V           |                          | $10\%$           | <b>NPO</b>        |

<span id="page-89-0"></span>Table 6.2: Properties of the external components required by the MOPS chip

# Chapter 7

# Measurement setup and test results

<span id="page-92-0"></span>

Figure 7.1: Block diagram of the main testsetup used to test the MOPS chip

Several measurement setups have been used to characterize the chip depending on the environment and requirements of particular tests. Figure [7.1](#page-92-0) shows the concept of the main measurement setup, which was deployed for temperature measurements, irradiation, and ADC characterization. Because of the modular design, the setup could easily be disintegrated so that different boards could be connected via cables. This allows to put the board that hosts the chip inside a machine used for irradiation or temperature testing, whereas the other boards hosting all the commercial components could be placed on a desk next to the Raspberry Pi board. The same setup has been used to test other parameters at room temperature, e.g., automated trimming, testing with realistic services, and validation of the digital logic. Peter Kind did the layout for most of the required Printed Circuit Boards (PCB). Working towards their bachelor thesis, Julia Besproswanny and Dennis Neuhaus prepared different software required to test the MOPS chip [\[68\]](#page-144-5).

In addition to the main test setup, the regulator characterization is done using the existing test stand available in FH Dortmund. This was developed for the FE chip to test the regulator.

## 7.1 Hardware/software parts of the testsetup

The test setup consists of several hardware components like PCB boards, power supplies, and computers. The different boards are explained below:

• Raspberry Pi

#### Chapter 7 Measurement setup and test results

<span id="page-93-0"></span>

Figure 7.2: Test boards to host commercial components and the MOPS chip

A Raspberry Pi [\[69\]](#page-144-6) was chosen to benefit from the flexibility of the Linux system and already existing packages to speed up development and its small size to relocate the setup easily to different test facilities. Standard Python libraries are already available to communicate with commercial SPI devices, e.g., ADCs, DACs, MUX, and CAN controllers.

• Measurement Board

Seen in Figure [7.2\(](#page-93-0)red box), this board hosts all the commercial  $\text{ICs}, e.g.,$ DACs, ADCs, and communication controllers, to generate dummy signals for the MOPS chip and then sample the behavior. The board also offers a custombuilt DAC-controlled power supply to control the chip power remotely.

• Motherboard

As shown in Figure [7.2\(](#page-93-0)orange and blue boxes), the motherboard provides the interface on one side to the carrier board of the chip and on the other side to the measurement board. The board also has connectors to offer the possibility of connecting a dummy chain board.

• Carrier Board

The carrier board of the test setup is the one that hosts the actual chip. The board is shown in Figure [7.2\(](#page-93-0)yellow box). This has fan-out connections, which are used to wire-bond the chip. The board also hosts decoupling capacitors and resistors in the near vicinity of the chip.

• Dummy chain board

Shown in Figure [7.3,](#page-94-0) this board mimics the construction of an actual serial power chain. The board has variable resistors to depict detector modules and NTCs to provide temperature measurements as if they are on an actual module detector.

<span id="page-94-0"></span>

Figure 7.3: Test boards to host MOPS, translate CAN signals and provide test signals

## <span id="page-94-1"></span>7.2 Regulator, Bandgap reference and POR measurements

The regulator on the chip has been tested to pass the line and load regulation tests, and the startup behavior of the analog component has been sampled. All the plots shown in this section are only about the MOPSv2. Figure [7.4\(](#page-95-0)a) shows the line regulation measurements. The input supply voltage to the chip was increased from 0 V to the maximum tolerable range of  $2V$ . The regulator output rises when the supply voltage reaches  $\approx 0.5$  V. The regulated output starts already at  $\approx 1.25$  V and then stays constant until the supply voltage reaches  $2V$ . The bandgap reference also starts up as expected and then stays constant. The POR output is also very important during the line regulation to ensure a good known state for the digital logic during startup. Its output stays low until the supply voltage reaches  $\approx 930 \,\text{mV}$ . The POR then follows the regulator and stays constant.

Shown in Figure [7.4\(](#page-95-0)b) is the load regulation. The load current drawn by the regulator is increased from 0 to 200 mA. During the normal operation, the chip requires only  $10 \text{ mA}$  of current. Shown in Figure [7.4\(](#page-95-0)c), at the maximum specified current of 35 mA, the voltage drop at the regulator output is only  $\approx$  5 mV. The POR and the bandgap reference output remain constant during the load regulation test.

Transient testing has also been done for the regulator. This ensures the regulator can cope with the fast load current change during communication on the CAN bus. The load current changes abruptly when the chip actively participates in the communication and drives the CAN bus. Shown in Figure  $7.5(a)$  $7.5(a)$ , when the current changes from

<span id="page-95-0"></span>

(a) Regulator line regulation, startup of the bandgap and POR

(b) Regulator load regulation and behavior of the bandgap reference/POR



regulation

Figure 7.4: Output of the analog components at different supply voltages and load current

35 mA to 0 with a delay of only 10 µs, the maximum overshoot is  $\approx 40 \,\text{mV}$ . Figure [7.5\(](#page-96-0)b) shows the undershoot with a similar value when the load current changes from 0 to 35 mA. In the worst case, the regulator output takes 500 µs to recover. To fix this problem, more on-chip decoupling capacitance is foreseen in the next version of the chip. Figure [7.6](#page-96-1) shows the variation in the output voltage of the bandgap reference

<span id="page-96-0"></span>



(a) Maximum overshoot in the output voltage (b) Maximum undershoot in the output voltage

<span id="page-96-1"></span>Figure 7.5: Regulator output voltage at fast changing load current



Figure 7.6: Coupling of the bandgap reference and the 10 MHz on-chip oscillator

circuit. As explained in Chapter [6,](#page-46-1) the MOPS chip used a relaxation-based oscillator. The capacitor network in the oscillator is charged by the reference current generated by the bandgap reference. The time period of noise is  $\approx 100 \text{ ns}$ , which is exactly the time period of the oscillator output frequency. A solution to this problem is to add decoupling circuitry on the chip. This is foreseen for the MOPSv3.

## 7.3 Digital functionality

The chip's digital logic has been rigorously tested while carrying out the temperature and radiation testing explained in Chapters [8](#page-102-0) and [9,](#page-114-0) respectively. During temperature and radiation testing, each channel of the MOPS ADC was supplied with random values from a commercial 12-bit DAC and then read out by an independent

<span id="page-97-1"></span>



(a) Sign-in message sent by four MOPS on a single bus

(b) CAN bus decoding using an oscilloscope where the trimming messages and then a sign-in message by the MOPS can be seen

Figure 7.7: Automated trimming using four real MOPS chips on a single bus

commercial 12-bit ADC and the MOPS chip for comparison. The usefulness of the automated trimming feature introduced in the MOPSv2 was also demonstrated from results shown in chapters [8](#page-102-0) and [9,](#page-114-0) where auto trim was used without a power cycle to correct the frequency variation. In addition, some tests have also been done at room temperature to check the robustness of digital design at the system level, e.g., Automated trimming and a maximum of four chips on a single bus. Figure [7.7\(](#page-97-1)a) shows a sign-in message from four chips on a single bus. Figure [7.7\(](#page-97-1)b) shows activity on the CAN bus using an oscilloscope. After powering up, messages destined to trim the MOPS are decoded but marked as red to indicate missing acknowledgment bits. When the MOPS was correctly trimmed, it acknowledged the second last message and sent a sign-in message afterward.

## <span id="page-97-0"></span>7.4 ADC characterization

Detailed characterization of the on-chip ADC has been done at room temperature using the setup shown in Figure [7.1](#page-92-0) and MOPSv2 chip. A linear ramp-up method shown in Figure [7.8\(](#page-98-0)a) was used. A commercial 16-bit DAC with a reference voltage of 2.5 mV was used to provide an input voltage at the MOPS ADC channels, which can be incremented by  $38.14 \,\mathrm{\upmu V}$  per DAC count. A single ADC count from the MOPS chip itself equals  $219 \mu V$ . To reduce noise and make precise edges for the ADC counts, each code sent by the DAC and sampled by the MOPS was repeated five times before jumping to the next DAC count. Before calculating the Differential Non-Linearity (DNL) and Integral Non-Linearity (INL), a histogram and an average for the MOPS ADC counts were calculated to give clear transition points. Shown in Figure [7.8\(](#page-98-0)b) is the DNL. Because of the negative offset and the positive gain, the MOPSv2 loses  $\approx 75 \,\mathrm{mV}$  of measurable range. The most important takeaway from the DNL is that it stayed  $-1$  > DNL < 1. This proves no skipped counts and a monotonic behavior. The INL is shown in Figure [7.8\(](#page-98-0)c). Instead of calculating the INL by DNL summation, the INL was computed using the end-point line method to cross-check the calculated DNL. The yellow trace shows the standard error on the mean. The ADC used on the MOPS chip is the same as on the FE chip, but the calculated INL is worse for ADC counts larger than 3000. The INL calculated for the FE chip was 1.5 LSB [\[67\]](#page-144-4), shown in Figure [7.8.](#page-98-0) This could be due to the bandgap noise problem discussed in section [7.2](#page-94-1) as the ADC reference is provided by the bandgap reference circuit on the chip with a multiplying factor of 1.5.

<span id="page-98-0"></span>

Figure 7.8: Output of the analog components at different supply voltages and load current

## 7.5 Operation with realistic services

In the final system, a cable length of  $\approx 70 \,\mathrm{m}$  will be used to communicate data and supply power to the chip. The patch panel outside the detector volume uses 10 nF filter capacitors connecting the CANHigh and the CANLow line to the ground. They

#### Chapter 7 Measurement setup and test results

<span id="page-99-0"></span>

Figure 7.9: Rise/fall time of the CAN bus signals including extra filter capacitors 10 nF used in the system

introduce more delay to the rise/fall time of the signals on the CAN bus. Tests for the physical layer of the chip, including the filter capacitors, have been done. Figure [7.9](#page-99-0) shows the rise/fall time of the CANHigh/CANLow signals using an equivalent filter capacitance of 13.87 nF to the ground. The equivalent capacitance included 3.87 nF of the cable capacitance. The capacitance between the CANHigh/CANLow cables was  $4.15 \text{ nF}$ . Using a bus speed of  $125 \text{ kbit s}^{-1}$ , where each bit has a period of  $8 \text{ µs}$ , the chip proved to work reliably.

# <span id="page-102-0"></span>Chapter 8

# Operational temperature and performance

Both MOPSv1 and MOPSv2 were tested at different temperatures inside a climate chamber. The climate chamber tests aim to prove a reliable operation of the chip for the whole temperature range defined in the specification documents [\[18\]](#page-139-3) and [\[47\]](#page-142-2).

## 8.1 Scope of measurements

The operational requirements specified in the specification document require the MOPS chip to be tested at a temperature range of -40  $^{\circ}$ C to +60  $^{\circ}$ C. However, the chip may not cover the full temperature range during the nominal operation because the detector is expected to run between  $-25$  °C to  $-20$  °C. For the MOPSv1, a climate chamber was used which could go from  $-15\degree C$  to 60 °C whereas for the MOPSv2, Thermostream [\[70\]](#page-144-7) was used. The Thermostream machine can change the temperature very fast and is also able to provide a broad temperature range of −80 °C to more than 100 °C. The MOPSv2 was tested for −50 °C to 60 °C. The most important points to prove the chip's reliability for the complete temperature range.

- Start-up behavior of the chip, especially at cold temperatures
- Slow ramp-up of the power supply at different temperatures
- Change of frequency and the output of analog components
- Reliability studies/aging at high temperature of 60 °C

## 8.2 Startup performance

## 8.2.1 MOPSv1

The chip responded well but struggled to start at low temperatures using a slow ramp-up supply voltage. This can be seen in Figure [8.1,](#page-103-0) where the bandgap reference represented as VREF does not rise upon the power supply. The exact reason for the startup issue is still unclear, but there could be different reasons for that, e.g

#### <span id="page-103-0"></span>Chapter 8 Operational temperature and performance



Figure 8.1: MOPSv1 bandgap reference startup problem at low temperature

- Parasitic voltage of up to 400 mV at different analog signal pads. This was due to the fact that an external commercial regulator was used to trim the bandgap.
- Level shifters missing at the input of Bandgap trimming pads

More details about the MOPSv1 start-up problem are discussed in [\[19\]](#page-139-2). However, it is important to mention that the bandgap circuit used on the MOPS chip has proven to work completely fine for a large temperature range [\[64\]](#page-144-1). This startup issue for the MOPSv1 should not cause a problem in the final system. Although the exact reason for the problem was unclear, by fixing the issues mentioned before, the MOPSv2 experienced no startup problems.

## 8.2.2 MOPSv2

Three chips were used in total to do the startup tests. The chip was power cycled for a change of every 10 °C. A ramp-up time of minimum 600 ms was used for the power supply. None of the three chips showed any problems during the start-up.

### POR

Shown in Figure [8.2](#page-104-0) is the start-up behavior of the POR for three different chips. Chip 2 worked completely fine, whereas for chip 1, for an extreme temperature of  $-50$  °C, the POR was released at  $\approx 700 \,\text{mV}$ . Also, for chip 3, the release point of the POR output was  $\approx 600 \,\text{mV}$ . This is very low to ensure proper resetting of the digital logic; however, in the final system, the chip will not be actively cooled during operation. It is very unlikely that the chip will experience temperatures lower than −15 °C during operation [\[71\]](#page-144-8).

## Bandgap

Shown in Figure [8.3](#page-105-0) is the start-up behavior of the bandgap reference for three different chips. Two observations can be made from this plot. At colder temperatures, the

<span id="page-104-0"></span>

Figure 8.2: POR startup behavior at different temperatures

<span id="page-105-0"></span>

Figure 8.3: Bandgap/regulator startup behavior at different temperatures

bandgap circuit starts later than at warmer temperatures. At colder temperatures, a slight overshoot occurs before the output voltage settles down to the desired value after the recovery time.

#### Regulator

Shown in Figure [8.3](#page-105-0) is the start-up behavior of the regulator for three different chips. The regulator in all three chips started up perfectly fine. Because the bandgap started later at cold temperatures, the start-up of the regulator was also delayed. However, it did not cause any problems for the circuit.

## 8.3 Temperature cycling between −40 °C to 60 °C during operation

The same three chips used for the startup tests were also used during the temperature cycling test. While in operation, each chip was temperature cycled from the lowest to the maximum value of the operational temperature and vice versa. This means each chip was temperature-cycled two times. Before the tests, each chip was trimmed to the desired operational frequency of 10 MHz. The rate of temperature change was  $1\degree C \text{ min}^{-1}$ .

#### **Oscillator**

As shown in Figure [8.4,](#page-107-0) chip 1 was exceptionally good and had minimal variation for both cycles. The variation for chip 1 was small enough not to cause any communication problems on the CAN bus. The frequency deviation for both chip 2 and chip 3 shown in Figure [8.4](#page-107-0) was large enough to cause disturbance on the CAN bus communication. The communication problem started at the point where the plot became lighter with few sample points. For the chip 2, going from  $-40\degree C$  to 60 °C, it started around  $5^{\circ}$ C. whereas, going from 60 °C to  $-40^{\circ}$ C, it started at  $-30^{\circ}$ C. The plot for the chip 3 can be interpreted in the same way. Here, an important takeaway point is that the chip could communicate without errors on the bus as long as the on-chip frequency variation was not larger than 1.5 %. However, theoretically, it should be 3 %. The inflexibility of the software and incorrect setting of the level shifter used to translate the CAN bus signals were thought to have contributed towards intolerance towards change of frequency. Another interesting observation that can be made from the frequency plot is that the direction of deviation w.r.t. the temperature was not uniform for all three chips. e.g. for chip 2, going from  $-40\degree C$  to 60 °C, the frequency increased whereas it decreased for chip 3.

It is important to mention that almost every time the frequency variation was large enough to cause communication errors, a remote reset command could return the chip to the right frequency as long as the communication was not completely broken.

<span id="page-107-0"></span>

Figure 8.4: Change in the on-chip oscillator frequency w.r.t. temperature and current consumption
#### Current consumption

Shown in Figure [8.4](#page-107-0) is the current consumption of the same three chips. Because chip 1 had no communication problem due to frequency shift, the current consumption remained stable throughout the test and increased by only a couple of µA. The current consumption for both chip 2 and chip 3 increased from  $10 \text{ mA}$  to  $\approx 21 \text{ mA}$  during the time of communication problem explained earlier in the oscillator paragraph. The reason for more current consumption is that the chip continuously tried to drive the bus but with the wrong frequency.

#### POR

Figure [8.5](#page-108-0) shows the variation in the POR output. The POR continuously followed the regulator output. At all operational temperatures, no glitches were recorded in the output of the POR circuit.

<span id="page-108-0"></span>

Figure 8.5: POR output voltage variation w.r.t. temperature

#### Bandgap

The bandgap variation shown in Figure [8.6](#page-110-0) w.r.t. the operational temperature is critical to calibrate ADC measurements from the MOPS chip. As expected, a bandgap reference circuit is optimized for a particular temperature range where it exhibits the most temperature independence. It is evident from the plot from both chip 2 and chip 3 that voltage output shows the most temperature independence around  $\approx -20$  °C, which is expected to be the operational temperature of the final detector and the FE chip. The chip 1 looks exceptionally good. However, the independence point has shifted a bit towards  $0^{\circ}$ C to  $20^{\circ}$ C. A maximum variation of  $18 \text{ mV}$  was recorded in the bandgap output for the entire temperature range. This is shown as  $V_{ref}$  in Figure [6.33,](#page-81-0) after the reference current generator. Variation in the Core Bandgap circuit output is much smaller, which is reported in [\[64\]](#page-144-0).

#### Regulator

The regulator on the MOPS chip has an output voltage of two times the bandgap voltage. Shown in Figure [8.6](#page-110-0) is the variation in the regulator output. The variation in the output completely follows the pattern of the bandgap variation. The total variation in the regulator output is almost 36 mV, which is double the variation that can be seen in the output of the bandgap regulator.

## 8.4 Long term stability / reliability at  $60^{\circ}$ C

To test the reliability of the MOPS chip to continuously operate at high temperatures, one of the MOPSv2 chips is being operated in a climate chamber at 60 °C. The chip has completed more than 5000 h of operation without any problems. The tests are still ongoing.

## <span id="page-109-0"></span>8.5 Summary

The MOPSv2 chip met the operational temperature requirements defined in the section [6.1](#page-46-0) without problems. None of the three chips showed any problem with startup at any temperature. At extremely cold temperatures, the POR was released earlier than desired, but this is not of great concern. The chip in the final system will not be actively cooled, so realistic operational temperature is expected to be higher than  $-20$  °C when the detector is running. If the detector is not running, the ambient temperature can drop further. If the MOPS chip is power cycled at this stage, there is a little risk that the digital part may not start in the correct state due to the problem seen in the POR startup. The shift in the bandgap reference output changes the ADC reference as well. For the module voltage monitoring, a shift of  $20 \,\mathrm{mV}$  in the bandgap output can introduce an error of 6 ADC counts even in the module-tomodule relative voltage precision. Considering a voltage dividing factor of 40, this is equal to an error of  $52 \text{ mV}$  per module at the serial powering chain; therefore, the

<span id="page-110-0"></span>

Figure 8.6: Bandgap reference and regulator output voltage variation w.r.t. operational temperature

#### Chapter 8 Operational temperature and performance

temperature dependence of the bandgap reference voltage must be taken into account to calibrate voltage monitoring values from the MOPS chip. In addition to the factors mentioned in Section [6.10,](#page-86-0) the temperature dependence of the ADC reference must be calibrated to achieve absolute precision. To know the operational temperature of the chip in the final system, an NTC resistor shown in Figure [6.39](#page-88-0) will also be placed in the near vicinity of the chip. The value read from this NTC resistor is independent of the bandgap variation.

$$
V_{NTC} = VDD1V2 \frac{R_{NTC}}{R_{NTC} + R_{NTCREF}} \tag{8.1}
$$

Where VDD1V2 is equal to two times the bandgap reference  $V_{BG}$  output.  $V_{NTC}$  is the voltage across the NTC resistor.

$$
V_{NTC} = 2 \times V_{BG} \frac{R_{NTC}}{R_{NTC} + R_{NTCREF}} \tag{8.2}
$$

<span id="page-111-0"></span>
$$
V_M = \frac{V_{NTC}}{1.5 \times V_{BG}} = 1.33 \times \frac{R_{NTC}}{R_{NTC} + R_{NTCREF}} \tag{8.3}
$$

Here,  $V_M$  is the resultant voltage sampled by the MOPS ADC. Which can also be written in terms of ADC counts as:

$$
ADC = 4096 \times V_M \tag{8.4}
$$

From Equation [8.3,](#page-111-0) finding an approximate operational temperature of the chip is independent of bandgap variation. Hence, temperature monitoring of the modules in the serial powering chain also does not require temperature-dependent calibration.

# <span id="page-114-1"></span>Chapter 9

# Radiation tolerance of the MOPS chip

Several MOPS chips from both v1 and v2 versions have been irradiated under different conditions to check whether they meet the radiation tolerance requirements defined in Section [6.1.](#page-46-0) The behavior of the chip w.r.t. low dose, frequent power cycling, high temperature, and heavy ions were sampled during the campaign, lasting over several months.

# 9.1 Irradiation using X-rays

To ensure the chip can survive 500 Mrad of TID, the chips were irradiated using an X-ray machine [\[72\]](#page-144-1) available at the Physics Institute in Bonn.

#### 9.1.1 MOPSv1, frequent power cycling

Two X-ray campaigns using four chips were done for the first version of the chip. The motive for the first campaign using only one chip was to check whether the chip could survive 500 Mrad at all. Further details are provided in the bachelor thesis by Julia Besproswanny [\[68\]](#page-144-2).

In the second campaign, three chips were used. Table [9.1](#page-114-0) shows the three different chips's dose rates, total dose, and temperature. During the campaign, the chips were power cycled frequently at different levels of accumulated dose to rule out start-up issues.

|        |          | Total dose Operational temperature Dose rate |               |
|--------|----------|----------------------------------------------|---------------|
| Chip1  | 554 Mrad | $\sim$ (0 to 5) <sup>o</sup> C               | $1.75$ Mrad/h |
| Chip 2 | 509 Mrad | 40 $\degree$ C                               | $1.75$ Mrad/h |
| Chip 3 | 478 Mrad | 20 °C                                        | $1.75$ Mrad/h |

<span id="page-114-0"></span>Table 9.1: Parameters for the three MOPSv1 chips to test TID tolerance

Shown in Figure [9.1](#page-116-0) is the variation in the on-chip frequency and the current consumption w.r.t. of the chips mentioned in Table [9.1.](#page-114-0) The frequency of chip 1 looked perfect even after accumulating a TID of 554 Mrad. The chip was power cycled multiple times, and it showed no problems at all. The deviation remained within 1.5 %.

|                   |           | Total dose Operational temperature | Dose rate           |
|-------------------|-----------|------------------------------------|---------------------|
| Chip <sub>1</sub> | 33.8 Mrad | 0 °C                               | $40 \text{ krad/h}$ |
| Chip1             | 533 Mrad  | 0 °C                               | $1.75$ Mrad/h       |
| Chip 2            | 500 Mrad  | 20 °C                              | $1.75$ Mrad/h       |
| Chip <sub>3</sub> | 500 Mrad  | 40 °C                              | $1.75$ Mrad/h       |

<span id="page-115-0"></span>Table 9.2: Parameters for the three MOPSv2 chips to test TID tolerance

The current consumption of the chip remained constant throughout the process. For chip 2, the frequency plot in Figure [9.1](#page-116-0) shows two colors. The blue trace is when the chip could communicate perfectly fine, whereas the red trace is when the chip was only powered, being irradiated but not communicating on the CAN bus. The issue was later found to be the faulty connector to the CAN bus cable. In several instances, the frequency of chip 2 deviated more than 1.5 %, which is the cause of high current consumption when communication was enabled. This is due to the fact stated earlier that the chip continuously tries to drive the CAN bus without success. The frequency dropping completely at around 420 Mrad was due to the unstable power supply. The chip 3 had no problems communicating until around a TID of 437 Mrad. At this point, the current consumption spiked once again.

Figure [9.2](#page-117-0) shows the output of the same three MOPSv1 chips mentioned in Table [9.1](#page-114-0) and their frequency/current is shown in Figure [9.1.](#page-116-0) For chip 1, some dropping lines in the bandgap and regulator output are sampled during the power cycles of the chip. The bandgap and the regulator on chip 1 worked without any problems. Both chip 2 and chip 3 did not go very well with power cycling and had trouble with the slow ramp-up of the power supply. The behavior resembled the startup problem mentioned in Chapter [8.](#page-102-0) No radiation-dependent startup problem has been reported about the bandgap circuit used on the MOPS chip [\[64\]](#page-144-0).

#### 9.1.2 MOPSv2, low dose, high temperature

Three MOPSv2 chips were used during the third campaign; Table [9.2](#page-115-0) shows the dose rates, total dose, and temperature of the three different chips. Chip 1 was first irradiated at a critical low dose rate of 40 krad/h to accumulate a total of 33.8 Mrad of TID. Previous studies have proved more damage at low dose rates compared to higher ones [\[73\]](#page-144-3)[\[74\]](#page-145-0). After finishing the low dose rate campaign, chip 1 was then irradiated at a faster rate to accumulate a minimum of 500 Mrad. After accumulating the TID mentioned in Table [9.2,](#page-115-0) the temperature for both chip 2 and chip 3 was increased to 60 °C for some days to check the reliability. A vertical green line in the plots marks the point, where the irradiation stopped.

Figure [9.3](#page-119-0) shows the frequency and current consumption for all three chips during the campaign. For chip 1, breaks in the frequency plot happened due to a bug in the

<span id="page-116-0"></span>

Figure 9.1: MOPSv1 current consumption and frequency variation w.r.t. TID

<span id="page-117-0"></span>

Figure 9.2: MOPSv1 bandgap reference and regulator output voltage variation w.r.t. TID

software, but the chip itself was working completely fine and communicated the correct monitoring values. This can also be confirmed from the total current consumption plot. Compared to the MOPSv1 current consumption shown in Figure [9.1,](#page-116-0) the plot showing the current for the MOPSv2 gives an impression of the large variation in the current consumption and drops to a very low value. This was because of two problems with the test setup. The settings for the power supply on the test board were incorrect, and the sampling to log the current value was done faster than the circuit configuration could handle. For chip 2, at a TID of 455 Mrad, the deviation in the frequency increased to more than 1.5 %. At this point, a CAN communication error started to appear on the CAN bus. The behavior can also be confirmed by looking at the current consumption, which has increased. A remote reset command explained in Chapter [6](#page-46-1) was able to bring back the chip frequency to the desired state of 10 MHz. This saved a power cycle, which is very beneficial in the final system because power cycles introduce transients and noise in the system. Chip 3 also had frequency shifts large enough to cause disturbance in communication, but every time, a remote reset command or a power cycle brought the chip back to an operational state. Change in the current consumption is the only significant difference seen between high dose rate and low dose rate irradiation for chip 1 mentioned in Table [9.2.](#page-115-0) Figure [9.6](#page-122-0) shows the current consumption of the MOPSv2 under irradiation at a low dose rate of 40 krad/h. The plot [9.3\(](#page-119-0)b) shows the current consumption of the same chip at a dose rate of 1.75 Mrad/h. Although, TID at a high dose rate is 16 times larger, an increase in the current consumption of  $\approx 3 \text{ mA}$  can only be seen during the low dose rate operation. This has not been observed in the other five chips irradiated at a high dose rate.

The output voltage from the bandgap reference and the regulator is shown in Figure [9.4.](#page-120-0) The total variation in the bandgap output after 500 Mrad is  $\approx 3.33\%$  which is 0.58 % more than what is reported in [\[64\]](#page-144-0) but much better than the MOPSv1 bandgap, which uses an older version of the same circuit and has a variation of as much as  $5\%$ . Because of the  $2\times$  multiplication factor between the bandgap reference and the regulator, the variation in the regulator output was double compared to the bandgap. The drop in the regulator output voltage around 400 Mrad for chip 2 and after the complete dose for chip 3 was because the MOPS regulator output was routed and also used as the supply voltage for the commercial CAN controller. There was some extra circuitry with a resistor in series, which caused the sampling problem only when the CAN communication was not good. The regulator output itself was without problems. This was confirmed from other sampled signals-

Figure [9.5](#page-121-0) shows the POR circuit output for three MOPSv1 chips and three MOPSv2 chips. All six chips are the same, as explained in the previous sections. Apart from the powering problems that affected chips 2 and 3, the POR output strictly followed the regulator output without glitches.

<span id="page-119-0"></span>

Figure 9.3: MOPSv2 current consumption and frequency variation w.r.t. TID

<span id="page-120-0"></span>

Figure 9.4: MOPSv2 bandgap reference and regulator output voltage variation w.r.t. TID

<span id="page-121-0"></span>

Figure 9.5: POR output w.r.t. TID

<span id="page-122-0"></span>

Figure 9.6: MOPSv2 current consumption under low dose rate irradiation

## 9.2 SEU tolerance using Heavy Ions

To calculate the SEU cross section for the TSMC 65 nm process in general and to test the tolerance of the MOPSv2 chip, heavy ions were used to irradiate the chips. The tests were done at the heavy Ion irradiation facility in Cyclotron Resource Centre (CRC), the technology platform of the Université Catholique de Louvain.

#### 9.2.1 MOPSv2 performance

To ensure the MOPS chip can reliably provide the monitoring data, the chip must be able to tolerate the SEUs and the SETs. The digital logic does not facilitate a direct calculation of the SEU cross section for the chip because there is no real storage of monitoring data inside the chip. As soon as some new monitoring data becomes available, it stays for a very short period in a buffer before it is sent over the CAN bus. The configuration of the CAN node is the only critical storage inside the chip, but it is not possible to read it back to give an estimate of the flipped bits. The purpose of the SEU testing using heavy Ions was to confirm that the chip could survive without requiring any power cycles, latch-up, etc. For this purpose, the heaviest available Ion was chosen to irradiate the chip, which is Xenon with a LET of  $62.5 \,\mathrm{MeV/(mg/cm^2)}$  and a range of 73.1 µm. A maximum flux of  $1.5 \times 10^4$  ions/s cm<sup>2</sup> was used during the campaign. The chip was irradiated for a total of 3.5 h. During the irradiation, the current consumption of the chip remained within range, and the chip never went into a state where it stopped communicating or required a power cycle. However, the chip displayed an undesired behavior, continuously sending signin messages between the normal monitoring values. The messages are highlighted yellow in Figure [9.7.](#page-123-0) Although this did not hinder the chip from communicating monitoring values, this behavior can be problematic for the automated trimming procedure. A sign-in message indicates either a real reset glitch on the chip or a bug in the digital logic. If it was a real reset glitch, this could reset the trimming

#### <span id="page-123-1"></span>Chapter 9 Radiation tolerance of the MOPS chip

configuration, which is stored inside the chip after powering up. This can change the frequency, which will eventually cause the communication to stop. The automated trimming was not used during the irradiation campaign. Without a second SEU campaign and more verification, it is impossible to point out whether the problem is on the chip or the test boards.

<span id="page-123-0"></span>

| 1643960286.577297 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43. | 00. |    |     |                   |          | 00 00                                                                                                                         |                                        |
|-------------------|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|---|-----|-----|----|-----|-------------------|----------|-------------------------------------------------------------------------------------------------------------------------------|----------------------------------------|
| 1643960286.607510 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00. |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.635824 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | ൈ   | 24 | 13  |                   | 08       | 00 00                                                                                                                         |                                        |
| 1643960286.649598 | 0703 | S                                                                                                                                                                                  | DLC:  | 8 | 05  | രി  | 00 | 00  | 00                | 00       | 00                                                                                                                            | 00                                     |
| 1643960286.663581 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00. |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.686621 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | ൈ   |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.711896 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00  |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.734430 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00  |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.757210 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | ൈ   |    |     |                   |          |                                                                                                                               |                                        |
| 1643960286.779843 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | ൈ   |    |     |                   |          |                                                                                                                               |                                        |
| 1643960286.801259 | 0703 | $\overline{\mathsf{S}}$                                                                                                                                                            | DLC:  | 8 | 05  | 00  | 00 | -00 | -00-              | -00-     | 00 00                                                                                                                         |                                        |
| 1643960286.825226 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00  |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.847813 |      | S                                                                                                                                                                                  | DI C: | 8 | 43  | 00. |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.870477 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | ൈ   |    |     |                   |          | 00 00                                                                                                                         |                                        |
| 1643960286.893528 |      | S                                                                                                                                                                                  | DLC:  | 8 | 43  | 00  |    |     |                   | 07       | 00 00                                                                                                                         |                                        |
| 1643960286.916704 |      | S                                                                                                                                                                                  | DI C: | 8 | 43  | 00  | 24 | 1f  | 00.               | 09       | 00.                                                                                                                           | - 00                                   |
|                   |      | ID: 0583<br>ID: 0583<br>ID: 0583<br>ID:<br>ID: 0583<br>ID: 0583<br>ID: 0583<br>ID: 0583<br>TD: 0583<br>ID: 0583<br>ID:<br>ID: 0583<br>TD: 0583<br>ID: 0583<br>ID: 0583<br>ID: 0583 |       |   |     |     |    |     | 24<br>24 17<br>24 | 24 1e ff | 11 0d 08<br>24 12 df 07<br>ററ<br>24 14 00 09<br>24 15 80 08<br>24 16 60 09<br>00 09<br>1b 80 09<br>24 1c 20 09<br>24 1d 80 08 | 24 18 60 08 00 00<br>24 19 00 0a 00 00 |

Figure 9.7: Performance of the MOPSv2 chip under heavy Ion irradiation

#### 9.2.2 SEU cross section for 3000-bit shift register

The first test prototype chip, briefly explained in Chapter [5,](#page-42-0) was used for these tests. Because of the large beam diameter of 25 mm, a homogeneity of  $\pm 10\%$  and a small carrier board, it was possible to irradiate four chips in parallel. The deviation of the error rate between the four chips was insignificant. The cross-section for the 3000-bit shift register is shown in Figure [9.8\(](#page-124-0)a). The blue line indicates the cross-section calculated for the simple shift register, whereas the green points on the plot indicate the cross-section for the register, which implements full TMR. Even though the FE chip uses clock skew to mitigate problems due to SET and a minimum distance of  $15 \,\mathrm{\upmu m}$ between triplicated cells to mitigate multi-bit upsets, the cross-section calculated for the FE chip [\[36\]](#page-141-0) is similar to what is calculated by using the 3000-bit shift register chip. The 3000-bit shift register chip implements full triplication without a distance between the triplicated cells and the clock skew. Limited irradiation time was the only reason that hindered getting a complete plot at different LET Ions.

### 9.3 Summary

As already explained in Section [8.5,](#page-109-0) the variation in the bandgap output voltage does not affect the temperature monitoring, but to meet the voltage precision requirement to monitor the serial powering chain, calibration is required for the ADC values. The bandgap reference variation w.r.t. TID is not uniform but similar for four out of six MOPSv1/MOPSv2 chips.

As with temperature measurements, the automated frequency trimming feature introduced in MOPSv2 proved very helpful in coping with frequency variation w.r.t.

<span id="page-124-1"></span><span id="page-124-0"></span>

(a) SEU cross-section at different LET/ions using the 3000-bit shift register (b) SEU cross-section calculated for the global configuration memory of the FE chip [\[36\]](#page-141-0)

Figure 9.8: SEU cross-section for the TSMC 65 nm process

TID. The feature is beneficial for avoiding unnecessary power cycling. However, the reset glitch problem during the heavy Ion testing must be understood in more detail to not only reap the benefits of this feature but also to ensure the reliable operation of the chip.

# Chapter 10

# Packaging and production

## 10.1 Packaging experience

The size of the MOPS chip itself is only  $2\times 2mm^2$ , the package chosen for the chip is a  $9\times9$ mm<sup>2</sup> QFN package that offers 64 pins. The package size is limited by the number of required pins.

MOPSv1 A total of 45 MOPSv1 chips were packaged at the University of Bonn and the University of Bern. Although with some difficulties, the overall yield was good. 39 out of 45 chips worked completely fine, giving a yield of 86.6 %. The only major problem seen for the MOPSv1 QFN packaged chips during testing was the glitches on the reset signal. Because of this problem, it was not possible to reliably communicate with the chip. The reason for the glitches was the way the test socket was used on the boards and not the chip itself. A QFN test socket was used to test the packages, but it was not pin-compatible with already available test boards. An adapter board was developed to bridge connections between the test board and the QFN socket board. This workaround created long signal lines and decoupling capacitors far from the chip pads. The problem was fixed with a new test socket carrier board.

MOPSv2 Thirty chips were wire-bonded at the University of Bern. A lot of those 30 chips saw the problem with the lid attachment. This was because not enough pressure was applied during the lid-baking process. This was easily fixed by baking the chips once again with new lids and under the weight of an approx. Forty grams of copper cube.

Due to a bug in the bonding scheme, a high current consumption of up to approx. 40 mA was seen in those chips. Some pads were configured as bi-directional inside the chip. Depending on whether automated trimming is enabled/disabled, the pads act as output/input, respectively. To provide the code manually, trimming pads acted as input when the automated trimming mode was disabled. In this mode, the current had an average current consumption of  $8 \text{ mA}$ . The results from the MOPSv2 packaged chips helped to decide the default trimming mode for the pre-production chips.

## 10.2 Pre-production yield and QA/QC tests

A total of 400 MOPSv2 pre-production chips were produced and packaged.  $\approx 100$ chips were used for different test boards required for other tests. The first commercial packaging of the MOPS chip was done using a total of 330 chips for pre-production needs packaged by IMEC. This was enough to meet demand for all the subsystems. At the point of writing this thesis, 165 out of a total of 188 tested chips worked completely fine, which gives a yield of  $\approx 87.7\%$ . The remaining 23 chips failed the QA/QC test but may not be completely bad because it could be just a single value out of the defined good range. Shown in Figure  $10.1(a)$  $10.1(a)$  is the test socket to place packaged chips. [10.1\(](#page-129-0)b) shows a screenshot of the software used to check the quality of MOPSv2 pre-production chips

<span id="page-129-0"></span>

(a) Test socket for the packaged MOPS (b) Quality control software for the pre-production and prochips duction

Figure 10.1: Quality assurance/quality control tests of the pre-production chip

# 10.3 MOPSv3 for production

When writing this thesis, there is a plan to submit the third version of the chip called the MOPSv3. Most likely, this version will be used for the production as well. The MOPSv3 chip will get some updates to increase stability, and some problems in the digital logic will also be removed. Below mentioned are some of the updates:

- After the MOPSv2 fabrication, the bandgap reference circuit available from FE chip developers was updated. The new version of the bandgap has an improved Pre-regulator bandgap startup circuit. Tolerance towards SETs has been improved in the newer version. The newest version will most likely be used on the MOPSv3, but this is still under consideration.
- A 100 ns noise on the bandgap reference output is most likely because of the oscillator. A low pass filter will be integrated into the MOPSv3. This will also improve the precision of the ADC.
- $\bullet\,$  A large decoupling capacitor will be used on the chip to improve the regulator output.
- TMR implementation in the digital logic will be fixed where some voters are not in the right position, and also some critical feedback paths are missing

# Chapter 11 Conclusion and future work

The Monitoring of Pixel System (MOPS) chip has been developed to monitor temperature and voltage for the new ATLAS ITk Pixel Detector readout modules. The chip was developed using the TSMC 65 nm CMOS process. The TSMC 65 nm was chosen because the technology is inherently radiation-hard. The radiation hardness of the technology has been studied and characterized in detail for more than a decade by the particle physics community and chosen by CERN as the default technology for upcoming upgrades of different experiments. Three prototype MOPS chips were developed, from the first concept to the pre-production version. The chip offers a 12-bit Successive-Approximation-Register (SAR) based Analog-to-Digital (ADC) to read up to 34 channels with a resolution of  $219 \,\mu$ V. The chip uses CAN communication protocol to communicate monitoring data over a distance of more than 70 m. In addition to compliance with the standard CAN protocol, the chip uses a non-standard low-voltage CAN physical layer that works at 1.2 V. This is especially useful for modern low-voltage CMOS processes. The chip also provides some new features on top of the existing CAN protocol, where the on-chip oscillator can be calibrated via the CAN messages to cope with the process variation and operational conditions. This can be done at startup or by using the remote reset command to avoid unnecessary power cycling. The MOPS chip provides a minimal implementation of the CANopen standard to reduce size, cost, and complexity. In addition to full Triple Modular Redundancy (TMR), reset de-glitching, and frequent configuration refreshing, the minimal implementation of the CANopen in a hardwired logic also helps protect the chip against Single Event Effect (SEE). Rigorous testing of the chip under different operating conditions explained in Chapters [7,](#page-92-0) [8,](#page-102-0) and [9](#page-114-1) has proven the robustness of the chip to reliably provide the monitoring data and meet the requirements defined in Section [6.1.](#page-46-0) The chip has been tested for an operational temperature of −40 °C to 60 °C, irradiated for a Total Ionizing Dose (TID) of at least 500 Mrad up to a temperature of as high as 40 °C and also proved its SEU tolerance under heavy ion irradiation. The fourth prototype chip, called MOPSv3, is under development. The MOPSv3 will probably be used for the final production.

The MOPS chip can also be used as a general-purpose monitoring and control chip in other CERN experiments or high-radiation environments. The chip's digital logic has been designed to allow easy extension of other CANopen constructs to monitor and interface peripherals in the future. The chip currently only uses Service Data Objects (SDO), but integrating the Process Data Objects (PDO) can expedite the data throughput with the least protocol overhead. Because of modular design, an ADC that provides more resolution can also be integrated with little effort for highprecision applications.

# Acknowledgments

This work was funded in the context of "ErUM: Exploration of the Universe and Matter" by the Federal Ministry for Education and Research (BMBF). The work that I have done during my time as a PhD student wouldn't be possible without the help of several people. I would like to thank everyone who helped me continue through this phase of my studies.

My big thanks go to all three Professors who supervised me during the project and helped me develop professional skills.

Prof. Dr.-Ing. Dietmar Tutsch gave me the opportunity and accepted me as PhD student to write my thesis in the Electrical Engineering department. During our meetings, his suggestions helped me improve the orientation of the research work done throughout writing this thesis.

Prof. Dr. Christian. Zeitnitz trusted my skills/abilities and gave me the opportunity to work on this topic under his supervision in the particle physics department. His knowledge and experience in developing electronics for the particle physics experiments was beneficial. His ability to explain things such that a problem does not feel like a problem anymore helped a lot.

Prof. Dr. Michael Karagounis brought this interesting topic to my knowledge during my master's studies and later provided a lot of technical supervision and help as a collaboration partner in the project.

Many thanks go to all of my colleagues at the University of Wuppertal. Susanne Kersten and Dr. Tobias Flick, with their immense experience with the ATLAS ITk and contacts, helped a lot with scheduling/managing things on time, proofreading documents, and providing very helpful suggestions. Peter Kind helped a lot by developing test boards for the chip and with his experience in electronics. With his IC design experience, Dr. Niklaus Lehmanne was very helpful with all the design tools and building concepts. Dr. Marius Wensing always rescued me from going wild with Linux/FPGAs. Working towards his PhD, Ahmed Qamesh became a very valuable colleague to help test the MOPS chip at the system level. Casual discussions with Ahmed Qamesh, Dr. Carsten Dülsen, and Dr. Amine Elkhalii helped relieve the stress.

I would also like to thank all the colleagues from FH Dortmund, especially Alexander Walsemann and Jeremias Kampkötter, who helped a lot with the development and testing of the chip. Without Marco Vogt from the University of Bonn's help, the irradiation campaigns would have been more difficult. My thanks also go to Julia Besproswanny, Alexander Adam, and other students who helped with different tasks.

Many thanks go to my Father, Siblings, and my friends. They always encouraged me and gave me a positive energy to keep going.

A special thanks goes to my wife, Romana. Without her support and endurance, this would not have been possible.

# Bibliography

- [1] E. Lopienska, The CERN accelerator complex, layout in 2022: CERN-GRAPHICS-2022-001. [Online]. Available: https://cds.cern.ch/record/2800984/ (cit. on p. [2\)](#page-1-0).
- [2] CERN, "LHC: The guide: LHC Facts and Figures," p. 64, 2021. [Online]. Available: <http://press.cern/press-kit> (cit. on pp. [1,](#page-0-0) [2\)](#page-1-0).
- [3] ——, Experiments at CERN, 2022. [Online]. Available: [https:// home. cern/](https://home.cern/science/experiments) [science/experiments](https://home.cern/science/experiments) (cit. on p. [3\)](#page-10-0).
- [4] Irene Garcia Obrero, "Flyer HiLumi\_ENG\_Dec22," 2022. [Online]. Available: [https:// home. cern/ sites/ default/ files/2022 - 12/Flyer%20HiLumi\\_ENG\\_](https://home.cern/sites/default/files/2022-12/Flyer%20HiLumi_ENG_Dec22.pdf) Dec<sub>22.pdf</sub> (cit. on p. [3\)](#page-10-0).
- [5] R. M. Bianchi and ATLAS Collaboration, ATLAS experiment schematic illustration :  $ATLAS-PHOTO-2022-055$ . [Online]. Available: [https://cds.cern.ch/](https://cds.cern.ch/record/2837191/files/) [record/2837191/files/](https://cds.cern.ch/record/2837191/files/) (cit. on p. [5\)](#page-12-0).
- [6] T. A. Collaboration, "The ATLAS Experiment at the CERN Large Hadron Collider," Journal of Instrumentation, vol. 3, no. 08, S08003–S08003, 2008. DOI: [10.1088/1748-0221/3/08/S08003](https://doi.org/10.1088/1748-0221/3/08/S08003) (cit. on pp. [6–](#page-13-0)[8,](#page-15-0) [17,](#page-24-0) [19\)](#page-26-0).
- [7] L.Bouvet et al, ATLAS Inner Detector Services: Parameters describing the orientation naming and Parameters describing the orientation naming and layout of Inner Detector Services: ATL-IC-EP-0017, 2005. [Online]. Available: https://atlaslegacy.lbl.gov/ciocio/SCTcables/ID\_services\_identification [05December2005.pdf](https://atlaslegacy.lbl.gov/ciocio/SCTcables/ID_services_identification_05December2005.pdf) (cit. on p. [6\)](#page-13-0).
- [8] S. Mehlhase, "ATLAS detector slice (and particle visualisations)," animated versions, no. ATLAS-OUTREACH-2021-052, [Online]. Available: [https://cds.](https://cds.cern.ch/record/2770815/export/hx?ln=en) [cern.ch/record/2770815/export/hx?ln=en](https://cds.cern.ch/record/2770815/export/hx?ln=en) (cit. on p. [7\)](#page-14-0).
- [9] M. Nicolás Viaux, "The ATLAS 'New Small Wheel' Muon detector stations recently commissioned for LHC Run3," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 1045, p. 167 574, 2023, ISSN: 01689002. DOI:  $10.1016/j$ . [nima.2022.167574.](https://doi.org/10.1016/j.nima.2022.167574) [Online]. Available: [https://www.sciencedirect.com/science/](https://www.sciencedirect.com/science/article/pii/S016890022200866X) [article/pii/S016890022200866X](https://www.sciencedirect.com/science/article/pii/S016890022200866X) (cit. on p. [8\)](#page-15-0).
- [10] L. Meng, "ATLAS ITk Pixel Detector Overview," ATL-COM-ITK-, [Online]. Available: <http://arxiv.org/pdf/2105.10367> (cit. on p. [9\)](#page-16-0).
- [11] The ATLAS Collaboration, "Technical Design Report for the ATLAS Inner Tracker Pixel Detector: ATLAS-TDR-030," doi: [10.17181/CERN.FOZZ.ZP3Q](https://doi.org/10.17181/CERN.FOZZ.ZP3Q) (cit. on pp. [9,](#page-16-0) [10,](#page-17-0) [13,](#page-20-0) [14,](#page-21-0) [19,](#page-26-0) [39,](#page-46-2) [40\)](#page-47-0).
- [12] ——, Expected tracking and related performance with the updated ATLAS Inner Tracker layout at the High-Luminosity LHC: ATL-PHYS-PUB-2021-024. [Online]. Available: <http://cds.cern.ch/record/2776651/export/hx?ln=en> (cit. on p. [9\)](#page-16-0).
- [13] Mark Hendrik Standke, "Hybrid Pixel Readout Chip Verification, Characterization and Wafer Level Testing for the ATLAS-ITK Upgrade at the HL-LHC," PhD Dissertation, University of Bonn, Bonn, Germany, 2023 (cit. on p. [10\)](#page-17-0).
- [14] M. Karagounis, D. Arutinov, M. Barbero, et al., "An integrated Shunt-LDO regulator for serial powered systems," in 2009 Proceedings of ESSCIRC, IEEE, 2009, pp. 276–279, isbn: 978-1-4244-4355-0. doi: [10 . 1109 / ESSCIRC . 2009 .](https://doi.org/10.1109/ESSCIRC.2009.5325974) [5325974](https://doi.org/10.1109/ESSCIRC.2009.5325974) (cit. on pp. [10,](#page-17-0) [73\)](#page-80-0).
- [15] J. Kampkoetter, M. Karagounis, A. Stiller, et al., "Stabilization and Protection of the Shunt-LDO regulator for the HL-LHC pixel detector upgrades," in Proceedings of Topical Workshop on Electronics for Particle Physics — PoS(TWEPP2019), A. Kluge, Ed., Trieste, Italy: Sissa Medialab, 2-6 September 2019, p. 067. doi: [10.22323/1.370.0067](https://doi.org/10.22323/1.370.0067) (cit. on pp. [10,](#page-17-0) [73\)](#page-80-0).
- [16] S. Kersten, H. Borecka-Bielska, C. W. Chen, et al., "The ITk interlock hardware protection system," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 1045, p. 167 613, 2023, ISSN: 01689002. DOI: [10.1016/j.nima.2022.167613](https://doi.org/10.1016/j.nima.2022.167613) (cit. on pp. [14,](#page-21-0) [15\)](#page-22-0).
- [17] D. Blasco Serrano, M. Haahr Kristensen, A. Bay Madsen, et al., "Description and status of the EMCI-EMP interface," Journal of Instrumentation, vol. 17, no. 06, p. C06012, 2022. doi:  $10.1088 / 1748 - 0221 / 17 / 06 / \text{C}06012$ . [Online]. Available: [https: / /iopscience.iop. org / article / 10. 1088 / 1748 - 0221 / 17 / 06 /](https://iopscience.iop.org/article/10.1088/1748-0221/17/06/C06012/meta) [C06012/meta](https://iopscience.iop.org/article/10.1088/1748-0221/17/06/C06012/meta) (cit. on p. [15\)](#page-22-0).
- [18] M. Beretta Et al., DCS : Requirements Document for HL-LHC: ATLAS internal document, ATU-GE-ES-0004, EDMS Id: 2276493, 2020. [Online]. Available: <https://edms.cern.ch/document/2276493> (cit. on pp. [15,](#page-22-0) [39,](#page-46-2) [45,](#page-52-0) [95\)](#page-102-1).
- [19] R. Ahmad, "The Monitoring of Pixel System (MOPS) chip for the Detector Control System of the ATLAS ITk Pixel Detector," Journal of Physics: Conference Series, vol. 2374, no. 1, p. 012 094, 2022, ISSN: 1742-6588. DOI: [10.1088/1742-](https://doi.org/10.1088/1742-6596/2374/1/012094)  $6596/2374/1/012094$  (cit. on pp. [16,](#page-23-0) [79,](#page-86-1) [96\)](#page-103-0).
- [20] A. Qamesh, R. Ahmad, D. Ecker, et al., "System integration of ATLAS ITK Pixel DCS ASICs," Journal of Instrumentation, vol. 18, no. 05, p. C05003, 2023. doi: [10.1088/1748-0221/18/05/C05003](https://doi.org/10.1088/1748-0221/18/05/C05003) (cit. on pp. [16,](#page-23-0) [56,](#page-63-0) [60\)](#page-67-0).
- [21] B. Hallgren, H.Boterenbrood, H.J. Burckhart, H.Kvedalen, The Embedded Local Monitor Board (ELMB) in the LHC Front-end I/O Control System. [Online]. Available: <https://cds.cern.ch/record/530675/files/p325.pdf> (cit. on p. [17\)](#page-24-0).
- [22] European cooperation for space standardization, ECSS-E-ST-50-15C: CANbus extension protocol, 2200 AG Noordwijk The Netherlands, 1.05.2015. [Online]. Available: [https://ecss.nl/standard/ecss-e-st-50-15c-space-engineering-canbus](https://ecss.nl/standard/ecss-e-st-50-15c-space-engineering-canbus-extension-protocol-1-may-2015/)[extension-protocol-1-may-2015/](https://ecss.nl/standard/ecss-e-st-50-15c-space-engineering-canbus-extension-protocol-1-may-2015/) (cit. on pp. [17,](#page-24-0) [41\)](#page-48-0).
- [23] Renesas, Using CAN Bus Serial Communications in Space Flight Applications, 2018. [Online]. Available: [https://www.renesas.com/us/en/document/whp/](https://www.renesas.com/us/en/document/whp/using-can-bus-serial-communications-space-flight-applications) [using-can-bus-serial-communications-space-flight-applications](https://www.renesas.com/us/en/document/whp/using-can-bus-serial-communications-space-flight-applications) (cit. on pp. [18,](#page-25-0) [26\)](#page-33-0).
- [24] S. Agostinelli, J. Allison, K. Amako, et al., "Geant4—a simulation toolkit," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 506, no. 3, pp. 250– 303, 2003, issn: 01689002. doi: [10.1016/S0168-9002\(03\)01368-8.](https://doi.org/10.1016/S0168-9002(03)01368-8) [Online]. Available: <https://www.sciencedirect.com/science/article/pii/S0168900203013688> (cit. on p. [19\)](#page-26-0).
- [25] P. Nikiel, P. Moschovakos, and S. Schlenker, Eds., quasar : The Full-Stack Solution for Creation of OPC-UA Middleware: JACoW Publishing, Geneva,  $Switzerland$ , 2020. doi:  $10.18429/JACOW-ICALEPCS2019-MOPHA100$  (cit. on p. [19\)](#page-26-0).
- [26] Giulio Borghello, "Ionizing radiation effects in nanoscale CMOS technologies exposed to ultra-high doses," PhD dissertation, University of Udine, Italy, 8Mar 2019. [Online]. Available: <https://cds.cern.ch/record/2680840> (cit. on p. [26\)](#page-33-0).
- [27] A. Camplani, S. Shojaii, H. Shrimali, et al., "CMOS IC radiation hardening by design," Facta universitatis - series: Electronics and Energetics, vol. 27, no. 2, pp. 251–258, 2014, issn: 0353-3670. doi: [10.2298/FUEE1402251C](https://doi.org/10.2298/FUEE1402251C) (cit. on p. [26\)](#page-33-0).
- [28] F. Faccio, S. Michelis, D. Cornale, et al., "Radiation-Induced Short Channel (RISCE) and Narrow Channel (RINCE) Effects in 65 and 130 nm MOSFETs," IEEE Transactions on Nuclear Science, vol. 62, no. 6, pp. 2933–2940, 2015, ISSN: 0018-9499. DOI: [10.1109/TNS.2015.2492778](https://doi.org/10.1109/TNS.2015.2492778) (cit. on pp. [26,](#page-33-0) [27\)](#page-34-0).
- [29] F. Faccio and G. Cervelli, "Radiation-induced edge effects in deep submicron CMOS transistors," IEEE Transactions on Nuclear Science, vol. 52, no. 6, pp. 2413–2420, 2005, issn: 0018-9499. doi: [10.1109/ TNS.2005.860698](https://doi.org/10.1109/TNS.2005.860698) (cit. on p. [26\)](#page-33-0).
- [30] F. Faccio, Radiation effects in the electronics for CMS, 1999. [Online]. Available: [https://lhcb-elec.web.cern.ch/papers/radiation\\_tutorial.pdf](https://lhcb-elec.web.cern.ch/papers/radiation_tutorial.pdf) (cit. on pp. [27,](#page-34-0) [28\)](#page-35-0).
- [31] G. Borghello, F. Faccio, E. Lerario, et al., "Dose-Rate Sensitivity of 65-nm MOSFETs Exposed to Ultrahigh Doses," IEEE Transactions on Nuclear Sci-ence, vol. 65, no. 8, pp. 1482–1487, 2018, ISSN: 0018-9499. DOI: [10.1109/TNS.](https://doi.org/10.1109/TNS.2018.2828142) [2018.2828142](https://doi.org/10.1109/TNS.2018.2828142) (cit. on p. [27\)](#page-34-0).
- [32] T. C. May and M. H. Woods, "Alpha-particle-induced soft errors in dynamic memories," IEEE Transactions on Electron Devices, vol. 26, no. 1, pp. 2–9, 1979, issn: 0018-9383. doi: [10.1109/T-ED.1979.19370](https://doi.org/10.1109/T-ED.1979.19370) (cit. on p. [27\)](#page-34-0).
- [33] K. J. Hass and J. W. Ambles, "Single event transients in deep submicron CMOS," in 42nd Midwest Symposium on Circuits and Systems (Cat. No.99CH36356), IEEE, 2000, pp. 122–125, isbn: 0-7803-5491-5. doi: [10.1109/MWSCAS.1999.](https://doi.org/10.1109/MWSCAS.1999.867224) [867224](https://doi.org/10.1109/MWSCAS.1999.867224) (cit. on p. [28\)](#page-35-0).
- [34] P. J. McNulty, W. G. Abdel-Kader, and J. E. Lynch, "Modeling charge collection and single event upsets in microelectronics," Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms, vol. 61, no. 1, pp. 52–60, 1991, ISSN: 0168-583X. DOI:  $10.1016/0168$ -[583X\(91\)95560-Z.](https://doi.org/10.1016/0168-583X(91)95560-Z) [Online]. Available: [https://www.sciencedirect.com/science/](https://www.sciencedirect.com/science/article/pii/0168583X9195560Z) [article/pii/0168583X9195560Z](https://www.sciencedirect.com/science/article/pii/0168583X9195560Z) (cit. on p. [29\)](#page-36-0).
- [35] N. N. Mahatme, N. J. Gaspard, S. Jagannathan, et al., "Impact of Supply Voltage and Frequency on the Soft Error Rate of Logic Circuits," IEEE Transactions on Nuclear Science, vol. 60, no. 6, pp.  $4200-4206$ ,  $2013$ , ISSN:  $0018-9499$ . DOI: [10.1109/TNS.2013.2288782](https://doi.org/10.1109/TNS.2013.2288782) (cit. on p. [29\)](#page-36-0).
- <span id="page-141-0"></span>[36] M. Menouni, P. Barrillon, L. Flores, et al., "Single event effects testing of the RD53B chip," Journal of Physics: Conference Series, vol. 2374, no. 1, p. 012 084, 2022, issn: 1742-6588. doi: [10.1088/1742-6596/2374/1/012084](https://doi.org/10.1088/1742-6596/2374/1/012084) (cit. on pp. [29,](#page-36-0) [65,](#page-72-0) [116,](#page-123-1) [117\)](#page-124-1).
- [37] C.-H. Chen, P. Knag, and Z. Zhang, "Characterization of Heavy-Ion-Induced Single-Event Effects in 65 nm Bulk CMOS ASIC Test Chips," IEEE Transactions on Nuclear Science, vol. 61, no. 5, pp. 2694–2701, 2014, issn: 0018-9499. doi: [10.1109/TNS.2014.2342872](https://doi.org/10.1109/TNS.2014.2342872) (cit. on p. [29\)](#page-36-0).
- [38] M. Huhtinen and F. Faccio, "Computational method to estimate Single Event Upset rates in an accelerator environment," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 450, no. 1, pp.  $155-172$ , 2000, ISSN: 01689002. DOI: [10.1016/S0168-9002\(00\)00155-8](https://doi.org/10.1016/S0168-9002(00)00155-8) (cit. on p. [29\)](#page-36-0).
- [39] FAIRCHILD Semiconductor, Understanding Latch-Up in Advanced CMOS Logic: AN-600, 1999. [Online]. Available: [http://large.stanford.edu/courses/2015/](http://large.stanford.edu/courses/2015/ph241/clark2/docs/AN-600.pdf) [ph241/clark2/docs/AN-600.pdf](http://large.stanford.edu/courses/2015/ph241/clark2/docs/AN-600.pdf) (cit. on p. [29\)](#page-36-0).
- [40] R. Velazco, P. Fouillat, and R. A. L. Reis, Radiation effects on embedded systems. Dordrecht: Springer, 2007, isbn: 9781402056451 (cit. on pp. [30,](#page-37-0) [31\)](#page-38-0).
- [41] N. S. Saks, M. G. Ancona, and J. A. Modolo, "Radiation Effects in MOS Capacitors with Very Thin Oxides at 80degK," IEEE Transactions on Nuclear Science, vol. 31, no. 6, pp. 1249–1255, 1984, ISSN: 0018-9499. DOI:  $10.1109/$ [TNS.1984.4333491](https://doi.org/10.1109/TNS.1984.4333491) (cit. on p. [30\)](#page-37-0).
- [42] N. S. Saks, M. G. Ancona, and J. A. Modolo, "Generation of Interface States by Ionizing Radiation in Very Thin MOS Oxides," IEEE Transactions on Nuclear Science, vol. 33, no. 6, pp. 1185–1190, 1986, ISSN: 0018-9499. DOI:  $10.1109/$ [TNS.1986.4334576](https://doi.org/10.1109/TNS.1986.4334576) (cit. on p. [30\)](#page-37-0).
- [43] W. Snoeys, F. Faccio, M. Burns, et al., "Layout techniques to enhance the radiation tolerance of standard CMOS technologies demonstrated on a pixel detector readout chip," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 439, no. 2-3, pp. 349–360, 2000, ISSN: 01689002. DOI: 10.1016 / S0168-[9002\(99\)00899-2.](https://doi.org/10.1016/S0168-9002(99)00899-2) [Online]. Available: [https://www.sciencedirect.com/science/](https://www.sciencedirect.com/science/article/pii/S0168900299008992) [article/pii/S0168900299008992](https://www.sciencedirect.com/science/article/pii/S0168900299008992) (cit. on p. [31\)](#page-38-0).
- [44] R. Velazco, D. Bessot, S. Duzellier, et al., "Two CMOS memory cells suitable for the design of SEU-tolerant VLSI circuits," IEEE Transactions on Nuclear Science, vol. 41, no. 6, pp. 2229–2234, 1994, ISSN: 1558-1578. DOI: [10.1109/23.](https://doi.org/10.1109/23.340567) [340567](https://doi.org/10.1109/23.340567) (cit. on p. [31\)](#page-38-0).
- [45] T. Calin, M. Nicolaidis, and R. Velazco, "Upset hardened memory design for submicron CMOS technology," IEEE Transactions on Nuclear Science, vol. 43, no. 6, pp. 2874–2878, 1996, issn: 1558-1578. doi: [10.1109/23.556880](https://doi.org/10.1109/23.556880) (cit. on p. [31\)](#page-38-0).
- [46] N. Lehmann, "Development of a Detector Control System Chip," Dissertation, Wuppertal, 2019. doi:  $10.25926/m1ns-xc90$ . [Online]. Available: [urn:nbn:de:](urn:nbn:de:hbz:468-20191113-112127-6) [hbz:468-20191113-112127-6](urn:nbn:de:hbz:468-20191113-112127-6) (cit. on pp. [35,](#page-42-1) [36,](#page-43-0) [40,](#page-47-0) [69\)](#page-76-0).
- [47] S.Kersten Et al., Specifications for the ITk Pixel DCS controller chip: ATLAS internal document, AT2-IP-ES-0001, 2021. [Online]. Available: [https://edms.](https://edms.cern.ch/document/1909505) [cern.ch/document/1909505](https://edms.cern.ch/document/1909505) (cit. on pp. [39,](#page-46-2) [95\)](#page-102-1).
- [48] A. Brown, Arty A7 Digilent Reference, 28/02/2023. [Online]. Available: [https:](https://digilent.com/reference/programmable-logic/arty-a7/start?redirect=1) [//digilent.com/reference/programmable-logic/arty-a7/start?redirect=1](https://digilent.com/reference/programmable-logic/arty-a7/start?redirect=1) (cit. on p. [40\)](#page-47-0).
- [49] International Organization for Standardization, ISO 11898-1:2015: Road vehicles — Controller area network  $(CAN)$  — Part 1: Data link layer and physical signalling, 2015-12 (cit. on pp. [41,](#page-48-0) [76\)](#page-83-0).
- [50] CAN in Automation, CANopen application layer and communication profile, 2011. [Online]. Available: <https://www.can-cia.org/groups/specifications/> (cit. on p. [41\)](#page-48-0).

#### Bibliography

- [51] A. Rizwan, Analog and Digital CMOS Circuit Design for the Control System of ATLAS Pixel Detector: Masterthesis, 2018. doi: [10.26205/opus-3055.](https://doi.org/10.26205/opus-3055) [Online]. Available: [https://opus.bsz-bw.de/fhdo/frontdoor/index/index/start/1/rows/](https://opus.bsz-bw.de/fhdo/frontdoor/index/index/start/1/rows/10/sortfield/score/sortorder/desc/searchtype/simple/query/analog+and+digital+/facetNumber_year/all/yearfq/2018/docId/3055) [10/ sortfield/ score/ sortorder/desc/ searchtype/ simple/ query/analog+and+](https://opus.bsz-bw.de/fhdo/frontdoor/index/index/start/1/rows/10/sortfield/score/sortorder/desc/searchtype/simple/query/analog+and+digital+/facetNumber_year/all/yearfq/2018/docId/3055) [digital+/facetNumber\\_year/all/yearfq/2018/docId/3055](https://opus.bsz-bw.de/fhdo/frontdoor/index/index/start/1/rows/10/sortfield/score/sortorder/desc/searchtype/simple/query/analog+and+digital+/facetNumber_year/all/yearfq/2018/docId/3055) (cit. on pp. [42,](#page-49-0) [69\)](#page-76-0).
- [52] Siemens Digital Industries Software, HDL Designer Visualizing Complex RTL Designs, 24/09/2023. [Online]. Available: [https: / / eda. sw. siemens. com / en -](https://eda.sw.siemens.com/en-US/ic/hdl-designer/) [US/ic/hdl-designer/](https://eda.sw.siemens.com/en-US/ic/hdl-designer/) (cit. on p. [49\)](#page-56-0).
- [53] Mentor Graphics Corporation, ModelSim Reference Manual. [Online]. Available: [https://www.microsemi.com/document-portal/doc\\_view/131617-modelsim](https://www.microsemi.com/document-portal/doc_view/131617-modelsim-reference-manual)[reference-manual](https://www.microsemi.com/document-portal/doc_view/131617-modelsim-reference-manual) (cit. on p. [49\)](#page-56-0).
- [54] IEEE Std 1364-2005 (Revision of IEEE Std 1364-2001): IEEE Standard for Verilog Hardware Description Language. IEEE, 2006, isbn: 9780738148519 (cit. on p. [49\)](#page-56-0).
- [55] IEEE Standard for SystemVerilog–Unified Hardware Design, Specification, and Verification Language, Piscataway, NJ, USA. DOI: 10.1109 / IEEESTD.2013. [6469140](https://doi.org/10.1109/IEEESTD.2013.6469140) (cit. on p. [49\)](#page-56-0).
- [56] Hucheng Chen, Final Design Review / Production Readiness Review of the ITk Pixel MOPS ASIC: ATU-TC-ER-0268: ATLAS internal document. [Online]. Available: <https://edms.cern.ch/document/2801991> (cit. on pp. [62,](#page-69-0) [73–](#page-80-0)[78\)](#page-85-0).
- [57] Michael A. Karagounis, Design eines CAN Controllers mit VHDL und Spec-Charts: Diplomarbeit, Cologne, Germany, 2000 (cit. on p. [62\)](#page-69-0).
- [58] S. Kulis, "Single Event Effects mitigation with TMRG tool," Journal of Instrumentation, vol. 12, no. 01, pp. C01082–C01082, 2017. DOI:  $10.1088/1748$ - $0221/12/01/C01082$  (cit. on pp. [62,](#page-69-0) [64\)](#page-71-0).
- [59] K. E. Kuijk, "A precision reference voltage source," IEEE Journal of Solid-State Circuits, vol. 8, no. 3, pp. 222-226, 1973, ISSN: 1558-173X. DOI: [10.1109/JSSC.](https://doi.org/10.1109/JSSC.1973.1050378) [1973.1050378](https://doi.org/10.1109/JSSC.1973.1050378) (cit. on p. [69\)](#page-76-0).
- [60] A.-J. Annema, "Low-power bandgap references featuring DTMOSTs," IEEE Journal of Solid-State Circuits, vol. 34, no. 7, pp. 949–955, 1999, issn: 00189200. DOI:  $10.1109/4.772409$  (cit. on p. [69\)](#page-76-0).
- [61] T. Tanzawa, "A process- and temperature-tolerant power-on reset circuit with a flexible detection level higher than the bandgap voltage," in 2008 IEEE International Symposium on Circuits and Systems (ISCAS), IEEE, 2008, pp. 2302– 2305, isbn: 978-1-4244-1683-7. doi: [10 . 1109 / ISCAS . 2008 . 4541914](https://doi.org/10.1109/ISCAS.2008.4541914) (cit. on p. [69\)](#page-76-0).
- [62] B. Razavi, Design of analog CMOS integrated circuits, Internat. ed., [Nachdr.], ser. McGraw-Hill series in electrical and computer engineering. Boston, Mass.: McGraw-Hill, 20]11, isbn: 978-0071188395 (cit. on pp. [69](#page-76-0)[–71\)](#page-78-0).
- [63] D. Hilbiber, "A new semiconductor voltage standard," in 1964 IEEE International Solid-State Circuits Conference. Digest of Technical Papers, IEEE, 1964, pp. 32–33. doi: [10.1109/ISSCC.1964.1157541](https://doi.org/10.1109/ISSCC.1964.1157541) (cit. on p. [70\)](#page-77-0).
- [64] G. Traversi, L. Gaioni, M. Manghisoni, et al., "A Rad-Hard Bandgap Voltage Reference for High Energy Physics Experiments," in Applications in Electronics Pervading Industry, Environment and Society, ser. Lecture Notes in Electrical Engineering, S. Saponara and A. de Gloria, Eds., vol. 627, Cham: Springer International Publishing, 2020, pp. 19–24, isbn: 978-3-030-37276-7. doi: [10.](https://doi.org/10.1007/978-3-030-37277-4{\textunderscore }3) [1007/978-3-030-37277-4{\textunderscore}3](https://doi.org/10.1007/978-3-030-37277-4{\textunderscore }3) (cit. on pp. [73,](#page-80-0) [96,](#page-103-0) [102,](#page-109-0) [108,](#page-115-0) [111\)](#page-118-0).
- [65] M. Garcia-Sciveres, D. Arutinov, M. Barbero, et al., "The FE-I4 pixel readout integrated circuit," Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol. 636, no. 1, S155–S159, 2011, ISSN: 01689002. DOI: [10.1016/j.nima.2010.04.](https://doi.org/10.1016/j.nima.2010.04.101) [101.](https://doi.org/10.1016/j.nima.2010.04.101) [Online]. Available: [https://www.sciencedirect.com/science/article/pii/](https://www.sciencedirect.com/science/article/pii/S0168900210009551) [S0168900210009551](https://www.sciencedirect.com/science/article/pii/S0168900210009551) (cit. on p. [76\)](#page-83-0).
- [66] M. Garcia-Sciveres, The RD53A Integrated Circuit: CERN-RD53-PUB-17-001. [Online]. Available: [https://cds.cern.ch/ record/2287593/export/hx?ln=en](https://cds.cern.ch/record/2287593/export/hx?ln=en) (cit. on p. [78\)](#page-85-0).
- [67] Mohsine Menouni, ATLAS ITk FE chip ADC characterization : Personal com-munication: email, 2022 (cit. on pp. [78,](#page-85-0) [91\)](#page-98-0).
- [68] Julia Besproswanny, Testing the Radiation Hardness of the Monitoring Chip for the new ATLAS Pixel Detector: Bachelor thesis- Particle physics department, Wuppertal, 2020 (cit. on pp. [85,](#page-92-0) [107\)](#page-114-0).
- [69] R. P. Foundation, Teach, learn, and make with the Raspberry Pi Foundation, 24/09/2023. [Online]. Available: <https://www.raspberrypi.org/> (cit. on p. [86\)](#page-93-0).
- [70] Thermal Test Systems | Temptronic ThermoStream Temperature Cycling Systems, 18/09/2023. [Online]. Available: [https://www.intestthermal.com/temptron](https://www.intestthermal.com/temptronic/thermostream)ic/ [thermostream](https://www.intestthermal.com/temptronic/thermostream) (cit. on p. [95\)](#page-102-0).
- [71] Prof. Dr. C. Zeitnitz, Detector operation : Personal Communication, 2022 (cit. on p. [96\)](#page-103-0).
- [72] Ahmed Qamesh, "X-ray Irradiation and Calibration of the RD53A Pixel Readout Chip," Master thesis, University of Bonn, Physics Institute, 2019 (cit. on p. [107\)](#page-114-0).
- [73] Mohsine Menouni, 65 nm TID and SEU experience, ACES 2020 Seventh Common ATLAS CMS Electronics Workshop for LHC Upgrades, 2020. [Online]. Available: [https://indico.cern.ch/event/863071/contributions/3738770/](https://indico.cern.ch/event/863071/contributions/3738770/attachments/2044886/3425591/ACES2020_Menouni_20200526.pdf) [attachments/2044886/3425591/ACES2020\\_Menouni\\_20200526.pdf](https://indico.cern.ch/event/863071/contributions/3738770/attachments/2044886/3425591/ACES2020_Menouni_20200526.pdf) (cit. on p. [108\)](#page-115-0).

### Bibliography

[74] G. Borghello, F. Faccio, G. Termo, et al., "Effects of Bias and Temperature on the Dose-Rate Sensitivity of 65-nm CMOS Transistors," IEEE Transactions on Nuclear Science, vol. 68, no. 5, pp. 573-580, 2021, ISSN: 0018-9499. DOI: [10.1109/tns.2021.3062622](https://doi.org/10.1109/tns.2021.3062622) (cit. on p. [108\)](#page-115-0).

## List of Figures







## List of Tables



# Appendix A MOPS IO pads description

Below-mentioned is the description of some of the pads and components:

VDD SHUNT This is used to

- enable  $\Rightarrow$  connection to VCAN
- disable  $\Rightarrow$  connection to ground

shunt-mode of the SLDO amplifier.

REXT The resistor defines the input impedance of the SLDO in shunt-mode and is connected to VCAN when shunt-mode of the SLDO is enabled.

VDDPRE output of the pre-regulator and requires a connection to a 100nF blocking capacitor against the ground.

VREF The voltage reference for the definition of the regulator output and the ADC LSB voltage requires a 30 KOhm resistor connected to the ground.

VOFSHALF Offset voltage reference which is used when the SLDO is configured in shunt-mode and requires a 20 KOhm resistor connected to the ground.

BGTRIMx These pads are used to trim the bandgap circuit. MSB which is BGTRIM3 is pulled up internally while the rest of the pins are pulled down internally.

#### A.0.1 Power and IO Signal Specifications

In the following section, IO and power signals are described.

Table [A.1](#page-153-0) describes all the IO signals used in the chip. Please see the footnotes of the table for hints. Further details about different pads are given in table [A.4](#page-157-2) and [A.3.](#page-157-1)

## Appendix A MOPS IO pads description

<span id="page-153-0"></span>

| Signal name               | Direction                                                      | Signal type                | Pad type                                       | Description                                     |
|---------------------------|----------------------------------------------------------------|----------------------------|------------------------------------------------|-------------------------------------------------|
| BGTRIM <sub>0</sub>       | IN                                                             | digital                    | CERN_IO_<br>mod VSUB<br>PAD                    | Trimming bit for<br>the bandgap circuit         |
| BGTRIM1                   | IN                                                             | digital                    | $CERN$ <sub>_IO</sub><br>mod_VSUB<br>$-PAD$    | Trimming bit for<br>the bandgap circuit         |
| BGTRIM2                   | IN<br>digital                                                  |                            | $CERN$ $IO$<br>mod_VSUB<br>$-PAD$              | Trimming bit for<br>the bandgap circuit         |
| BGTRIM <sub>3</sub>       | IN                                                             | digital                    | $CERN$ <sub>_IO</sub> _<br>mod_VSUB<br>$-$ PAD | Trimming bit for<br>the bandgap circuit         |
| <b>BGIREF</b>             |                                                                | analog                     | $\mathrm{SF\_1V2\_FULL}$<br>LOCAL              | Reference current<br>for the bandgap<br>circuit |
| <b>VBUS</b>               | IN                                                             | analog                     | $SF_1V2\_FULL$<br>LOCAL                        | CAN bus<br>voltage in recessive<br>state        |
| <b>RXCAN</b>              | <b>OUT</b><br>digital                                          |                            | $CERN\_IO$<br>mod_VSUB<br>PAD                  | Received bit from<br>CAN Phy. layer             |
| <b>TXCAN</b>              | $CERN$ <sub>_IO</sub> _<br>mod_VSUB<br>IN<br>digital<br>$-PAD$ |                            | Bit to transmit<br>using CAN Phy.<br>layer     |                                                 |
| <b>RXLOG</b>              | IN                                                             | digital                    | CERN IO<br>mod_VSUB<br>$-PAD$                  | Received bit<br>to the logic                    |
| <b>TXLOG</b>              | digital<br><b>OUT</b>                                          |                            | $CERN$ <sub>_IO</sub> _<br>mod_VSUB<br>PAD     | Bit from the logic<br>to be transmitted         |
| ADDRCAN0<br>IN<br>digital |                                                                | CERN IO<br>mod VSUB<br>PAD | CAN node<br>address                            |                                                 |

Table A.1: All pads in the outer ring of the chip

| Signal name       | Direction  | Signal type | Pad type                                 | Description                                    |  |
|-------------------|------------|-------------|------------------------------------------|------------------------------------------------|--|
| ADDRCAN1          | IN         | digital     | CERN IO<br>mod VSUB<br>PAD               | CAN node<br>address                            |  |
| auto_trim_en      | IN         | digital     | $CERN$ <sub>_IO</sub><br>mod VSUB<br>PAD | CAN node<br>address                            |  |
| <b>RESETD</b>     | IN         | digital     | $CERN\_IO$<br>mod VSUB<br>$-PAD$         | External reset to<br>reset the digital<br>part |  |
| $Reset\_out$      | <b>OUT</b> | digital     | $CERN$ <sub>-IO</sub><br>mod_VSUB<br>PAD | Output of the<br>POR circuit                   |  |
| ADC34             | IN         | analog      | $SF_1V2$ CDM                             | ADC channel<br>input pad                       |  |
| ADC33             | IN         | analog      | $SF_1V2$ CDM                             | ADC channel<br>input pad                       |  |
| * VSUB            |            |             |                                          |                                                |  |
| ADC32             | IN         | analog      | $SF_1V2$ <sub>_</sub> CDM                | ADC channel                                    |  |
| ADC31             | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>                 | ADC channel                                    |  |
| ADC30             | IN         | analog      | $SF_1V2$ <sub>_</sub> CDM                | ADC channel                                    |  |
| ADC29             | IN         | analog      | $SF_1V2_CDM$                             | ADC channel                                    |  |
| ADC28             | IN         | analog      | $SF_1V2$ <sub>_</sub> CDM                | ADC channel                                    |  |
| ADC27             | IN         | analog      | SF 1V2 CDM                               | ADC channel                                    |  |
| ADC <sub>26</sub> | IN         | analog      | $\rm{SF\_1V2\_CDM}$                      | ADC channel                                    |  |
| ADC25             | IN         | analog      | $SF_1V2_CDM$                             | ADC channel                                    |  |
| ${\rm ADC24}$     | IN         | analog      | $SF_1V2_CDM$                             | ADC channel                                    |  |
| ADC23             | IN         | analog      | SF 1V2 CDM                               | ADC channel                                    |  |
| ADC22             | IN         | analog      | $SF_1V2$ <sub>_</sub> CDM                | ADC channel                                    |  |
| ADC21             | IN         | analog      | SF 1V2 CDM                               | ADC channel                                    |  |
| ADC20             | IN         | analog      | SF 1V2_CDM                               | ADC channel                                    |  |
| ADC19             | IN         | analog      | SF 1V2 CDM                               | ADC channel                                    |  |
| ADC18             | IN         | analog      | SF 1V2 CDM                               | ADC channel                                    |  |

Table A.1 continued from previous page

## Appendix A MOPS IO pads description

|                  |            |             | Lable A.I continued from previous page |                                                                   |  |
|------------------|------------|-------------|----------------------------------------|-------------------------------------------------------------------|--|
| Signal name      | Direction  | Signal type | Pad type                               | Description                                                       |  |
| ADC17            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | $\operatorname{ADC}$ channel                                      |  |
| * VSUB           |            |             |                                        |                                                                   |  |
| ADC16            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC15            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC14            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC13            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC12            | IN         | analog      | SF 1V2 CDM                             | $\operatorname{ADC}$ channel                                      |  |
| ADC11            | IN         | analog      | SF 1V2 CDM                             | ADC channel                                                       |  |
| ADC10            | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC9             | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC8             | $\rm IN$   | analog      | $SF_1V2$ <sub>_CDM</sub>               | $\operatorname{ADC}$ channel                                      |  |
| ADC7             | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC <sub>6</sub> | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC <sub>5</sub> | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC4             | IN         | analog      | $SF_1V2$ <sub>_CDM</sub>               | ADC channel                                                       |  |
| ADC3             | IN         | analog      | SF 1V2 CDM                             | ADC channel                                                       |  |
| <b>GNDSEN</b>    | IN         | analog      | $SF_1V2$ CDM                           | ADC channel                                                       |  |
| <b>VCANSEN</b>   | IN         | analog      | SF 1V2 CDM                             | ADC channel                                                       |  |
| <b>ADCRET</b>    | IN         | analog      | SF 1V2 FULL<br>LOCAL                   | Return line<br>for the ADC                                        |  |
| Clk out          | <b>OUT</b> | digital     | CERN IO mod<br>$VSUB$ PAD              | Clock output<br>from the internal<br>oscillator                   |  |
| $Clk$ in         | IN         | digital     | CERN IO mod<br>VSUB PAD                | Input clock to<br>the digital part                                |  |
| Vofshalf         | IN         | analog      | SF 1V2 OVT<br>NOB <sub>2</sub> B       | CAN transceiver<br>local ground                                   |  |
| Vref             | IN         | analog      | SF 1V2 OVT<br>NOB <sub>2</sub> B       | Reference voltage for<br>the regulator and<br>the ADC LSB voltage |  |

Table A.1 continued from previous page

| power<br>power                                  | $SF_1V2$ OVT<br>analog<br>$\_$ NOB2B<br>$SF_1V2_OVT$<br>analog<br>NOB <sub>2</sub> B<br>SF 1V2 OVT<br>analog<br>NOB <sub>2</sub> B<br>SF 1V2 POWER<br>$CLAMP$ CORE<br>SUPPLY<br>SF 1V2 POWER | Output of<br>the preregulator<br>Supply voltage for<br>the chip<br>Config resistor for<br>shuldo<br>Output of the<br>regulator<br>Output of the |                                                                                                   |
|-------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------|
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
| power/out                                       | CLAMP IO<br>analog<br><b>SUPPLY</b>                                                                                                                                                          | regulator                                                                                                                                       |                                                                                                   |
| power/in                                        | $SF_1V2_OVT$<br>analog<br>NOB2B                                                                                                                                                              | enable/disable shunt<br>mode by conn. to<br>VCAN/GND                                                                                            |                                                                                                   |
|                                                 |                                                                                                                                                                                              |                                                                                                                                                 |                                                                                                   |
| gnd                                             | $SF_1V2$<br><b>POWER</b><br>analog<br>$CLAMP$ <sub><math>_</math>IO</sub><br>GROUND                                                                                                          | Ground conn. for<br>the chip                                                                                                                    |                                                                                                   |
| $\text{in}/\text{out}$                          |                                                                                                                                                                                              | CAN bus high sig.                                                                                                                               |                                                                                                   |
| <b>CANL</b><br>$\text{in}/\text{out}$<br>analog |                                                                                                                                                                                              | CAN bus low sig.                                                                                                                                |                                                                                                   |
|                                                 |                                                                                                                                                                                              | Substrate conn.                                                                                                                                 |                                                                                                   |
|                                                 |                                                                                                                                                                                              | analog<br>gnd                                                                                                                                   | SF 1V2 CDM<br>$SF_1V2$ <sub>_CDM</sub><br>$SF_1V2$<br><b>POWER</b><br>CLAMP_CORE<br><b>GROUND</b> |

Table A.1 continued from previous page

### Maximum Ratings

This section specifies the maximum ratings for inputs.

• All the inputs with the pads that are connected to the 1V2 ESD ring must not be supplied more than  $\pm 1.3V$ . If the input goes above that value, then the ESD protection circuit gets activated.

- Inputs using SF\_1V2\_OVT\_NOB2B pad must not be supplied more than 2V in general but must look at individual values given in the table [A.2:](#page-157-0)
- <span id="page-157-0"></span>• To check all the ESD protection ratings of different pads used in the chip, tables [A.4](#page-157-2) and [A.3](#page-157-1) must be consulted.





## Power pad specifications

<span id="page-157-1"></span>Table [A.3](#page-157-1) lists all the power pads used in this chip.





#### IO pad Specifications

<span id="page-157-2"></span>Table [A.4](#page-157-2) lists all the IO pads used in the chip.

Table A.4: This table lists all the pads used for IO connections

| <b>Name</b>          |                | Type ESD connection    |
|----------------------|----------------|------------------------|
| CERN IO mod VSUB PAD |                | digital $1V2$ ESD ring |
| SF 1V2 CDM           |                | analog $1V2$ ESD ring  |
| SF 1V2 OVT NOB2B     | analog $OVT^*$ |                        |

| Table 11.4 continued from previous page<br>Name      | Type ESD connection   |
|------------------------------------------------------|-----------------------|
| SF 1V2 FULL<br>LOCAL                                 | analog $1V2$ ESD ring |
| * Over voltage tolerant i.e. can withstand up to 2V. |                       |

Table A.4 continued from previous page

## Appendix B MOPS CANopen object dictionary

### Object Dictionary (OD)

Table [B.1](#page-161-0) and [B.2](#page-164-0) show communication and manufacturer-specific profile area of the object dictionary respectively. All values are hexadecimal if not otherwise stated. The information in the object dictionary is required to communicate with the MOPS chip. The placement of index/subindex and read/write commands inside the CAN message should be done as explained in [6.3.2](#page-52-0) and [6.3.2](#page-54-2)

<span id="page-161-0"></span>

| Index       | <b>SI</b>        | Description                       | Data/Object    | Attr           | Default          | Comment                                                                    |
|-------------|------------------|-----------------------------------|----------------|----------------|------------------|----------------------------------------------------------------------------|
|             | 3                | Inhibit time                      | U16            | RO             | $\boldsymbol{0}$ |                                                                            |
|             | 4                | Reserved                          | U <sub>8</sub> | R <sub>O</sub> |                  | Unused, manufacturer specific                                              |
|             | $\overline{5}$   | Event timer                       | U16            | RO             | $\boldsymbol{0}$ |                                                                            |
|             | 6                | SYNC start value                  | U8             | R <sub>O</sub> | $\boldsymbol{0}$ |                                                                            |
| 1801        |                  | TPDO communication<br>parameter 1 | <b>RECORD</b>  |                |                  |                                                                            |
|             | $\boldsymbol{0}$ | Number of entries                 | U <sub>8</sub> | R <sub>O</sub> | $\,6\,$          |                                                                            |
|             | 1                | $COB-ID$                          | U32            | R <sub>O</sub> | $280 + Nodeld$   |                                                                            |
|             | $\overline{2}$   | Transmission type                 | U <sub>8</sub> | R <sub>O</sub> | FE               | Event driven - manufacturer specific                                       |
|             | 3                | Inhibit time                      | U16            | R <sub>O</sub> | $\boldsymbol{0}$ |                                                                            |
|             | 4                | Reserved                          | U <sub>8</sub> | R <sub>O</sub> |                  | Unused, manufacturer specific                                              |
|             | $\overline{5}$   | Event timer                       | U16            | R <sub>O</sub> | $\boldsymbol{0}$ |                                                                            |
|             | 6                | SYNC start value                  | U8             | R <sub>O</sub> | $\boldsymbol{0}$ |                                                                            |
|             |                  | TPDO mapping<br>parameter 0       | <b>RECORD</b>  |                |                  |                                                                            |
| <b>1A00</b> | $\overline{0}$   | Number of entries                 | U8             | RO             | $\mathbf{1}$     |                                                                            |
|             | 1                | PDO mapping entry                 | U32            | R <sub>O</sub> | 21000020         | "2100": Index; "00": Subindex; "20"<br>: Data length $(20=32 \text{ bit})$ |
| 1A01        |                  | TPDO mapping<br>parameter 1       | <b>RECORD</b>  |                |                  |                                                                            |
|             |                  |                                   |                |                |                  |                                                                            |

Table B.1 continued from previous page



<span id="page-164-0"></span>

