Embedding it better...



# **Table of Contents**

| 1. Introduction                                         | 3  |
|---------------------------------------------------------|----|
| 2. Clocking                                             |    |
| 1.1.ARM Core Clock                                      | -  |
| 1.2.IPG Clock – used for the access of most peripherals | 13 |
| 1.3.PERCLK – used by PIT and GPT                        | 14 |
| 1.4.UART_CLK_ROOT – used by all LPUARTs                 |    |
| 1.5.USDHC1_CLK_ROOT/USDHC2_CLK_ROOT                     | 18 |
| 1.6.SEMC_CLK_ROOT – external memory controller clock    | 20 |
| 1.7.FLEXSPI_CLK_ROOT                                    | 22 |
| 1.8.LPSPI_CLK_ROOT – used by all LPSPIs                 | 24 |
| 1.9.TRACE_CLK_ROOT                                      | 26 |
| 1.10.SAI1_CLK_ROOT/SAI2_CLK_ROOT/SAI2_CLK_ROOT          | 26 |
| 1.11.LPI2C_CLK_ROOT – used by all LPI2C controllers     | 27 |
| 1.12.CAN_CLK_ROOT                                       |    |
| 1.13.SPDIF0_CLK_ROOT                                    | 32 |
| 1.14.FLEXIO1_CLK_ROOT                                   | 32 |
| 1.15.USB1 Clock                                         | 32 |
| 3. Real Time Clock                                      | 33 |
| 4. Internal Clock Monitoring                            | 34 |
| 5. LPUART                                               | 35 |
| 6. LPI2C                                                |    |
| 7. LPSPI                                                | 38 |
| 8. FLEXCAN                                              |    |
| 9. USDHC                                                | 40 |
| 10. PIT                                                 | 41 |
| 11. GPT                                                 |    |
| 12. Enhanced FlexPWM                                    | 43 |
| 13. Quad Timer (TMR)                                    | 44 |
| 14. DMA                                                 | 45 |
| 15. GPIO                                                | 46 |
| 16. RAM and Cache                                       | 47 |
| 17. Boot Mode                                           |    |
| 18. Total Security and Simplicity                       | 55 |
| 19. Ethernet                                            | 56 |
| 20. XBAR and AOI                                        | 57 |
| 21. Battery Backup                                      | 60 |
| 22. Conclusion.                                         |    |
| Appendix A – Hardware Dependencies                      | 62 |
| a)Space for first Appendix                              |    |

# 1. Introduction

See i.MX RT 1021 document – only changes are here

The **MIMXRT1024-EVK** is used as test vehicle throughout the document but any board using the part can be easily configured based on the contained details.

# 2. Clocking

#### 1.1. ARM Core Clock

The ARM core is clocked by an internal clock signal called **AHB\_CLK\_ROOT**, which can be up to 500MHz. It may be automatically gated off in certain low power mode but these modes are of no concern to the discussion of the clocking capabilities and configuration. The following diagrams show the complete set of possibilities together with the project defines that control them – as well as divider settings and their ranges. The AHB\_CLK\_ROOT speed is probably the most important clock setting and the one that one will define as first frequency setting.

It is seen that there are essentially 7 possible settings. There are in fact some addition permutations but they don't make any sense to use in the i.MX RT 1021 because they are just further duplications of what is possible, with no added advantage; for example there are further paths that could select OSC\_CLK as source which are not shown to avoid unnecessary complications.

The 7 possibilities are now shown, whereby in each case the user can define an optional output divider between 1 and 8. It will be seen that three PLL sources are possible as source for this clock whereby it is useful to understand that these PLLs are all powered down by default and bypassed so that their input clock (OSC\_CLK in every case) is available at their outputs. If they are used they are powered up, their lock waited for and then the bypass removed. Each of these PLLs is a fixed frequency PLLs, meaning that the VCO output is defined to generate a fixed frequency from the fixed 24MHz OSC CLK input. However, some of the PLLs have PFDs (Phase Fractional Dividers) which can be tapped as output too - for example the System PLL (PLL2), which has its main output at 528MHz also has 4 PFD outputs (PFD0..PFD3) with frequencies of 352MHz, 594MHz, 396MHz and 594MHz respectively by default. Each of the PFD output frequencies can however be individually programmed or disabled, whereby the formula for the respective PDF frequency is PLL fixed ((frequency \* 18) / fraction) where fraction can be any integer value between 12 and 35. The following table shows the complete list of frequencies that can be selected for PLL2 and PLL3, whereby the PFD3 or each are optional core clock references as detailed further below.

| Fraction | System PLL (528MHz PLL2)<br>((528MHz * 18) / fraction) | USB1 PLL (480MHz PLL3)<br>((480MHz * 18) / fraction) |  |  |
|----------|--------------------------------------------------------|------------------------------------------------------|--|--|
| 12       | 792MHz                                                 | 720MHz - default PFD0                                |  |  |
| 13       | 731.0769231MHz                                         | 664.6153846MHz - default PFD1                        |  |  |
| 14       | 678.8571429MHz                                         | 617.1428571MHz                                       |  |  |
| 15       | 633.6MHz                                               | 576MHz                                               |  |  |
| 16       | 594MHz - default PFD1 and PDF3                         | 540MHz                                               |  |  |
| 17       | 559.0588235MHz                                         | 508.2352941MHz - default PFD2                        |  |  |
| 18       | 528MHz                                                 | 480MHz                                               |  |  |
| 19       | 500.2105263MHz                                         | 454.7368421MHz - default PFD3                        |  |  |
| 20       | 475.2MHz                                               | 432MHz                                               |  |  |
| 21       | 452.5714285MHz                                         | 411.4285714MHz                                       |  |  |
| 22       | 432MHz                                                 | 392.7272727MHz                                       |  |  |
| 23       | 413.2173913MHz                                         | 375.6521739MHz                                       |  |  |
| 24       | 396MHz - default PFD2                                  | 360MHz                                               |  |  |
| 25       | 380.16MHz                                              | 345.6MHz                                             |  |  |
| 26       | 365.5384615MHz                                         | 332.3076923MHz                                       |  |  |

### μ<u>Tasker</u> – i.MX RT 1024

| 27 | 352MHz - default PFD0 | 320MHz         |  |  |
|----|-----------------------|----------------|--|--|
| 28 | 339.4285714MHz        | 308.5714286MHz |  |  |
| 29 | 327.7241379MHz        | 297.9310345MHz |  |  |
| 30 | 316.8MHz              | 288MHz         |  |  |
| 31 | 306.58064524MHz       | 278.7096774MHz |  |  |
| 32 | 297MHz                | 270MHz         |  |  |
| 33 | 288MHz                | 261.8181818MHz |  |  |
| 34 | 279.5294118MHz        | 254.1176471MHz |  |  |
| 35 | 271.5428571MHz        | 246.8571429MHz |  |  |

Although the system PLL2 is intended to be used as a fixed 528MHz oscillator it can in fact be adjusted within the range 528MHz .. 552MHz be modifying its PLL loop settings. If the frequency is adjusted the values in the previous table are of course no longer exact and need to also be adjusted accordingly!

Very fine adjustments can be made and the optional define

528123456

causes the clock configuration to be performed to give the required frequency.

PLL2 has a lock time of < 11250 refence cycles and also support spread spectrum operation

#define RUN FROM DEFAULT CLOCK



Choose this setting to use the default clock configuration, which is in effect the OSC CLK (bypassed at the system PLL and switched through as shown to give 24MHz). The user can optionally reduce this frequency by a factor or /1 to /8 with the define #define AHB CLK ROOT DIVIDE set to the value desired (defaults to 1 if not defined).

#define PERIPH CLK SOURCE OSC



This is a variation of the same theme as the default clock setting but instead of routing the OSC CLK from the bypassed system PLL it is routed via a different path. In this configuration there is a second optional pre-scaler, #define

PERIPH\_CLK\_SOURCE\_OSC\_DIVIDE, which allows further reduction of the core frequency if needed. If not defined the divider defaults to 1.

#define PERIPH CLK SOURCE PLL6 500M



This configuration is useful for simply obtaining the maximum operating frequency of the i.MX RT 1021 by using the output of the fixed 500MHz ENET PLL. When used, the ENET PLL is powered up, lock is waited for and then its bypass is removed so that the clock can be routed to the core [when Ethernet is used this PLL is also required for its operation]. In addition to the AHB\_CLK\_ROOT divider PERIPH\_CLK\_SOURCE\_PLL6\_DIVIDE can optionally be defined to pre-scale the PLL output by 1..8 (when not defined the default is 1).

#define PERIPH CLK SOURCE PLL3 SW CLK



This configuration uses the USB1 PLL (PLL3) as reference. This is a 480MHz PLL that can be additionally pre-scaled by 1..8 by using the define

PERIPH\_CLK\_SOURCE\_PLL6\_DIVIDE. If not defined the pre-scaler defaults to 1. As in the case of the other PLLs this configuration causes the PLL to be powered, lock waited for and then its bypass removed so that the signal can be routed to the core.

#define PERIPH CLK SOURCE PLL2 528M



This configuration uses the System PLL (PLL2) as reference. This is a fixed 528MHz PLL and, as in the case of the other PLLs, this configuration causes the PLL to be powered, lock waited for and then its bypass removed so that the signal can be routed to the core.

It is to be noted that 528MHz is beyond the specification of AHB\_CLK\_ROOT and so an AHB\_CLK\_ROOT pre-scaler divide of at least 2 is needed!

Note further that, although the system PLL is usually used at 528MHz it can in fact be tuned over the range 528MHz..552MHz

Very fine adjustments can be made and the optional define

#define PLL2 FREQUENCY 528123456

causes the clock configuration to be performed to give the required frequency rather than the default one.

#define PERIPH CLK SOURCE PLL2 PFD3



This configuration uses the System PLL (PLL2) as reference. This is a fixed 528MHz PLL but its PFD3 output is sourced instead.

The PFD3 frequency is calculated by ((528MHz \* 18) / PLL2\_PFD3\_FRACTION) and so the illustrated value of 32 results in 297MHz. The list of possible PLL2-PFD3 frequencies can be found in the introduction to this chapter.

As in the case of the other PLLs, this configuration causes the PLL to be powered, lock waited for and then its bypass removed so that the signal can be routed to the core.

#define PERIPH CLK SOURCE PLL3 PFD3



This configuration uses the USB1 PLL (PLL3) as reference. This is a fixed 480MHz PLL but its PFD3 output is sourced instead.

The PFD3 frequency is calculated by ((480MHz \* 18) / PLL3\_PFD3\_FRACTION) and so the illustrated value of 35 results in 246.857MHz. The list of possible PLL3-PFD3 frequencies can be found in the introduction to this chapter.

As in the case of the other PLLs, this configuration causes the PLL to be powered and lock waited for so that the stable signal can be routed to the core.

The choice of the core clock represents the major work of setting up the clocks. The following details are then specific to peripherals used in the system. *Peripherals of no interest don't need to be specifically configured since they will use defaults and be gated off by the control code.* 

### 1.2. IPG Clock – used for the access of most peripherals

IPG\_CLK\_ROOT is an internal clock that is required to access most peripherals. It is derived exclusively from the AHB\_CLK\_ROOT, which was configured in the initial step.

IPG\_CLK\_ROOT is equal to AHB\_CLK\_ROOT divided by 1 to 4.



is used to configure this ratio, whereby if the define is not used it defaults to 1 (that is, IPG CLK ROOT is equal to AHB\_CLK\_ROOT).

The maximum IPG\_CLK\_ROOT frequency for the i.MX RT 1021 is 150MHz (max. for  $AHB_CLK_ROOT$  is 500MHz) and so it is important to ensure that the divider is set to ensure this speed is not exceeded.

The  $\mu$ Tasker project driver code will signal a build error if it detects that such a frequency has been exceeded. The  $\mu$ Tasker simulator also checks run-time derived frequencies and will exception if it detects such violations.

### 1.3. PERCLK – used by PIT and GPT

PERCLK\_CLK\_ROOT is the internal clock that feeds the PIT and GPT. It is derived either from the IPG\_CLK\_ROOT frequency, which was configured in the previous step, or from the 24.0MHz OSC CLK. In each case it has an optional pre-scaler of divide by 1..64.

#define PERCLK CLK ROOT SOURCE IPG CLK



#define PERCLK CLK ROOT DIVIDE 1 // 1..64

is used to configure this ratio, whereby if the define is not used it defaults to 1.

#define PERCLK CLK ROOT SOURCE OSC CLK



Should neither the PIT nor the GPT be used in the project this clock will automatically be disabled by the control code by disabling its clock gate.

The maximum PERCLK\_CLK\_ROOT frequency for the i.MX RT 1021 is 75MHz and so it is important to ensure that the divider is set to ensure this speed is not exceeded.

The  $\mu$ Tasker project driver code will signal a build error if it detects that and such a frequency has been exceeded. The  $\mu$ Tasker simulator also checked run-time derived frequencies and will exception if it detects such violations.

# 1.4. UART\_CLK\_ROOT - used by all LPUARTs

The LPUARTs in the i.MX RT 1021have a common clock that can be derived from either  $OSC\_CLK$  or from  $pll\_sw\_clk/6$ . A common pre-scaler allows the frequency to be divided by 1..64.

#define UART CLK ROOT FROM PLL3 SW CLK 6



In this configuration the USB1 PLL is used as reference (called pll\_sw\_clk) and divided by a fixed value of 6, resulting in 80MHz. An optional pre-scaler defined by UART CLK ROOT DIVIDER can divide this by 1 to 64 (when not defined, the default is 1)

#define UART\_CLK\_ROOT\_FROM\_OSC\_CLK



In this configuration the  $OSC\_CLK$  (24MHz) is used as reference, with the optional pre-scaler of 1 to 64.

Note that when PLL3 is not enabled it is left in its powered down, bypassed mode and the alternative clock would be 4MHz instead of 80MHz. This does in fact correspond to the default for the LPUART clock out or reset but this is not used as configuration option since it has no advantage.

UART\_CLK\_ROOT supplies all LPUARTs but the clock is automatically disabled at each individual LPUART input when the corresponding LPUART is not used.

## 1.5. USDHC1\_CLK\_ROOT/USDHC2\_CLK\_ROOT

By default the USDHC1\_CLK\_ROOT and USDHC2\_CLK\_ROOT are both sourced from PLL2-PFD2, which has a default frequency of 396MHz when PLL2 is enabled. A 3-bit pre-scale divider reduces this to half the value.





The alternative clock root source is PLL2-PFD0.





When an USDHC controller is used PLL2 is unconditionally used because it needs to be in operation.

The USDHCx CLK ROOT frequencies must not exceed 198MHz.

## 1.6. SEMC\_CLK\_ROOT - external memory controller clock

By default the SEMC\_CLK\_ROOT is sourced from PERIPH\_CLK, which is the clock that supplies the core clock's pre-scaler. Out of reset it is divided by 3. This configuration is shown in the first image:



Other configuration possibilities are for PLL2-PFD2 or PLL3-PFD1 as shown in the further images.





The SEMC CLK ROOT frequency must not exceed 166MHz.

### 1.7. FLEXSPI\_CLK\_ROOT

The default source of the FLEXSPI\_CLK\_ROOT is from the SEMC\_CLK\_ROOT (see previous section) with a pre-scaler of 2. As discussed in the previous section, the SEMC\_CLK\_ROOT is per default the PERIPH CLK (the clock root supplying the core clock pre-scaler) divided by 3. FLEXSPI CLK ROOT frequency must not exceed 322MHz.









## 1.8. LPSPI\_CLK\_ROOT - used by all LPSPIs

The LPSPIs in the i.MX RT 1021have a common clock that can be derived from either PLL2, PLL2 PFD2, PLL3 PFD0 or PLL3 PFD1. A common pre-scaler allows the frequency to be divided by 1..8.

#define LPSPI CLK FROM PLL2



If PLL2 were to be bypassed the source would be 24MHz rather than 528MHz.

#define LPSPI CLK FROM PLL3 PDF0



#define LPSPI\_CLK\_FROM\_PLL3\_PDF1



#define LPSPI\_CLK\_FROM\_PLL2\_PDF2



LPSPI CLK ROOT frequency must not exceed 132MHz.

When the use of a PLL or their phase fractional dividers is selected the PLL and necessary outputs are enabled as required, otherwise they are left in power down states when also not required for other clock roots.

LPSPI\_CLK\_ROOT supplies all LPSPIs but the clock is automatically disabled at each individual LPSPI input when the corresponding LPSPI is not used.

It is to be noted that the LPSPI timing is configured further in the LPSI module itself and in fact there is a minimum divide of 2, meaning that the actual LPSPI output clock speed is half of LPSPI CLK ROOT at least.

# 1.9. TRACE\_CLK\_ROOT

To add..

# 1.10. SAI1\_CLK\_ROOT/SAI2\_CLK\_ROOT/SAI2\_CLK\_ROOT

To add..

# 1.11. LPI2C\_CLK\_ROOT – used by all LPI2C controllers

The LPI2C controllers in the i.MX RT 1021 have a common clock that can be derived from either  $OSC\_CLK$  or from  $pll\_sw\_clk/8$ . A common pre-scaler allows the frequency to be divided by 1..64.

#define LPI2C CLK ROOT FROM PLL3 SW CLK 8



In this configuration the USB1 PLL is used as reference (called  $pll_sw_clk$ ) and divided by a fixed value of 8, resulting in 60MHz. An optional pre-scaler defined by LPI2C\_CLK\_ROOT\_DIVIDER can divide this by 1 to 64 (when not defined, the default is 1)

#define LPI2C\_CLK\_ROOT\_FROM\_OSC\_CLK



In this configuration the OSC\_CLK (24MHz) is used as reference, with the optional pre-scaler of 1 to 64.

Note that when PLL3 is not enabled it is left in its powered down, bypassed mode and the alternative clock would be 3MHz instead of 60MHz. This does in fact correspond to the default for the LPI2C clock out or reset but this is not used as configuration option since it has no advantage.

LPI2C\_CLK\_ROOT supplies all LPI2C controllers but the clock is automatically disabled at each individual LPI2C input when the corresponding LPI2C controller is not used.

## 1.12. CAN CLK ROOT

The FLEXCAN controllers in the i.MX RT 1021have a common clock that can be derived from either OSC\_CLK,  $pll_sw_clk/6$  or from  $pll_sw_clk/8$ . A common pre-scaler allows the frequency to be divided by 1..64.

#define CAN CLK ROOT FROM PLL3 SW CLK 6



In this configuration the USB1 PLL is used as reference (called  $pll_sw_clk$ ) and divided by a fixed value of 6, resulting in 80MHz. An optional pre-scaler defined by CAN CLK ROOT DIVIDER can divide this by 1 to 64 (when not defined, the default is 2)

#define CAN CLK ROOT FROM PLL3 SW CLK 8



In this configuration the USB1 PLL is used as reference (called  $pll_sw_clk$ ) and divided by a fixed value of 8, resulting in 60MHz. An optional pre-scaler defined by CAN\_CLK\_ROOT\_DIVIDER can divide this by 1 to 64 (when not defined, the default is 2)

#define CAN CLK ROOT FROM OSC CLK



In this configuration the OSC\_CLK (24MHz) is used as reference, with the optional pre-scaler of 1 to 64.

Note that when PLL3 is not enabled it is left in its powered down, bypassed mode and the alternative clocks would be 4/3MHz instead of 80/60MHz. This is not used as configuration option since it has no advantage.

CAN\_CLK\_ROOT supplies all FLEXCAN controllers but the clock is automatically disabled at each individual FLEXCAN input when the corresponding FLEXCAN controller is not used.

## 1.13. SPDIF0\_CLK\_ROOT

To add...

## 1.14. FLEXIO1\_CLK\_ROOT

To add..

#### 1.15. **USB1 Clock**

USB1's PHY requires a 480MHz clock which is derived exclusively from PLL3's fixed 480MHz output – for this reason PLL3 is known also as USB1 PLL. This means that, logically, PLL3 must be operating at 480MHz for USB1 to be used.

When USB is enabled with

#define USB\_INTERFACE

it is automatically configured as part of the clock initialisation so that it is subsequently available for use.

## 3. Real Time Clock

The i.MX RT 1021 has a low power RTC in its SNVS (Secure Non-Volatile Storage Module) which can be powered by a coin cell (2.4V .. 3.6V) on its VDD\_SNVS\_IN pin. Although an internal ring oscillator can be used to supply a rough 32kHz clock to the module higher accuracy time keeping is achieved by using a 32kHz crystal connected to the RTC\_XTALI and RTC\_XTALO pins.

# 4. Internal Clock Monitoring

The i.MX RT 1021 has two peripheral outputs called CCM\_CLKO1 (on GPIO\_SD\_B1\_02, or GPIO3-22) and CCM\_CLKO2 (on GPIO\_SD\_B1\_03, or GPIO3-23) which can be attached to various internal clocks. This can be useful to verify that these clocks really have the frequencies that are expected, as well as generating signals for external usage. Fast internal signals can also be divided down by a pre-scaler with a value between 1 and 8.

These are the clocks that can be selected:

### CCM CLKO1

PLL3\_SW\_CLK\_DIV2 PLL2\_DIV2 ENET\_PLL\_DIV2 SEMC\_CLK\_ROOT AHB\_CLK\_ROOT IPG\_CLK\_ROOT PERCLK\_ROOT PLL4\_MAIN\_CLK

### CCM CLKO2

USDHC1\_CLK\_ROOT LPI2C\_CLK\_ROOT OSC\_CLK\_ROOT LPSPI\_CLK\_ROOT USDHC2\_CLK\_ROOT SAI1\_CLK\_ROOT SAI2\_CLK\_ROOT SAI3\_CLK\_ROOT TRACE\_CLK\_ROOT CAN\_CLK\_ROOT FLEXSPI\_CLK\_ROOT UART\_CLK\_ROOT SPDIF0\_CLK\_ROOT

Two macros are made available to configure the pin and output the desired signals:

```
fnSetClock1Output(CLK, div)
fnSetClock2Output(CLK, div)
```

#### whereby examples of utilisation are:

### 5. LPUART

The i.MX RT 1021 LPUART driver is shared with the Kinetis LPUART driver and supports interrupt and DMA driven modes. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition \_imx. Sharing the driver is possible due to the high compatibility between the LPUART used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

See the UART user's manual for general details of usage: <a href="http://www.utasker.com/docs/uTasker/uTaskerUART.PDF">http://www.utasker.com/docs/uTasker/uTaskerUART.PDF</a>

It should be kept in mind that i.MX RT 1021 peripherals tend to be numbered 1..n and not 0..n-1, as is the case with Kinetis peripherals. To avoid confusion it is recommended to use the defines

iMX\_LPUART\_1
iMX\_LPUART\_2

iMX\_LPUART\_n

instead of channel numbers, whereby iMX\_LPUART\_1 is in fact 0.

The LPUARTs can be multiplexed onto various physical pins. When a specific LPUART is used it defaults to a certain set of pins if nothing else is specified – the following gives a list of the LPUART multiplex pins and the definition that can be used to control the use of alternatives where possible and if required:

| Channel                              |     | Default                        | Alternative 1                              | Alternative 2                                     |
|--------------------------------------|-----|--------------------------------|--------------------------------------------|---------------------------------------------------|
| iMX_LPUART_1 LPUART1_RXD LPUART1_TXD | (0) | GPIO_AD_B0_07<br>GPIO_AD_B0_06 |                                            |                                                   |
| iMX_LPUART_2 LPUART2_RXD LPUART2_TXD | (1) | GPIO_AD_B1_09<br>GPIO_AD_B1_08 | LPUART2_ON_EMC GPIO_EMC_23 GPIO_EMC_22     |                                                   |
| iMX_LPUART_3 LPUART3_RXD LPUART3_TXD | (2) | GPIO_AD_B0_15<br>GPIO_AD_B0_14 | LPUART3_ON_EMC GPIO_EMC_07 GPIO_EMC_06     |                                                   |
| iMX_LPUART_4 LPUART4_RXD LPUART4_TXD | (3) | GPIO_AD_B1_11<br>GPIO_AD_B1_10 | LPUART4_ON_EMC_LOW GPIO_EMC_03 GPIO_EMC_02 | LPUART4_ON_EMC_HIGH<br>GPIO_EMC_33<br>GPIO_EMC_32 |
| iMX_LPUART_5 LPUART5_RXD LPUART5_TXD | (4) | GPIO_AD_B0_11<br>GPIO_AD_B0_10 | LPUART5_ON_EMC GPIO_EMC_39 GPIO_EMC_38     |                                                   |
| iMX_LPUART_6 LPUART6_RXD LPUART6_TXD | (5) | GPIO_SD_B1_01<br>GPIO_SD_B1_00 | LPUART6_ON_EMC GPIO_EMC_13 GPIO_EMC_12     |                                                   |
| iMX_LPUART_7 LPUART7_RXD LPUART7_TXD | (6) | GPIO_SD_B0_05<br>GPIO_SD_B0_04 | LPUART7_ON_EMC GPIO_EMC_35 GPIO_EMC_34     |                                                   |
| iMX_LPUART_8 LPUART8_RXD LPUART8_TXD | (7) | GPIO_SD_B1_03<br>GPIO_SD_B1_02 | LPUART8_ON_EMC GPIO_EMC_27 GPIO_EMC_26     |                                                   |

For example, if LPUART2 is used its default pin-out Tx/RX is on GPIO\_AD\_B1\_08/GPIO\_AD\_B1\_09 [GPIO1-IO24 and GPIO1-IO25] but can be set instead to

GPIO\_EMC\_22/GPIO\_EMC\_23 [GPIO2-IO22 and GPIO2-IO23] by enabling the define LPUART2\_ON\_EMC. This define is set in app\_hw\_iMX.h

It is to be noted that the  $\mu$ Tasker project is often chosen due to its immediate support for free-running UART Rx DMA on all serial interfaces, which is something that is generally not found in other solutions. The i.MX RT 1021 thus could immediately inherit this operation.

## 6. LPI2C

The i.MX RT 1021 LPI2C driver is shared with the Kinetis LPUART driver and supports interrupt and DMA driven modes. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition <code>\_imx</code>. Sharing the driver is possible due to the high compatibility between the LPI2C used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

See the I<sup>2</sup>C user's manual for general details of usage: http://www.utasker.com/docs/uTasker/uTasker I2C.pdf

It should be kept in mind that i.MX RT 1021 peripherals tend to be numbered 1..n and not 0..n-1, as is the case with Kinetis peripherals. To avoid confusion it is recommended to use the defines

iMX\_LPI2C\_1
iMX\_LPI2C\_2
iMX\_LPI2C\_n

instead of channel numbers, whereby iMX\_LPI2C\_1 is in fact 0.

#### 7. LPSPI

The i.MX RT 1021 LPSPI driver is shared with the Kinetis LPSPI driver. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition \_imx. Sharing the driver is possible due to the high compatibility between the LPSPI used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

It should be kept in mind that i.MX RT 1021 peripherals tend to be numbered 1..n and not 0..n-1, as is the case with Kinetis peripherals. To avoid confusion it is recommended to use the defines

iMX\_LPSPI\_1

iMX\_LPSPI\_2

iMX\_LPSPI\_n

instead of channel numbers, whereby imx LPSPI 1 is in fact 0.

The LPI2Cs can be multiplexed onto various physical pins. When a specific LPI2C is used it defaults to a certain set of pins if nothing else is specified – the following gives a list of the LPI2C multiplex pins and the definition that can be used to control the use of alternatives where possible and if required:

| Channel                             |     | Default                        | Alternative 1                            | Alternative 2 |
|-------------------------------------|-----|--------------------------------|------------------------------------------|---------------|
| iMX_LPI2C_1 ( LPI2C1_SCL LPI2C1_SDA | (0) | GPIO_AD_B1_14<br>GPIO_AD_B1_15 | LPI2C1_ON_EMC GPIO_EMC_02 GPIO_EMC_03    |               |
| iMX_LPI2C_2 ( LPI2C2_SCL LPI2C2_SDA | (1) | GPIO_AD_B1_08<br>GPIO_AD_B1_09 | LPI2C2_ON_EMC GPIO_EMC_19 GPIO_EMC_18    |               |
| iMX_LPI2C_3 ( LPI2C3_SCL LPI2C3_SDA | (2) | GPIO_AD_B0_08<br>GPIO_AD_B0_09 | LPI2C3_ON_SD GPIO_SD_B0_00 GPIO_SD_B0_01 |               |
| iMX_LPI2C_4 ( LPI2C4_SCL LPI2C4_SDA | (3) | GPIO_EMC_11<br>GPIO_EMC_10     | LPI2C4_ON_SD GPIO_SD_B1_02 GPIO_SD_B1_03 |               |

For example, if LPI2C2 is used its default pin-out SCL/SDA is on GPIO\_AD\_B1\_08/GPIO\_AD\_B1\_09 [GPIO1-IO24 and GPIO1-IO25] but can be set instead to

GPIO\_EMC\_19/GPIO\_EMC\_18 [GPIO2-IO19 and GPIO2-IO18] by enabling the define LPI2C2 ON EMC. This define is set in app hw iMX.h

#### 8. FLEXCAN

The i.MX RT 1021 CAN driver is shared with the Kinetis CAN driver. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition \_imx. Sharing the driver is possible due to the high compatibility between the FLEXCAN used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

The FLEXCAN in the i.MX RT 1021 supports 64 receive buffers as opposed to the 16 in the FLEXCAN in the Kinetis parts.

See the CAN user's manual for general details of usage: <a href="http://www.utasker.com/docs/uTasker/uTaskerCAN.PDF">http://www.utasker.com/docs/uTasker/uTaskerCAN.PDF</a>

It should be kept in mind that i.MX RT 1021 peripherals tend to be numbered 1..n and not 0..n-1, as is the case with Kinetis peripherals. To avoid confusion it is recommended to use the defines

iMX\_FLEXCAN\_1
iMX\_FLEXCAN\_2

instead of channel numbers, whereby imx\_FLEXCAN\_1 is in fact 0.

## 9. USDHC

The i.MX RT 1021 USDHC driver is shared with the Kinetis SDHC driver. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition <code>\_imx</code>. Sharing the driver is possible due to the high compatibility between the SDHC used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

See the utFAT user's manual for general details of FAT12/FAT16 and FAT32 on SD cards: <a href="https://www.utasker.com/docs/uTasker/uTasker">https://www.utasker.com/docs/uTasker/uTasker</a> utFAT.PDF

## 10. ADC

The i.MX RT 1021 integrates two successive approximation AD controllers (ADC1 and ADC2) which achieve up to 12 bit resolution (with 10..11 bit accuracy) and up to 1MS/s sampling rate.

Each ADC has 16 input pins which can be multiplexed to. Since the majority of these pins are shared up to 19 ADC pins can actually be used in total.

The ADC inputs are not dedicated pins but instead are multiplexed with other functions. They are selected as GPIO inputs so that the analogue signal can be sampled by the ADC.

The ADCs have fixed ADC\_REFH and ADC\_REFL connected internally to VDDA and VSSA respectively.

The ADC driver interface is highly compatible with the Kinetis interface and is described in the ADC/DAC document <a href="https://www.utasker.com/docs/uTasker/uTaskerADC.pdf">https://www.utasker.com/docs/uTasker/uTaskerADC.pdf</a>

The i.MX RT 1021 supports external triggering of its ADCs via the ADC External Trigger Control (ADC ETC) module.

## 11. PIT

The i.MX RT 1021 PIT driver is shared with the Kinetis PIT driver. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition \_iMX. Sharing the driver is possible due to the high compatibility between the PIT used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

See the HW timer user's manual for general details of usage: http://www.utasker.com/docs/uTasker/uTaskerHWTimers.PDF

(see TEST\_PIT [TEST\_PIT\_SINGLE\_SHOT, TEST\_PIT\_PERIODIC and TEST\_PIT\_64 BIT in ADC Timers.h as reference to use).

#### 12. GPT

The i.MX RT 1021 has two General Purpose Timers (GPT), which are 32 bit up-counters with 12-bit pre-scaler (1..4096) clocked from PERCLK but with programmable counter clock source. Drivers are active when the project define SUPPORT\_TIMER is enabled in app hw iMX.h.

The GPTs supports capture mode on programmable edges of their associalted timer input pins (GPT\_CAPTUREx) and can generate an event on their output compare pins (GPT\_COMPAREx), as well as an interrupt when the counter reaches a programmed value. Capture mode support is active when SUPPORT CAPTURE is enabled.

Programmable clock sources to the prescaler are:

- Crystal oscillator (with its own 1..16 prescaler)
- External GPT CLK input
- Peripheral Clock
- Low frequency reference clock (32kHz)
- High frequency reference clock

Each GPT has two input capture channels and three output compare channels and operates either in Restart (based on compare channel 1) or Free-run mode of operation.

The GPT driver interface is highy compatible with the FlexTimer/TPM driver in the Kinetis project. Each GPT can be used to generate periodic or single shot interrupts, capture inputs or generate a square wave, low or high edge on match. See the Hardware Timer Guide at <a href="https://www.utasker.com/docs/uTasker/uTaskerHWTimers.PDF">https://www.utasker.com/docs/uTasker/uTaskerHWTimers.PDF</a> for more details.

Unfortunately, although the GPT can generate a square wave output, it can't generate PWM outputs. Also it can't trigger DMA operations.

The GPTs input/outpus can be multiplexed onto various physical pins. When a specific GPT is used it defaults to a certain set of pins if nothing else is specified – the following gives a list of the GPT multiplex pins and the definition that can be used to control the use of alternatives where possible and if required:

| GPT                    | Default     | Alternative 1 | Alternative 2 |
|------------------------|-------------|---------------|---------------|
| iMX_GPT_1 (0) GPT1_CLK | GPIO_EMC_39 |               |               |
| iMX_GPT_2 (1) GPT2_CLK | GPIO_EMC_31 |               |               |

#### 13. Enhanced FlexPWM

The i.MX RT 1021 has 2 enhanced FlexPWM modules. Each has 4 channels with A,B and X outputs and are based on 16 bit counters.

The FlexPWM is a flexible module that can generate complex signals and react to fault conditions. Each channel of each FlexPWM module has its own interrupt vector (capture, compre or reload) and each FlexPWM its own fault interrupt vector. The FlexPWMs can also generate DMA requests directly or via the XBAR.

The µTasker PWM driver allows a compatible interface with the Kinetis one to control the PWM channel outputs and this driver is enabled by the project define SUPPORT PWM MODULE.

See the Hardware Timer Guide at

https://www.utasker.com/docs/uTasker/uTaskerHWTimers.PDF for more details.

Unlike the FlexTimer/TPM in the Kinetis parts which have a single shared counter for each channel the enhanced FlexPWM has a counter for each channel which means that each channel of a FlexPWM can generate PWM signals of different frequencies.

## 14. Quad Timer (TMR)

The i.MX RT 1021 has two 16-bit quad-timers, each with 4 channels. This timer is clocked by inputs from pins, from the XBAR1 module outputs or by the IP bus clock (optionally prescaled by 2, 4, 8, 16, 32, 64 or 128). This means that the quad timer can serve as a timer (when using IP bus clock source) or as event counter.

The timer has inputs that can be used as counting inputs, to trigger the quad-timer capture or control count direction.

The timer has an output that has varous possible functions such as following the input clock, being set or cleared on compare match.

The quad timer driver is enabled by the project define <code>SUPPORT\_QUAD\_TIMER</code>.

See the Hardware Timer Guide at

https://www.utasker.com/docs/uTasker/uTaskerHWTimers.PDF for more details.

## 15. DMA

The i.MX RT 1021 DMA driver is shared with the Kinetis eDMA driver. Minor differences due to the i.MX RT 1021 hardware are controlled by the platform definition \_imx. Sharing the driver is possible due to the high compatibility between the eDMA and DMA MUX used in the Kinetis parts and i.MX RT 1021 and improves maintenance since only one source needs to be managed and the i.MX RT 1021 inherits the features from the mature Kinetis driver.

#### 16. **GPIO**

The i.MX RT 1021 GPIO / peripheral concept is quite different to the Kinetis concept. See the following video for an overview and also details concerning how the project was solved to ensure compatibility between Kinetis and i.MX RT:

https://www.youtube.com/watch?

v=SmFTi8hlba0&list=PLWKIVb MgDQFZAulrUywU30v869JBYi9Q&index=29

GPIOs can also be used as interrupts (see IRQ\_TEST in Port\_Interrupts.h as reference to use).

Each GPIO can be configured to generate an interrupt on low levels, high levels, falling edges or rising edges (or both falling and rising edges). The µTasker GPIO interrupt driver allows the user to assign an individual interrupt callback to each GPIO but it is useful to understand that the i.MX RT 1021 actually has the following interrupt vectors:

```
- {\tt PORT1-0}-individual\ vector\ for\ these\ pins
```

```
- PORT1-1
```

- PORT1-2
- PORT1-3
- PORT1-4
- PORT1-5
- PORT1-6
- PORT1-7
- PORT1-15..PORT1-8 these 8 pins share a single vector
- PORT1-31..PORT1-16 these 16 pins share a single vector
- PORT2-15..PORT2-0 these 16 pins share a single vector
- PORT2-31..PORT2-16 these 16 pins share a single vector
- PORT3-15..PORT3-0 these 16 pins share a single vector
- PORT3-31.. PORT3-16 these 16 pins share a single vector
- PORT5-15..PORT5-0 these 16 pins share a single vector
- PORT5-31.. PORT5-16 these 16 pins share a single vector

PORT1-0..PORT1-7 are the most efficient interrupts since the handler doesn't need to identify which port bits caused the interrupt before dispatching the user interrupt callback. Ports with an interrupt vector shared by more than one pin are slightly less efficient due to the need to identify which source or sources caused the interrupt and then dispatch one or more call-backs. When multiple GPIO input interrupts are pending at the same time the callbacks are dispatched in the order of the lower pin number up to the highest pin number.

It is possible to trigger DMA transfers on some GPIO inputs when they are connected to the XBAR. See the following video for more details: https://www.youtube.com/watch?v=zNWIGO7ZW0&feature=youtu.be

## 17. RAM and Cache

The i.MX RT 1021 contains 256k of internal RAM which is constructed of 8 banks of 32k each. These banks can each be assigned to one of three areas (FlexRAM controller):

- OCRAM General RAM operates at 1/4 the core clock speed (32 bit wide). This is cacheable, meaning that if L1 cache is enabled data content that is already in cache is used to avoid needing to perform the OCRAM access.
- ITCM Instruction Tightly Coupled Memory (64 bit wide) that is optimised for instruction execution at the maximum core speed. Non-cacheable (also since already optimally fast) and so no potential cache synchronisation problems.
- DTCM Data Tightly Coupled Memory (64 bit wide) that is optimised for data access at the maximum core speed. Non-cacheable (also since already optimally fast) and so no potential cache synchronisation problems.

For full details concerning the FlexRAM and optimal configuration to match an applications memory requirements NXP has prepared the application note AN12077 which can be found at <a href="https://www.nxp.com/docs/en/application-note/AN12077.pdf">https://www.nxp.com/docs/en/application-note/AN12077.pdf</a>

The i.MX RT 1021 has L1 cache with 16kBytes instruction cache and 16kBytes data cache. NXP has prepared the application note AN12042 which can be found at <a href="https://www.nxp.com/docs/en/application-note/AN12042.pdf">https://www.nxp.com/docs/en/application-note/AN12042.pdf</a>

Use of the cache can ensure high speed operation even when the source of code or data is in a slower memory by avoiding to have to unnecessarily fetch the data when it has been loaded once to the cache.

When the cache is enabled it caches from OCRAM and QSPI-Flash; it neither caches ITCM nor DTCM, which are already tightly coupled to the core.

The application can decide whether it uses data or instruction cache with the defines

#define ENABLE\_INSTRUCTION\_CACHE

#define ENABLE\_DATA\_CACHE

The FlexRAM controller configures the RAM banks at reset based on eFuse settings. The standard setting (when nothing else has been programmed) is for 128k OCRAM, 64k DTCM and 64k ITCM; the ROM loader may use the first 64k of the OCRAM when it operates.

This default setting is assumed in the  $\mu$ Tasker project to avoid special configuration requirements and therefore out of reset the banks are configured to give this memory map and layout:



Assuming it is decided that an application were best configured to have 5 banks for ITCM (so that the code could be completely located there – 160k) and 3 banks for data (so that up to 96k of data could be accesses at optimal speed) and no OCRAM the 8 banks could be configured as follows:



The µTasker FlexRAM driver always uses the bank order from instruction use to data use (if OCRAM were used it would be inserted between the two). The most important thing to understand is that when the bank use is modified the address range of the bank is also changed (it is moved in the memory map). This means that data that was in OCR before the change (in the default configuration) still exists in the bank memory but is now addressed in the ITC or DTC memory space instead.

This behaviour makes it complicated to change the memory configuration at run time because it means that any memory used before the change (eg. The stack or initialised variables) are usually at completely different locations after the change. Typically this will cause a program that simply changes the bank configuration without respecting the fact that its memory moves during the process to immediately fail. For this reason such changes are generally not performed during program operation; if such a configuration is changed it tends to be performed before any variable initialisation and also from code running in other sources and without stack dependency.

The µTasker concept assumes that code and variables fit in the internal RAM and so OCRAM is avoided. The division between ITC and OTC is performed at system initialisation automatically to allocate ITC banks to the code space and DTC banks to data space in such a way as to have as much DTC available as possible for heap and stack. If code of 130k were encountered it would thus assign 160k ITC and 96k DTC, as in the example. If less code were encountered additional banks would be assigned to DTC in order to maximise heap and stack availability. Code and data are automatically in the highest performance RAM areas and caching is not required to achieve optimal performance (without caching, no additional synchronisation of data is required).

There is an important reason for choosing the bank ordering: In the default configuration bank 7 is assigned to OCR but will not be used by the ROM loader (the ROM loader may use up to 64k only). After the bank swap is performed this bank is the last bank in DTM, whereby the stack pointer is located near the top, but leaving some additional space above it for 'preserved' variables. The advantage of this is that an application can always write values to the preserved area before a reset and these values will not be modified by the ROM loader. The  $\mu$ Tasker boot loader or another application can then read these values, even if the application uses a different RAM bank configuration; as long as its stack pointer is put to near the end of the final bank it will automatically be referenced to the the preserved data area! The preserved area is used in the  $\mu$ Tasker project for communicating between applications and the  $\mu$ Tasker boot loaders, but can also be used by custom applications for holding data that is guaranteed to be preserved across warm resets.

Due to the nature of the memory operation of the i.MX RT 1021, its configuration requirement to achieve optimal performance and the desire to allow  $\mu$ Tasker users to benefit from these with no additional effort the RAM bank management is an integral part of the  $\mu$ Tasker boot strategy and the  $\mu$ Tasker Boot Loader (see Boot Mode section) an integral part of every project (apart from when a stand-alone application is loaded in a debug environment for test purposes).

## 18. Boot Mode

The i.MX RT 1021 doesn't have internal flash and needs to boot from an external program source. This is controlled by an internal ROM which offers various boot loader capabilities which are controlled by two inputs (SRC\_BOOT\_MODE00/GPIO2-IO16 and SRC\_BOOT\_MODE01/GPIO2-IO17) as well as some eFUSEs (or further pins). The  $\mu$ Tasker project avoids the use of eFUSES and instead controls the operation in its code so that chips can be used in their default configuration.

The i.MX RT 1021 supports three main modes:

| SRC_BOOT_MODE01/GPIO<br>2-IO17/GPIO_EMC_17 | SRC_BOOT_MODE00/GPIO2-<br>IO16/GPIO_EMC_16 |                                        |
|--------------------------------------------|--------------------------------------------|----------------------------------------|
| 0                                          | 0                                          | Boot from fuses                        |
| 0                                          | 1                                          | Serial downloader<br>(LPUART1 or USB1) |
| 1                                          | 0                                          | Internal boot                          |

For simplicity the  $\mu$ Tasker project assumes that internal boot option is used as standard, meaning that the ROM loader runs and the exact configuration is taken from eFUSEs or pin overrides, whereby the  $\mu$ Tasker assumes that serial (QSPI) NOR-Flash is used: the MIMXRT1020-EVK has an IS25LP064A-JBLE to this effect, which is an 8 Mbyte part. It is connected in QSPI mode on the primary FlexSPI interface of the i.MX RT 1021.

To ensure that the NOR-Flash mode is used the processor pins

*GPIO\_EMC\_25/*GPIO2\_IO25

GPIO\_EMC\_24/GPIO2\_IO24

GPIO\_EMC\_23/GPIO2\_IO23

GPIO\_EMC\_22/GPIO2\_IO22

should be pulled down at reset.

| SRC_BT_CFG07<br>/GPIO2-<br>IO25/GPIO_EM<br>C_25 | SRC_BT_CFG06/<br>GPIO2-<br>IO24/GPIO_EMC_<br>24 | SRC_BT_CFG05/<br>GPIO2-<br>IO23/GPIO_EMC_<br>23 | SRC_BT_CFG04/<br>GPIO2-<br>IO22/GPIO_EMC_<br>22 |                              |
|-------------------------------------------------|-------------------------------------------------|-------------------------------------------------|-------------------------------------------------|------------------------------|
| 0                                               | 0                                               | 0                                               | 0                                               | FlexSPI1<br>(Serial<br>NOR)  |
| 0                                               | 0                                               | 1                                               | x                                               | SD card                      |
| 1                                               | 0                                               | х                                               | x                                               | MMC/eMMC                     |
| 0                                               | 1                                               | х                                               | х                                               | SEMC<br>(NAND)               |
| 0                                               | 0                                               | 0                                               | 1                                               | SEMC<br>(NOR)                |
| 1                                               | 1                                               | х                                               | х                                               | FlexSPI1<br>(Serial<br>NAND) |

The MIMXRT1020-EVK has all configuration inputs pulled to ground by default and supplies a DIP switch with 4 switches to allow configuring the main boot mode and whether the FlexSPI or SD card is booted from.

The following setting is used:

#### SW8

| DIP-1<br>SRC_BT_CFG00<br>/GPI02-<br>IO18/GPIO_EM<br>C_18 | DIP-2<br>SRC_BT_CFG05<br>/GPI02-<br>I023/GPI0_EM<br>C_23 | DIP-3<br>SRC_BOOT_MOD<br>E01/GPIO2-<br>IO17/GPIO_EM<br>C_17 | DIP-4<br>SRC_BOOT_MOD<br>E00/GPIO2-<br>IO16/GPIO_EM<br>C_16 |                                                                 |
|----------------------------------------------------------|----------------------------------------------------------|-------------------------------------------------------------|-------------------------------------------------------------|-----------------------------------------------------------------|
| OFF                                                      | OFF                                                      | ON                                                          | OFF                                                         | Internal boot<br>from FlexSPI1<br>(serial NOR)                  |
| ON                                                       | OFF                                                      | ON                                                          | OFF                                                         | Internal boot<br>from FlexSPI1<br>(serial NOR)<br>Encrypted XIP |

The eFUSES in a new part are set to supply the following configuration options in NOR Flash boot mode:

BOOT CFG1[0] = 0 = XIP is not encrypted

BOOT\_CFG2[2] = b00 = 500us hold time before read from device

BOOT CFG1[7..4] = b0000 = serial NOR device

BOOT\_CFG1[3..1] = b000 = NOR device supports 0x3b read by default (on primary pin-mux option)

The result is that when the processor starts the very first thing that it does (its internal ROM loader controlling it) is read a block of data from the start of SPI Flash (address 0x60000000 in the memory map) in 2 line mode at 30MHz. This block is expected to contain details about the NOR Flash that is connected (how many lines are connected, what speed is to be used to communicate with it – including setup times - what instruction sequences does it need, plus various other such information.

When the data read is valid (there is also a header to help identify valid content) the ROM loader sets up the FlexSPI interface accordingly, configures the SPI Flash further and begins communicating at the final speed.

The µTasker boot loader strategy flow diagram is show at <a href="https://www.utasker.com/docs/iMX/Loader.pdf">https://www.utasker.com/docs/iMX/Loader.pdf</a> whereby it automatically supplies a complete secure and clone-protected solution for serial loading and fail-safe OTA (Over-the-Air) updates.

The serial loader can be configured to support one or more of a number of loading methods such a USB, UART, Ethernet, and with existing protocols such as KBOOT. It allows for running applications in internal SRAM (for maximum efficiency) or directly from QSPI flash.

The application is responsible for allowing OTA uploads to the Upload Image Area by whatever means is appropriate for the project/product. It can support both OTA loading of new applications (to replace itself) or new serial loader, after which the physical swap of the code is performed by simply commanding a reset so that the µTasker "BM" Loader can complete the work. The "Fall-back" serial loader allows updates of the serial loader itself and provides recovery even when non-operational serial loaders were to be installed.

Typically the remaining QSPI-Flash space is used by the application for storing parameters (µParameterSystem) and files (µFileSystem).

The exact dimensions of the areas shown are configurable, as are the sizes of the OCR/ITC and DTC areas in RAM. Generally it is the "BM" loader that configures the RAM banks to optimally suit the application that is to be loaded and initialises the application's initial stack pointer suitably to the top of DTC memory (see the section on RAM and Cache for more details).

The µTasker boot loader concept is discussed ad demonstrated in the video: https://youtu.be/2XfqZq19XDw

# 19. Total Security and Simplicity

The µTasker concept allows an optimal combination of not just performance and power consumption but also code security. Rather than IP (intellectual property) protection being an after-thought it is already fully implemented so that the user hardly needs to be aware of it being in use.

The "Bare-Minimum" boot loader operates from QSPI flash but is stored in AES128 encrypted form. By using the chip's Bus Encryption Engine (BEE) its content is decrypted on-the-fly so that the "BM" loader - and the secret keys that it may store - are protected from being read in a plain text form directly from the QSPI flash memory.

The serial loader is (optionally) stored in an AES256 encrypted form in QSPI memory and can thus not be read in a plain text form. The "Bare-Minimum" boot loader performs AES256 decryption when in copies it to its RAM execution location. Since there are no subsequent accesses to the QSPI flash the BEE can be powered down as soon as the serial loader has been started. The serial loader then runs at full speed in tightly coupled instruction memory and has no power consumption overhead due to either QSPI access nor BEE operation.

The application is (optionally) saved in an AES256 encrypted form in QSPI memory, whereby the details and benefits are equivalent to those of the serial loader.

When updates are performed over-the-air there are no additional encryption/decryption steps needed (they are simply stored in the format as received since this is already protected both during the transmission and in the storage medium).

The "Bare-minimum" loader, which is responsible for storing the private decryption keys for the particular product and performing the decryption process, also recognises application code that is saved in plain text format and allows it to be executed if required. Developers thus have no addition complication if there is a reason - during development - to temporarily load non-encryped applications.

## 20. Ethernet

The i.MX RT 1021 has an internal 10/100M Ethernet controller but requires an external PHY for the connection to the cable.

There is a dedicated 500MHz fixed frequency PLL (PLL6) which supplies the Ethernet controller clocks. It also has a fixed 25MHz output referred to as  $ref_{enetpll2}$  and a configurable output called  $ref_{enetpll1}$ , programmable to 25MHz, 50MHz, 100MHz or 125MHz.

The MIMXRT1020-EVK uses a Micrel KSZ8081RNB as PHY, connected in RMII mode. This PHY accepts a 50MHz reference clock from the processor (Ethernet controller) which is supplied on GPIO\_AD\_BO\_08 (GPIO1-08) - peripheral function ENET\_REF\_CLK1

## 21. XBAR and AOI

**XBAR** is a Crossbar multiplexer with a certain number of inputs that can be connected to a certain number of outputs. It allows some processor pins and a variety of peripherals to be connected between each other. For example, a timer peripheral could be connected via the XBAR to the hardware trigger input of an ADC.

**AOI** is an "And-Or-Inverter" logic array. This allows some XBAR inputs to be logically treated to generate further XBAR inputs that only trigger events when the defined logical states are as programmed, and allows complex control of XBAR outputs beyond that of simply connecting inputs to outputs.

The i.MX RT 1021 has 3 XBARs. XBARA1, XBARB2 and XBARB3. Each of the XBARs have its own array of possible inputs and outputs that can be connected – specifically, the XBARA1 has 88 inputs and 132 outputs; XBARB2 has 58 inputs and 16 outputs; XBARB3 has 58 inputs and 16 outputs, whereby the XBARB2 and 3 outputs are dedicted to use with the AOI.

XBARA1 is special in that its first 4 outputs can be configured to generate interrupts when the change state (rising edge, falling edge or both edges) or they can generate DMA triggers on the same state changes (rising edge, falling edge or both edges).

The following image shows a simple practical example where a pin input (XBAR\_INOUT4 is connected to a pin output (XBAR\_INOUT5) and also to XBAR\_OUT02, which can also be used to generate interrupts or trigger DMA on the defined signal edge:



The AOI has 4 channels and each channel has 4 inputs which can be selected to be from certain XBAR outputs (see later for example).

These 4 inputs of the channel are fed in to 4 x 4-input AND gates, whereby each of the 16 inputs can be individually configured to be set to logical '0', logical '1', the selected input, or an inverted version fo the selected input (eg. If any of the inputs to an individual AND gate were to be set to '0' it effectively disables that AND gate for the logical array).

The outputs of the 4 x 4-input AND gates are then Ored together to generate the final output signal. The output signal can then be selected by the XBAR as an input which it then connects to one of its outputs to be output on a pin, connected to the input of a peripheral or to trigger an interrupt or DMA transfer.

For simplicity of configuration of a AOI channel logic the uTasker project supplies a macro:

This configures AOI1 channe 0 logically as

```
('0' & '1' & InputC & NOT Input D) | ('1' & '0' & NOT InputC & Input D) | (InputA & NOT InputB & '0' & '1') | (NOT InputA & InputB & '1' & '0')
```

which makes no sense in a real case since the output would always be '0', but shows all of the input controls in use. It should also show that it is very simple to generate a complex logical function based on the 4 inputs (named A, B, C and D).

Control of the AOI inputs from the XBARB2 is performed by macros as follows:

```
XBAR2_OUT_AOI1_IN00(XBAR2_LOGIC_HIGH); // connect a logic high to AOI1 A output
XBAR2_OUT_AOI1_IN01(XBAR2_LOGIC_HIGH); // connect a logic high to AOI1 A output
XBAR2_OUT_AOI1_IN02(XBAR2_PIT_TRIGGER0); // connect the PIT trigger 0 to AOI1 C output
XBAR2_OUT_AOI1_IN03(XBAR2_ACMP1_OUT); // connect the ACMP1 output to AOI1 D output
```

This resuts in the first channel of the AOI1 module to have its 4 inputs (A,B,C and D) connected to the signals defined by these multiplexer settings ('1', '1', PIT\_TRIGGER0, ACMP1\_OUT).

```
The second channel would be controlled by XBAR2_OUT_AOI1_IN04(), XBAR2_OUT_AOI1_IN05(), XBAR2_OUT_AOI1_IN06() and XBAR2_OUT_AOI1_IN07()
```

```
The third channel by XBAR2_OUT_AOI1_IN08(), XBAR2_OUT_AOI1_IN09(), XBAR2_OUT_AOI1_IN10() and XBAR2_OUT_AOI1_IN11()
```

```
And the fourth channel by XBAR2_OUT_AOI1_IN12(), XBAR2_OUT_AOI1_IN13(), XBAR2_OUT_AOI1_IN14() and XBAR2_OUT_AOI1_IN15()
```

The combination of XBARB2, AOI1 and XBARA1 - along with their inputs from pins and peripherals and outputs to pins and periperals - is known as the On-Chip Cross Trigger Network.

The following diagram shows how the IAO is 'inserted' into the path in the previous code example. The logical combination of the two inputs (PIT\_TRIGGER0 and ACMP1\_OUT) is connected to the pin output XBAR\_INOUT05, but could also be used to generate an input or DMA trigger, or be connected to other peripheral inputs that are connected on XBARA1.



For a list of the possible connections see the i.MX RT 1021 user's manual or the iMX.h header file in the uTasker project, which allows cnnection macros to be used with names rather than input/output numbers and also good compatibility between i.MX RT parts due to this abstraction.

See the following video for further details about this module: https://youtu.be/zNWIG-O7ZW0

# 22. Low Power and Battery Backup

The i.MX RT 1021 supports four low power modes:

- System IDLE
- Low Power IDLE
- SUSPEND
- SNVS

The i.MX RT 1021 has a coin cell input that can be used to supply power to parts of the device that require minimum operation and data retention when the main power is removed.

Specifically, the energy supplied by the input (VDD\_SNVS\_IN 2.4V..3.6V) is used to retain the state of the processors pads, supply power to its internal real time clock and retal data in its SNVS (secure non-volatile storage module) module. An internal low drop out regulator reduces the input voltage to a stable 1.1V for the internal operation and requires a 220nF capacitor on the VDD\_SNVS\_CAP output for stability. In addition a serier input resistor is recommended to reduce suge current when connecting the battery (ca. 1K is typically).

The register CCM\_ANALOG\_MISC0 contains a switch called DISCON\_HIGH\_SNVS which by default connects the VDD\_HIGH\_IN power input to the VDD\_SNVS\_IN input when the main supply is present, thus supplying this circuit's energy from the main power supply (and optionally charging the coin cell from the main power supply). This can to be set to '1' to move this connection

There are 128 bits of battery-backed memory in the SNVS which can be read/written as 4 long words as registers SNVS\_LPGPR0, SNVS\_LPGPR1, SNVS\_LPGPR2 and SNVS\_LPGPR3.

These registers are however reset to 0x00000000 when a tamper event is detected by the chip (in order to clear user keys stored here) and in systems not using the tamper detection circuity it is necessary to disable the clearing mechanism by setting bit 24 in SNVS\_LPCR. This bit is referred to by user documentatio but is otherwise not defined by the literature. Since this control flag is also retained across software resets it is suitable for enabling general purpose storage to these registers. Read/write access do however need to be long word accesses for correction operation!

The lowest power state is SNVS, in which the majority of the chip is powered down, leaving on the RTC operational. From this state there are two possibilities to wake the processor:

- RTC wakeup
- GPIO wakeup (WAKEUP pin GPIO5 IO00)

## 23. Conclusion

The present state of development is that the i.MX RT 1021 (MIMXRT1020-EVK) can be operated from SRAM (with debugger or together with the µTasker boot loader) or from QSPI Flash (stand-alone) in various clocking configurations up to maximum speed. The following operations can be demonstrated:

- GPIO, port interrupts
- LPUARTs (interrupt and DMA driven)
- Ethernet
- HS USB device and host
- uSDHC
- Parameter system/File system in QSPI
- PIT / GPT / FlexPWM
- Boot-loading with USB-MSD and application operation in dynamically configured FlexRAM
- Dynamic low power (WAIT) mode
- DMA operations (memory-to-memory) can be demonstrated
- Instruction and data cache can be optionally enabled/disabled
- Operation of i.MX RT 1010, 1015, 1020, 1050, 1060, 1064
- Encrypted code storage and over-the air concept
- Operation from inernal RAM, SDRAM, XiP and on-the-fly decrypted XiP

All such operation is simulated in visual studio.

## Present problems/investigation:

- memory to memory DMA transfer is not giving any speed benefits over CPU copy: <a href="https://community.nxp.com/thread/518925">https://community.nxp.com/thread/518925</a>
- Free-running LPUART DMA reception works correctly when cache is disabled but has disturbances when cache is enabled
- Ethernet works reliably after a power cycle but not always after a push-button reset
- Dynamic low power operation supported but not yet successful on all parts (SYSTICK seems to stop on i.MX RT 1060/64 parts, for example)

#### Modifications:

V0.10 22.3.2020: Initial version in development V0.13 01.9.2020: Added XBAR, AOI chapter

V0.14 06.11.2020: Add GPT, FlexPWM, LPUART mux set

# Appendix A – Hardware Dependencies

a) Space for first Appendix