Loading...
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 | /* * Copyright (c) 2013 Apple Inc. All rights reserved. * * @APPLE_OSREFERENCE_LICENSE_HEADER_START@ * * This file contains Original Code and/or Modifications of Original Code * as defined in and that are subject to the Apple Public Source License * Version 2.0 (the 'License'). You may not use this file except in * compliance with the License. The rights granted to you under the License * may not be used to create, or enable the creation or redistribution of, * unlawful or unlicensed copies of an Apple operating system, or to * circumvent, violate, or enable the circumvention or violation of, any * terms of an Apple operating system software license agreement. * * Please obtain a copy of the License at * http://www.opensource.apple.com/apsl/ and read it before using this file. * * The Original Code and all software distributed under the License are * distributed on an 'AS IS' basis, WITHOUT WARRANTY OF ANY KIND, EITHER * EXPRESS OR IMPLIED, AND APPLE HEREBY DISCLAIMS ALL SUCH WARRANTIES, * INCLUDING WITHOUT LIMITATION, ANY WARRANTIES OF MERCHANTABILITY, * FITNESS FOR A PARTICULAR PURPOSE, QUIET ENJOYMENT OR NON-INFRINGEMENT. * Please see the License for the specific language governing rights and * limitations under the License. * * @APPLE_OSREFERENCE_LICENSE_HEADER_END@ */ #pragma once #include <mach/kern_return.h> #include <stdint.h> #include <sys/cdefs.h> #include <mach/vm_types.h> __BEGIN_DECLS #ifdef XNU_KERNEL_PRIVATE extern ppnum_t *ecc_bad_pages; extern uint32_t ecc_bad_pages_count; /* Counts for sysctls*/ extern uint32_t vm_ecc_db_pages_count; extern uint32_t vm_ecc_zero_pages_count; extern uint32_t vm_ecc_panic_pages_count; extern uint32_t vm_ecc_max_db_pages; #endif /* Old ECC logging mechanism */ #define ECC_EVENT_INFO_DATA_ENTRIES 8 struct ecc_event { uint8_t id; // ID of memory (e.g. L2C), platform-specific uint8_t count; // Of uint64_t's used, starting at index 0 uint64_t data[ECC_EVENT_INFO_DATA_ENTRIES] __attribute__((aligned(8))); // Event-specific data }; #ifdef KERNEL_PRIVATE extern kern_return_t ecc_log_record_event(const struct ecc_event *ev); #endif #ifdef XNU_KERNEL_PRIVATE #include <mach/vm_param.h> #define ECC_PANIC_PAGE_MAGIC 0xEC #define ECC_PANIC_PAGE_SIGN ((1ULL << 63) | (ECC_PANIC_PAGE_MAGIC)) #define ECC_PANIC_PAGE_MASK ((1ULL << 63) | (PAGE_MASK)) extern kern_return_t ecc_log_get_next_event(struct ecc_event *ev); extern uint32_t ecc_log_get_correction_count(void); #endif #define ECC_TESTING (DEVELOPMENT || DEBUG) /* New CoreAnalytics ECC logging mechanism */ #define VM_ECC_PAGE_POISON_GRANULE_SHIFT (7) #define VM_ECC_PAGE_POISON_GRANULE (1 << VM_ECC_PAGE_POISON_GRANULE_SHIFT) /* Flags to describe ECC memory errors */ __options_decl(ecc_flags_t, uint32_t, { ECC_NONE = 0x00000000, /* An error is correctable (1) or uncorrectable (0). */ ECC_IS_CORRECTABLE = 0x00000001, /* The database is corrupt. */ ECC_DB_CORRUPTED = 0x00000002, /* The error was injected for testing purposes. */ ECC_IS_TEST_ERROR = 0x00000004, /* Do not trigger a CA report, just record to the DB (for testing purposes) */ ECC_DB_ONLY = 0x00000008, /* Filter out the given address from the DB*/ ECC_REMOVE_ADDR = 0x00000010 }); /** * ECC versions. */ __options_decl(ecc_version_t, uint32_t, { ECC_V1, // Metadata ECC_NUM_VERSIONS }); /** * ECC event descriptor. * * @note If a new ECC version has been added (e.g. future hardware must * log new or different data) new fields should be appended to this struct to * represent the new data. No fields should be deleted from this struct unless * the field corresponds only to hardware that has been deprecated. */ typedef struct { /* Version of this struct. */ ecc_version_t version; /* Flags describing the reported error. */ ecc_flags_t flags; /* Physical address of failure */ uint64_t physaddr; /* Number of CEs reported at physaddr */ uint32_t ce_count; /* Vendor ID */ uint32_t vendor; /* Reserved for future extension to report row, column, bank, etc. */ uint32_t reserved[4]; } ecc_event_t; _Static_assert(sizeof(ecc_event_t) == 10 * sizeof(uint32_t), "ecc_event_t size must be updated in memory_error_notification.defs"); /** * platform_error_handler_ecc_poll_t is the type of callback registered by the * platform error handler that xnu can use to poll for ECC data. */ typedef int (*platform_error_handler_ecc_poll_t)(uint64_t *addrs, uint32_t *error_count); kern_return_t kern_ecc_poll_register(platform_error_handler_ecc_poll_t poll_func, uint32_t max_errors); /* Flags to describe MCC memory errors */ __options_decl(mcc_flags_t, uint32_t, { MCC_NONE = 0x00000000, MCC_IS_SINGLE_BIT = 0x00000001, MCC_IS_MULTI_BIT = 0x00000002, }); /** * MCC ECC versions. */ typedef enum { MCC_ECC_V1, // Metadata MCC_ECC_NUM_VERSIONS } mcc_ecc_version_t; /** * MCC ECC event descriptor. * * @note If a new MCC ECC version has been added, because i.e. future hardware must log new or different data, * new fields should be appended to this struct to represent the new data. No fields should be * deleted from this struct unless the field corresponds only to hardware that has been deprecated. */ typedef struct { /* Version of this struct. */ mcc_ecc_version_t version; /* Flags used to describe the error. */ mcc_flags_t flags; /* Interrupt status at the time of the MCC error. */ uint32_t status; /* AMCC on which the error occurred. */ uint32_t amcc; /* Plane of the AMCC on which the error occurred. */ uint32_t plane; /* MemCache error Bank of first one bit error. */ uint32_t bank; /* MemCache error Way of first one bit error. */ uint32_t way; /* MemCache error Index of first one bit error. */ uint32_t index; /* Indicates whether the error is in upper half cache line or lower half cache line. */ uint32_t bit_off_cl; /* MemCache one bit error bit offset of first one bit error with in half cache line. */ uint32_t bit_off_within_hcl; } mcc_ecc_event_t; _Static_assert(sizeof(mcc_ecc_event_t) == 10 * sizeof(uint32_t), "ecc_event_t size must be updated in memory_error_notification.defs"); #if KERNEL_PRIVATE /** * Logs any memory error. * * This will notify mmaintenanced of the error. The error * will get added to a database of errors and sent to * CoreAnalytics. If ECC_IS_CORRECTABLE == 0, * the address will be added to dramecc.db and will * be retired for the lifetime of the device. * * If it is too early in boot to send a notification directly * to the deamon, the error will be added to an array to be serviced * later by an mpsc_daemon_queue. * * If ECC_IS_CORRECTABLE flag is set with this function, it * assumes one error. If caller wishes to report the CE count * reported by hardware, use ecc_log_memory_error_ce(). * * @param physical_address address that the error occured on * @param ecc_flags flags used to describe the error * * @returns KERN_SUCCESS if logging supported by hw, KERN_FAILURE if not */ extern kern_return_t ecc_log_memory_error(uint64_t physical_address, ecc_flags_t ecc_flags); extern kern_return_t ecc_log_memory_error_internal(uint64_t physical_address, ecc_flags_t ecc_flags); /* * Used to report delayed errors, scraped after ECC is enabled. */ extern kern_return_t ecc_log_memory_error_delayed(uint64_t physical_address, ecc_flags_t ecc_flags); /** * Logs a correctable memory error. * * ECC_IS_CORRECTABLE is implied. Including this flag or not * makes no difference for this function. * * @param physical_address address that the error occured on * @param ecc_flags flags used to describe the error * @param ce_count number of CEs occured on this page reported by HW * * @returns KERN_SUCCESS if logging supported by hw, KERN_FAILURE if not */ kern_return_t ecc_log_memory_error_ce(uint64_t physical_address, ecc_flags_t ecc_flags, uint32_t ce_count); /** * Logs an MCC error. * * @param event Event to be logged * @returns KERN_SUCCESS on success, KERN_FAILURE otherwise */ kern_return_t mcc_log_memory_error(mcc_ecc_event_t event); #endif /* KERNEL_PRIVATE */ __END_DECLS |