-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathenumerator-traits-body.html
444 lines (322 loc) · 29.9 KB
/
enumerator-traits-body.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
<p>Document number: Nnnnn=yy-nnnn (Not yet assigned)</p>
<p>Date: 2013-10-03</p>
<p>Project: Programming Language C++, Library Working Group</p>
<p>Reply-to: Andrew Tomazos <a href="mailto:andrewtomazos@gmail.com">andrewtomazos@gmail.com</a>, Christian Käser <a href="mailto:christiankaeser87@googlemail.com">christiankaeser87@googlemail.com</a></p>
<h1>Enumerator List Property Queries</h1>
<h2>Table of Contents</h2>
<ul>
<li>Background</li>
<li>Introduction</li>
<li>Motivation and Scope</li>
<li>Design Goals</li>
<li>Design Log</li>
<li>Technical Specifications</li>
<li>Reference Implementation</li>
<li>Use Cases</li>
<li>Performance Issues</li>
<li>Acknowledgements</li>
</ul>
<h2>Background</h2>
<p>An enumeration type is defined by an <em>enum-specifier</em>. An <em>enum-specifier</em> contains an <em>enumerator-list</em>. An <em>enumerator-list</em> is a sequence of zero or more <em>enumerator-definitions</em>. Each <em>enumerator-definition</em> introduces an identifier (the name of the enumerator) and a corresponding value (either implicitly or explicitly).</p>
<h2>Introduction</h2>
<p>We propose to add three Property Queries [meta.unary.prop.query] to the Metaprogramming and Type Traits Standard Library that provide compile-time access to the <em>enumerator-list</em> of an enumeration type.</p>
<p>Specifically:</p>
<ul>
<li><code>std::enumerator_list_size<E></code>: the number of <em>enumerator-definitions</em> in the <em>enumerator-list</em> of <code>E</code>.</li>
<li><code>std::enumerator_identifier<E,I></code>: the identifier from the <code>I</code>‘th <em>enumerator-definition</em>.</li>
<li><code>std::enumerator_value<E,I></code>: the value from the <code>I</code>‘th <em>enumerator-definition</em>.</li>
</ul>
<p>These urgently needed queries will enable metaprogrammers to implement frequently-asked-for higher-level facilities such as static checks and reflection. Acceptance of this proposal does not preclude future standardization of such higher-level facilities.</p>
<h2>Motivation and Scope</h2>
<p>Being able to inspect the <em>enumerator-list</em> during translation is the missing common foundation in C++ of a large number of frequently-asked-for enumeration-related reflection facilities.</p>
<p>Because implementations do not currently expose this basic semantic information about an <em>enumerator-list</em> to library authors, implementing such facilities parameterized by the enumeration type is currently impossible. Current workarounds involve either:</p>
<ul>
<li>Hand-maintaining and keeping synchronized secondary entities/lists for each <em>enum-specifier</em>. Prone to error and tedious.</li>
<li>Wrapping the <em>enum-specifier</em>, with macros and using obscene preprocessor tricks. This obfuscates the <em>enum-specifier</em> to both programmers and programming tools, relies on non-standard preprocessor features and can complicate the build process.</li>
<li>Implementing a “precompiler” tool such as Qt moc to scan the source for enum specifiers and then automatically-generate additional translation units. This is a huge amount of work to implement and maintain in synchronization with the compiler, is inefficient, as it requires parsing the source twice from a separate tool, and complicates the build process.</li>
</ul>
<p>These are all just workarounds to get at information that the compiler has readily available during translation, but does not expose. This proposal corrects that.</p>
<p>The intended user community of this proposal are infrastructure-providers and framework-authors, which will enable them to provide generic tools benefitting most enumeration users.</p>
<p>The use of the proposed feature is intended, as for existing Metaprogramming Property Queries, to support programmers that use metaprogramming and generic programming to form higher-level constructs at compile-time.</p>
<p>We have a complete reference implementation of the proposed feature. It is extremely easy to implement, and for a basic implementation requires only implementing three compiler intrinsics that inspect the annotated AST of the <em>enumerator-list</em>. The interface is setup, as for existing meta property queries, so that these intrinsics only need to be callable at compile-time when the properties are instantiated - and so does not mandate any run-time code generation at all. The compile-time cost should be non-exisiting if the properties are never instantiated, and as it provides direct access the internal compiler data structure the cost when instantiated is minimized.</p>
<h2>Impact On the Standard</h2>
<p>The proposed feature adds three Property Queries to the Metaprogramming and Type Traits library in the typical form. It requires only some minimal compiler support to implement the three queries.</p>
<p>As for all Property Queries in the Metaprogramming and Type Traits library, the proposed three Property Queries are for use by metaprogrammers at compile-time. The current ease-of-use of these constructs is the same as, for example, <code>std:rank</code>, <code>std::extent</code>, <code>std::get(tuple)</code> and working with string literals (array of char defined with constexpr). As C++ continues to evolve more sophisticated metaprogramming features, the proposed queries will become easier to use along with them.</p>
<p>As <code>std::enumerator_value<E,I></code> is a value of enumeration type E, it can be used in combination with the existing <code>std::underlying_type<E></code> if one wishes to convert it to a value of the underlying type instead.</p>
<p><code>std::make_index_sequence</code> is also useful for ease-of-use to transform the properties into a pack or a constexpr array if so desired.</p>
<h2>Design Goals</h2>
<p>Our design goals for the interface were:</p>
<ul>
<li>be complete (provide all semantic information about the <em>enumerator-list</em> of any possible enumeration type)</li>
<li>should provide efficient queries of the internal compiler data structure</li>
<li>be minimal and low-level</li>
<li>entail no run-time cost</li>
<li>entail no compile-time cost if unused</li>
<li>not mandate any specific lookup algorithm or data structure at runtime</li>
<li>be self-contained and free of dependencies</li>
</ul>
<p>There is a more detailed rationale and history of how they were arrived at and applied under the section Design Log below.</p>
<h2>Technical Specifications</h2>
<p><strong>In Existing Section</strong> 20.11.5 [meta.unary.prop.query] Type property queries</p>
<ol>
<li>This sub-clause contains templates that may be used to query properties of types at compile time.</li>
</ol>
<p><strong>Add To</strong> Table 50 - Type property queries:</p>
<table border="1">
<tr>
<td>
<b>Template</b>
</td>
<td>
<b>Value</b>
</td>
</tr>
<tr>
<td>
<code>template<class E> struct enumerator_list_size;</code>
</td>
<td>
An integer value representing the number of enumerators in E<br/>
<br/>
<i>Requires:</i> <code>std::is_enum<E></code> shall be true
</td>
</tr>
<tr>
<td>
<code>template<class E, std::size_t I> struct enumerator_identifier;</code>
</td>
<td>
A value of type array of char, defined with constexpr, holding<br/>
the identifier of the I'th enumerator of E in declared order, where<br/>
indexing of I is zero-based. The value shall be null-terminated,<br/>
UTF-8 encoded, and have any universal-character-names decoded.<br/>
<br/>
<i>Requires:</i> <code>std::is_enum<E></code> shall be true<br/>
<i>Requires:</i> <code>I</code> shall be nonnegative and less than <code>std::enumerator_list_size<E></code>
</td>
</tr>
<tr>
<td>
<code>template<class E, std::size_t I> struct enumerator_value;</code>
</td>
<td>
A value of type <code>E</code> that is the <code>I</code>'th enumerator<br/>
of <code>E</code> in declared order, where indexing of <code>I</code> is zero-based.<br/>
<br/>
<i>Requires:</i> <code>std::is_enum<E></code> shall be true<br/>
<i>Requires:</i> <code>I</code> shall be nonnegative and less than <code>std::enumerator_list_size<E></code>
</td>
</tr>
</table>
<p><strong>Add New Paragraph 4:</strong></p>
<pre><code> [Example:
// In the below π is in source encoding (before translation phase 1)
enum foo
{
bar,
Baz,
Qux = 42,
Quux = 42,
eπ = 96,
f\u03C0 = 300000
};
// the following assertions hold
static_assert(enumerator_list_size<foo>::value == 6);
assert(std::strcmp(enumerator_identifier<foo,0>::value, u8"bar") == 0);
assert(std::strcmp(enumerator_identifier<foo,1>::value, u8"Baz") == 0);
assert(std::strcmp(enumerator_identifier<foo,2>::value, u8"Qux") == 0);
assert(std::strcmp(enumerator_identifier<foo,3>::value, u8"Quux") == 0);
assert(std::strcmp(enumerator_identifier<foo,4>::value, u8"eπ") == 0);
assert(std::strcmp(enumerator_identifier<foo,5>::value, u8"f\u03C0") == 0);
static_assert(enumerator_value<foo,0>::value == Bar);
static_assert(enumerator_value<foo,1>::value == Baz);
static_assert(enumerator_value<foo,2>::value == Qux);
static_assert(enumerator_value<foo,3>::value == Quux);
static_assert(enumerator_value<foo,4>::value == eπ);
static_assert(enumerator_value<foo,5>::value == f\u03C0);
// universal character names handled correctly:
assert(std::strcmp(enumerator_identifier<foo,4>::value, u8"e\u03C0") == 0);
assert(std::strcmp(enumerator_identifier<foo,5>::value, u8"fπ") == 0);
static_assert(enumerator_value<foo,4>::value == e\u03C0);
static_assert(enumerator_value<foo,5>::value == fπ);
// constant lvalue-to-rvalue conversion on array elements ok:
constexpr const char* str = enumerator_identifier<foo,0>::value;
static_assert(str[0] == 'b');
static_assert(str[1] == 'a');
static_assert(str[2] == 'r');
static_assert(str[3] == '\0');
-- end example]
</code></pre>
<h2>Design Log</h2>
<p>The proposal in its current form is the result of a merge of several separate proposals.</p>
<p>The first was to add a core language feature that, given an enumeration type <code>enum E {a, b, c}</code>, a construct of the form <code>E...</code> would expand to a pack of the enumerator list of <code>E</code>, that being <code>a, b, c</code>. This could be used in the usual contexts, <code>braced-init-list</code> contexts or as template or function arguments. This was later improved by suggesting that instead a non-type parameter pack of type <code>E...</code> would match an argument of type <code>E</code>, and instead of being ill-formed, would expand to the enumerator list of <code>E</code>. As such a pack can be formed using the proposed primitives, and as the proposed functions are more appropriate encapsulated as a library feature, it was decided that this proposal was superior as a small library addition rather than a core language change.</p>
<p>The second was to add a predefined template variable, originally called <code>__enumerator__</code> in the spirit of <code>__func__</code>, but we’ll call it <code>std::enumerator_value_to_identifer</code> for sake of discussion, that would return the identifier string of an enumerator based on its value. First we stress that such a facility is trivially definable using a combination of the proposed queries. There are several reasons it was not proposed for standardization in this proposal. Consider the following enumeration types:</p>
<pre><code>enum E1 { a, b, c }; // simple 0,1,2...n-1
enum E2 { a = 42, b = 42, c = 43 }; // ambiguous values
enum E3 { a = 42, b = 420000 }; // sparse values
enum E4 { a = 1 << 0, b = 1 << 1, c = 1 << 2 }; // bit sets
enum E5 { a = 1 << 0, b = 1 << 1, c = 1 << 2, m = 0b111 }; // masks
</code></pre>
<p>There is no single best approach that can be mandated. What shall <code>std::enumerator_value_to_identifier(x)</code> return when x is ambiguous? Shall it return the identifier of the first enumerator to have the value? In what order? Shall it return a delimited string concatenated with all the identifiers of equal value? Again, in what order? And how shall they be delimited? Shall it return an array or <code>std::set</code> of those of equal value instead? Shall it fail at compile-time with an ambiguity? Shall it have an interface like <code>std::multimap</code>? What underlying data structure or algorithm should it use to do the lookup? Should the case of an enumeration with contiguous enumerator values that start from zero be dealt with seperately from ones with sparse values? What about enumerations that are used as bitsets? What about enumerations without ambiguous enumerators at all?</p>
<p>All the many alternative ways of answering these questions can all be built up from the proposed queries, as they provide the complete compile-time information about the enumerator list. We note that acceptance of this proposal would not preclude future standardization of an additional facility such as <code>std::enumerator_value_to_identifier</code> if sufficiently general answers to these questions are later found. As a first step we decided to move standardization of <code>std::enumerator_value_to_identifier</code> to a future proposal, so as not to unnecessarily delay standardization of the more general foundation primitives.</p>
<p>The next proposal that was considered was creating a set of higher-level reflection facilities above the proposed primitives (and leave the primitives as an implementation detail) that would be fully integrated with other standard library facilities. It was quickly realized that, as per <code>std::enumerator_value_to_identifier</code>, there are many different use cases and different application domains and tradeoffs. So we decided to defer such higher-level facilities to a future proposal so as not to delay the urgently needed proposed queries. We added the design goal for this proposal that the primitives should provide complete information, but should be minimal outside of that.</p>
<p>We also realized that the declaration order of enumerators may be significant for some applications and so added this information for completeness to our interface. All other orderings can be sorted from declaration order, but once declaration order is lost it cannot be recovered.</p>
<p>The next key design decision was identifier representation. As per [lex] an identifier is formed during translation phase 1-3 by mapping the source files in source encoding in an implementation-defined manner to ISO 10646 (Unicode) characters with the constraints given in [charname.allowed] and [charname.disallowed]. Characters outside the basic source character set (as with all source characters) logically encoded into univeral-character-name escape sequences. For the remainder of translation identifiers remain in this form. String literals have any universal-character-name escape sequences decoded during translation phase 5. Depending on the string literal prefix a string literal is then encoded into one of five character encodings. One of these is the implementation-defined execution character encoding, three are the standard Unicode character encodings UTF-8, UTF-16 and UTF-32, and the last is a direct encoding which is effectively equivalent to UTF-32.</p>
<p>It was proposed to provide an interface that took as a template parameter which of the five character encodings were desired. For minimality it was decided that such a facility should be defered for higher-level facilities and a single encoding would be provided, given that an encoded string in a constexpr array or string literal can be transcoded with metaprogramming during translation to any other encoding anyway - so completeness remained intact. Originally the single encoding that was considered was the implementation-defined execution encoding, but it was later realized that not all execution encodings can handle all characters (for example ASCII or Latin1), so this would violate the completeness requirement. So it was decided to choose one of the Unicode encodings. Implementations are already required to handle all of these, and all of them can encode all characters. UTF-8 was selected because as an 8-bit encoding it is endianess-agnostic, unlike UTF-16 and UTF-32, and it is the most common execution encoding anyway.</p>
<p>Because universal-character-names escape sequences in identifiers are not logically decoded in translation phase 5, it was noted that we should add this as an explicit requirement to the identifier encoding text.</p>
<p>At this point we had the complete information we wanted to provide. A list of enumerators, and for each its position in declared order, its value and its identifier encoded as described above. The identifier should be encoded in an array of chars (UTF-8 code units) and null-terminated so that it may be used as a C string or to initialize a std::string, and that array must be defined with constexpr or be a string literal so that its elements are usable within a constant expression (such that an lvalue-to-rvalue conversion is allowed on them), so the string could be worked with during translation.</p>
<p>The next step was how to expose that information from the compiler during translation. For efficient mapping from the internal compiler representation it was decided that it should be represented as a sequence in declared order. We then considered using an array to represent such a sequence:</p>
<p>Either:</p>
<pre><code> enum E {a = 42, b = 42, c = 43};
constexpr E enumeration_values[] = {a, b, c};
constexpr const char* enumeration_identifiers[] = {u8"a", u8"b", u8"c"};
</code></pre>
<p>or</p>
<pre><code> struct enumerator_info
{
E value;
const char* identifier;
};
constexpr enumerator_info[] = { {a, u8"a"}, {b, u8"b"}, {c, u8"c"} };
</code></pre>
<p>But it was decided that certain uses of such an array of pointers would cause them to require static storage, and as such they would be subject to relocation costs at load-time. For performance we did not want to unnecessarily mandate such costs.</p>
<p>What we really wanted was a way to expose pure compile-time accessors to the internal compiler data structure that holds the enumerator list, without providing a set data structure or algorithm - leaving this up to the library author, and for a later proposal for higher-level facilities.</p>
<p>To avoid the array relocation costs it was then considered to use functions:</p>
<pre><code> constexpr size_t enumerator_list_size();
constexpr const char* enumerator_identifier(size_t index);
constexpr E enumerator_value(size_t index);
</code></pre>
<p>The functions could then be backed by intrinsics, but for the later two if the index was out-of-bounds for certain uses would mandate either run-time bounds checking and error handling, or undefined behaviour.</p>
<p>So to avoid that we finally decided to make them Type Query Properties of existing form and the final interface was arrived at:</p>
<pre><code> namespace std
{
// number of enumerators in E
template<class E>
struct enumerator_list_size : integral_constant<size_t, /* ... */> {};
// identifier of I'th enumerator in E
template<class E, size_t I>
struct enumerator_identifier { constexpr char value[] = /* ... */; };
// value of I'th enumerator in E
template<class E, size_t I>
struct enumerator_value : integral_constant<E, /* ... */> {};
};
</code></pre>
<p>By making the input template parameters we can allow implementations to implement them as instrinsics that access and return information directly from the internal compiler enumerator list during translation, as per other type property queries. We added requirements that E be an enumeration type and I must be in-bounds.</p>
<p>Alternative interfaces exposing the same functionality would be possible, for example utilising the new template variable form coming in the C++14 standard:</p>
<pre><code> template<class E, size_t I> constexpr E enumerator_value;
</code></pre>
<p>But we considered keeping the API consistent with the existing style of <type_traits> more important than the (slightly) shorter access syntax. The same reasoning speaks against the longer template function syntax:</p>
<pre><code> template<class E, size_t I> constexpr E enumerator_value();
</code></pre>
<h2>Reference Implementation</h2>
<p>Here is an example library interface definition for the three property queries based on the reference implementation:</p>
<p>The <code>static_asserts</code> are shown just for exposition, the checks themselves can take place inside the intrinsics at instantiation-time:</p>
<pre><code>template<typename E>
struct enumerator_list_size : std::integral_constant<size_t, __enumerator_list_size(E)>
{
// static_assert(std::is_enum<E>::value, "E not enum type");
};
template<typename E, std::size_t I>
struct enumerator_identifier
{
// static_assert(std::is_enum<E>::value, "E not enum type");
// static_assert(0 <= I && I < std::enumerator_list_size<E>::value, "I out-of-bounds");
static constexpr decltype(auto) value = __enumerator_identifier(E, I);
};
template<typename E, std::size_t I>
struct enumerator_value : std::integral_constant<E, __enumerator_value(E, I)>
{
// static_assert(std::is_enum<E>::value, "E not enum type");
// static_assert(0 <= I && I < std::enumerator_list_size<E>::value, "I out-of-bounds");
};
</code></pre>
<p>As can be seen these are only thin wrappers around compiler intrinsics (that for example in Clang directly look up simple clang AST EnumDecl node properties). The intrinsics only accept compile-time constants and do not generate any runtime code.</p>
<h2>Use Cases</h2>
<p>An incomplete list of use cases for the three properties follows. All uses listed can be generated during translation with metaprogramming, and can be used during translation or at run-time. This can be shown to be true as the properties provide the complete semantic information available from the source syntax of <em>enumerator-list</em>.</p>
<p>Given an enumeration type E, write a generic library facility that will:</p>
<ul>
<li>Find the minimum and maximum enumerator (either of an enumerator, or the range from the closure of bit operations).</li>
<li>Iterate over the enumerator identifiers.</li>
<li>Iterate over the enumerator values.</li>
<li>Count the number of enumerators.</li>
<li>Count the number of distinct enumerator values.</li>
<li>Count the number of enumerators with a specific value.</li>
<li><code>static_assert</code> that the enumerators are declared in alphabetical order.</li>
<li><code>static_assert</code> that the enumerators are declared in an order dependant on their name.</li>
<li><code>static_assert</code> that the enumerators are declared in an order dependant on their value.</li>
<li><code>static_assert</code> that the enumerator identifiers match a certain naming convention.</li>
<li><code>static_assert</code> that the relationship between the enumerator identifiers and their values fit a constaint.</li>
<li><code>static_assert</code> any constaint on the relationship between enumerator name, value and declared order.</li>
<li>Sort the enumerators identifiers/values in any order.</li>
<li>Lookup the set of enumerators identifiers with a given enumerator value, returned in any encoding or form.</li>
<li>Lookup the value of an enumerator given a string of its identifier in any encoding.</li>
<li>Determine the relationship between enumerator values (sparse, dense) and create an efficient jump table or associative data structure.</li>
<li>Create a jump table or associative data structure based on enumerator identifier (any encoding), or some function of the identifier text.</li>
</ul>
<h2>Example Usage</h2>
<p>As for the existing properties <code>std::rank</code> and <code>std::extent</code>, the individual properties can be expanded into a pack or an array with an index sequence pack (<code>std::make_index_sequence</code>, see N3658). For example:</p>
<pre><code>template<class E, class IdxSeq>
struct enumerator_data_t;
template<class E, size_t... Idx>
struct enumerator_data_t<E, std::index_sequence<Idx...>> {
static constexpr size_t count = std::enumerator_list_size<E>::value;
static constexpr E values[] = { std::enumerator_value<E, Idx>::value... };
static constexpr char const *names[] = { std::enumerator_identifier<E, Idx>::value... };
};
template<class E>
using enumerator_data_t = enumerator_data_t<E, std::make_index_sequence< std::enumerator_list_size<E>::value >>;
template<typename E, size_t... Idx>
constexpr E enumerator_data_t<E, std::index_sequence<Idx...>>::values[];
template<typename E, size_t... Idx>
constexpr char const * enumerator_data_t<E, std::index_sequence<Idx...>>::names[];
// simple usage: list all enumerator in their order of definition
enum class E1 { first = 99, second = first, third };
using E1D = enumerator_data_t<E1>;
for (int i=0; i<E1D::count; i++)
std::cout << E1D::names[i] << " = " << (int)E1D::values[i] << std::endl;
</code></pre>
<p>This builds an intermediate-level interface comparable to e.g. Java’s Class.getEnumConstants(). From here constexpr functions can be written to operate on the enumerator data:</p>
<pre><code>// determines the largest numerical value that is defined by the enum
template<typename E>
constexpr E enum_max_value() {
using ED = enumerator_data_t<E>;
using UT = std::underlying_type_t<E>;
static_assert(ED::count>0, "maximum not defined on empty enum");
E ret = ED::values[0];
for (size_t i=1, e=ED::count; i<e; ++i)
if (static_cast<UT>(ret)<static_cast<UT>(ED::values[i]))
ret = ED::values[i];
return ret;
}
</code></pre>
<p>Having the defined value range of an enumeration at hand allows creating wrappers around std::bitset or std::array taking the enum type as the key. These could be more efficient for certain applications, constexpr enabled replacements for std::set/map for certain tasks.</p>
<p>Certain traits regarding the enum can then be determined quite easily with a similar algorithms:</p>
<pre><code>// determines if the enumerators of E follow the default valuing
// if yes, then calculating the enumerator index from an value of E is trivial
template<typename E>
constexpr bool is_trivially_valued_enum() {
using ED = enumerator_data_t<E>;
using UT = std::underlying_type_t<E>;
for (size_t i=0, e=ED::count; i<e; ++i)
if (static_cast<UT>(ret) != static_cast<UT>(i))
return false;
return true;
}
</code></pre>
<p>… which could be used to check correct usage of a very simple value-to-identifier conversion function:</p>
<pre><code>template<typename E>
constexpr const char *enum_value_to_str(E val) {
static_assert(is_trivially_valued_enum<E>(), "only implemented for simple enums");
size_t idx = static_cast<size_t>(val);
if (idx >= std::enumerator_list_size<E>::value)
throw std::out_of_range("invalid enum value");
return enumerator_data_t<E>::names[idx];
}
</code></pre>
<p>Of course, more universally applyable lookup algorithms can also be implemented. One option would be simply using a std::map or std::unordered_map for lookup, another to perform a binary search on a sorted array. Creating a lookup table for enumerations with a small value range should deliver performance comparable to compiler generated switch statements. An automatic selection of the most appropriate algorithm for a given enum type could also be implemented given the right support traits.</p>
<p>Having good lookup method(s) in place would enable treating any enum as an ordinal type. The bitset/array wrappers mentioned above could apply this transformation instead of using the enum value directly to get a (for any given enum type) gapless index range.</p>
<p>Performing a reverse lookup from an identifier string to the associated value is of course also possible. For this task, specific requirements will probably vary widely between applications (case (in)sensitive comparision, adding/removing pre-/post-fixes, fuzzy search etc.) But all of these are implementable given the proposed interface.</p>
<p>While some of the advanced functionality mentioned in this section might be of general enough use to potentially warrant standardization, the specifics of such APIs will take longer to establish existing practice, and so should not delay initial adoption of the general underlying interface proposed here.</p>
<h2>Acknowledgements / References</h2>
<p>We would like to thank David Krauss and Thiago Macieira for valuable initial feedback on this proposal.</p>
<p>A reference implementation of the proposed property queries is part of a clang branch (containing additional intrinsics for other reflection tasks) that can be checked out from github: https://github.com/ChristianKaeser/clang-reflection</p>