Picking 'random' numbers which will always be the same for a given event? #682

patrickbryant · 2022-06-03T13:46:32Z

In the C++ version of my analysis code there are a few spots where we make use of random numbers. To ensure the results are always the same for a given event, we set the generator seed using the event number. Is there a fast/efficient way to do this with numpy arrays? It seems bad to generate the array one event at a time in a for loop.

Something like:

np.random.uniform(0, 1, size=len(events), seed=events.event)

From what I can find, this functionality does not exist in numpy. Perhaps what I actually want is a hash of the event number, where the hashing function samples the uniform distribution.
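A minimal sketch of the "hash of the event number" idea, assuming the event numbers are available as a flat integer array. The mixing function is the SplitMix64 finalizer, chosen here for illustration only; `event_uniform`, `_mix64`, and the `salt` parameter are hypothetical names, not part of numpy or any other library.

```python
import numpy as np

def _mix64(x):
    """SplitMix64 finalizer applied element-wise to a uint64 array."""
    # uint64 array arithmetic wraps modulo 2**64, which is what the mixer needs.
    x = x + np.uint64(0x9E3779B97F4A7C15)
    x = (x ^ (x >> np.uint64(30))) * np.uint64(0xBF58476D1CE4E5B9)
    x = (x ^ (x >> np.uint64(27))) * np.uint64(0x94D049BB133111EB)
    return x ^ (x >> np.uint64(31))

def event_uniform(event, salt=0):
    """Map event numbers to deterministic floats in [0, 1).

    `salt` (an assumption of this sketch) selects an independent stream,
    e.g. one per place in the analysis that needs a random number.
    """
    x = np.atleast_1d(np.asarray(event)).astype(np.uint64)
    h = _mix64(_mix64(x) ^ np.uint64(salt))
    # Keep the top 53 bits so the result is an exactly representable float64.
    return (h >> np.uint64(11)).astype(np.float64) * (1.0 / (1 << 53))

events = np.array([1, 2, 3, 1000000007], dtype=np.int64)  # made-up event numbers
u = event_uniform(events)
```

Because each value depends only on the event number and the salt, the result does not depend on how the file is split into chunks or on the order of events. This is a hash, not a vetted random number generator, so its statistical quality should be checked before relying on it.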


andrzejnovak · 2022-06-03T14:18:07Z

Generate a hash from filename:start:stop and use that for the seed?
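One way this could look, as a sketch: `chunk_seed` is a hypothetical helper, and the file name and entry range are made up. It uses `hashlib` rather than Python's built-in `hash()`, because string hashes from `hash()` change between interpreter runs unless `PYTHONHASHSEED` is fixed.

```python
import hashlib
import numpy as np

def chunk_seed(filename, start, stop):
    """Derive a reproducible 64-bit integer seed from a chunk description."""
    key = f"{filename}:{start}:{stop}".encode("utf-8")
    digest = hashlib.sha256(key).digest()
    return int.from_bytes(digest[:8], "little")

# Hypothetical chunk: entries [0, 100000) of a file called events.root.
seed = chunk_seed("events.root", 0, 100_000)
rng = np.random.default_rng(seed)
u = rng.uniform(0.0, 1.0, size=100_000)
```

The numbers are reproducible for a given chunk, but the value assigned to a particular event depends on the chunk boundaries.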

patrickbryant · 2022-06-03T14:27:51Z

Yeah, I was thinking that, but I can imagine scenarios where I end up running the same file with different chunk sizes.

NJManganelli · 2022-06-03T14:58:32Z

Perhaps the performance cost would not be too bad if you used, e.g. in CMS, the luminosity block as the seed for the random generator and filled an array via masking/where. I don't know if I ever checked whether that varies in CMS MC, though; the lumi… run is always 1 IIRC. Is there anything else that might be common across 1000s of events but not dependent on chunking/skimming/etc.?
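A rough sketch of the luminosity-block idea, assuming flat integer arrays of lumi block and event number. `uniform_by_lumi` is a hypothetical name, and sorting by event number inside each block is an extra step added in this sketch so that the result does not depend on the order of events in the array.

```python
import numpy as np

def uniform_by_lumi(lumi, event):
    """One generator per luminosity block, filled into the output by index."""
    lumi = np.asarray(lumi)
    event = np.asarray(event)
    out = np.empty(lumi.shape, dtype=np.float64)
    for block in np.unique(lumi):
        idx = np.flatnonzero(lumi == block)
        # Assign draws in order of event number (assumption of this sketch).
        idx = idx[np.argsort(event[idx], kind="stable")]
        rng = np.random.default_rng(int(block))
        out[idx] = rng.uniform(0.0, 1.0, size=idx.size)
    return out

lumi = np.array([1, 1, 2, 2, 1])        # made-up lumi blocks
event = np.array([10, 11, 20, 21, 12])  # made-up event numbers
u = uniform_by_lumi(lumi, event)
```

The Python loop runs once per block rather than once per event. An event only gets the same number if the same set of events from its block is present, so a block split across chunks, or a skim that drops events, changes the result.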

Nick M

nsmith- · 2022-06-22T15:29:46Z

A similar issue came up with JER smearing as discussed in #454.
The approach is still chunk-splitting-sensitive. We'll need to add a feature to correctionlib to do this anyway, so perhaps that can be stolen/reused: cms-nanoAOD/correctionlib#130

lgray · 2023-12-06T23:53:49Z

This appears answered. Re-open if not.

nsmith- · 2023-12-19T13:33:42Z

Yes, correctionlib includes a facility to generate deterministic pseudorandom data. See https://cms-nanoaod.github.io/correctionlib/correctionlib_tutorial.html#Resolution-models for an example.

patrickbryant added the "question" (Further information is requested) label Jun 3, 2022

lgray closed this as completed Dec 6, 2023
