Add a tool to generate account mapping #34
FYI @mikemccand @dweiss
Thanks @mocobeta! I was wondering what to pass as the account mapping as I ran the tooling ;) Today all of my migrated issues are commented / opened by

Given how important this mapping file is, maybe we should 1) commit this PR and iterate on it further in future PRs, and 2) commit the mapping file, so all of us can scrutinize it, maybe correct / insert our own mappings, etc.?

Once we do the migration, the mapping is burned into the GitHub issues, so we really want to try to account for everyone. So I think we should treat this file as vital source code?
As for the "author" of each GitHub issue/comment, we won't be able to preserve/migrate the original Jira author. The author will be the caller's account. Please see #4 for the details. In short, the author of all issues/comments will be an Infra account.
Ahh OK got it. So it's the @-calls inside issues that we will replace with the corresponding GitHub id? If we do check in the account mapping file, I suggest we break it into two sections: unverified and verified. This tool will put them all as unverified to start? And those of us who "know" (or just for our own mapping) can commit a change to move an entry to the verified section?
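To make the suggestion above concrete, here is a minimal sketch of how a mapping file with verified and unverified sections could drive mention replacement. It is not the tool in this PR: the `ACCOUNT_MAP` layout, the sample account names, and the `replace_mentions` helper are all hypothetical. The only outside fact it relies on is that Jira wiki markup writes user mentions as `[~username]`.

```python
import re

# Hypothetical mapping layout: Jira user name -> GitHub account, split into
# "verified" and "unverified" sections. All names here are made up.
ACCOUNT_MAP = {
    "verified": {"jdoe": "jdoe-gh"},
    "unverified": {"asmith": "maybe-asmith"},
}

# Jira wiki markup writes user mentions as [~username].
JIRA_MENTION = re.compile(r"\[~([^\]\s]+)\]")


def replace_mentions(text: str, account_map: dict) -> str:
    """Rewrite Jira mentions as GitHub @-mentions, for verified accounts only."""
    verified = account_map.get("verified", {})

    def _sub(match: re.Match) -> str:
        github_id = verified.get(match.group(1))
        # Keep the original Jira mention when there is no verified mapping,
        # so an unverified guess never pings the wrong GitHub user.
        return f"@{github_id}" if github_id else match.group(0)

    return JIRA_MENTION.sub(_sub, text)


print(replace_mentions("Thanks [~jdoe] and [~asmith]!", ACCOUNT_MAP))
```

Reading only the verified section means that moving an entry from unverified to verified is the single reviewable change that turns a guess into a real @-mention.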
Yes, I'll run the tool and commit a result file.
I committed a candidate mapping file (without any manual checks/editing). e336bdc
would effectively extract committers' accounts.
As other possible clues, we could
This is the list of accounts that have push access to apache/lucene (i.e., committers' accounts). There are 95 committers in total according to this page, so 25 people have not associated their GitHub accounts with their ASF/Jira accounts. This means we can't assume that committers' GitHub accounts have push access to the apache/lucene repo on GitHub, although I think they should have write access to Apache's GitBox repo.
For verification, I'll do
For accounts that do not satisfy the above criteria, I would just omit them. There will be some false negatives (for example, Jira issue reporters are omitted if their possible GitHub accounts were not recorded in the commit history). I'd prioritize avoiding false positives.
Wow, the mapping file is massive! 5,793 developers. We've had so many contributors over the years ;) Inspiring.
+1
Here are the regenerated candidate mapping and the verified mapping (using the above criteria).
#3
This adds a helper tool to create a mapping file from Jira users to GitHub accounts; the file is used in the "Convert Jira issues to GitHub issues" step.
We could do this in a smarter way, but I would start with this.
NOTE: there are 2200+ committers and contributors in Jira (this number includes duplicates, since some people seem to have multiple Jira accounts).