Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider other ways of constructing name queries with middle names #1081

Open
peetucket opened this issue Jun 11, 2019 · 2 comments
Open

Consider other ways of constructing name queries with middle names #1081

peetucket opened this issue Jun 11, 2019 · 2 comments
Labels

Comments

@peetucket
Copy link
Member

We have tried to write our queries to be as inclusive as possible for as many users as possible, but the algorithm doesn't always work out. For example, Dr James Chen has data like this:

Primary identity = James K. Chen
Alternate identity = J K Chen

We then produce all of the variants of "Last, First", "Last, First, Middle" and "Last, First, Middle Initial" for all of the entered names (just two in his case) and "or" them together. For him, this looks like: "Chen,James" OR "Chen,James,K." OR "Chen,James,K" OR "Chen,J" OR "Chen,J,K"

If we were to skip the "Last, First" variant (i.e. dropping the middle name), this would miss cases were people have entered a middle name in their profile, but then don't publish with it.

For example, "Jackson, Samuel Paul" may publish under "Jackson, Samuel" … so to get these publications, we would need a variant that doesn't include the middle name or else we'd miss those publications.

In cases like Professor Chen, since he had to use "J" as his first name in the alternate identity, our algorithm then adds this variant without the middle initial.

One possibility is to not add this variant without the middle name for just the alternate identities (but leave it for the primary identity). This may still present cases where the user is better served by it.

@peetucket
Copy link
Member Author

Other ideas:

  • conditionally skip the search that removes middle name / initial when authors have specified the middle initial specifically in their pub import settings.
  • adding a flag in the settings in Profiles that the user can check if they want the pub import / harvest to be limited to the specified name and then alter the search criteria to not add/remove variants of the name

@peetucket
Copy link
Member Author

Notes from Grace:

WoS does not use commas in the author index and also only uses initials for first and middle name.   Only one citation had his middle initial.   So here’s the format you should use:
 
AU=(McKeown N OR McKeown NW) AND (AD=Stanford Univ OR OG=Stanford University)

@peetucket peetucket changed the title Consider other ways of constructing name queries Consider other ways of constructing name queries with middle names May 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant