Interesting. The point 2 article by van Dijk seems decent. Figure 1B says that the impact factor of journals, volume of publications, and cites/h-index are all fairly predictive. University rank gets some independent weighting (among 38 features, as shown in their supplementary Table S1), but not much.
Looks like although the web version has gone offline, the source code of their model is still online!
I strongly agree with Ryan that success is to a relatively large degree predictable, as can be done in the PCA decomposition of point 2 above, figure 1C. I think it would be very valuable to have such a model, but the current code is only for biology (the impact factor will fail for instance for anything different). If one wanted to fit a model to predict it, it could probably use google scholar and arxiv, but the trickiest part would be to recover the position of those people (the target), which may partially be done using google scholar.
Interesting. The point 2 article by van Dijk seems decent. Figure 1B says that the impact factor of journals, volume of publications, and cites/h-index are all fairly predictive. University rank gets some independent weighting (among 38 features, as shown in their supplementary Table S1), but not much.
Looks like although the web version has gone offline, the source code of their model is still online!
I strongly agree with Ryan that success is to a relatively large degree predictable, as can be done in the PCA decomposition of point 2 above, figure 1C.
I think it would be very valuable to have such a model, but the current code is only for biology (the impact factor will fail for instance for anything different). If one wanted to fit a model to predict it, it could probably use google scholar and arxiv, but the trickiest part would be to recover the position of those people (the target), which may partially be done using google scholar.
I just posted another article I found on average publication rates in Norway for different positions, ages, fields and gender.