Work already done

So far I have been working in two directions: taking data from Wikimedia about EuropePMC, and looking at ways to make data held by EuropePMC accessible to the Wikimedia community.

For the former I have been using the excellent mwcites utility created by Aaron Halfaker (and some output he kindly generated with it). I have been doing some simple analysis of the PMCIDs it found in English Wikipedia dumps and making some rather nice graphs with plot.ly.
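To give a flavour of the kind of simple analysis I mean, here is a minimal sketch that counts PMC citations per year. It assumes the mwcites output is a tab-separated file with the columns page_id, page_title, rev_id, timestamp, type and id, and that timestamps start with a four-digit year. The sample rows are made up for illustration and are not real results.

```python
import csv
import io
from collections import Counter

# Hypothetical sample in the assumed mwcites TSV layout; not real data.
SAMPLE_TSV = (
    "page_id\tpage_title\trev_id\ttimestamp\ttype\tid\n"
    "1\tExample A\t100\t2009-03-01T12:00:00Z\tpmc\t1234567\n"
    "2\tExample B\t101\t2009-07-15T08:30:00Z\tdoi\t10.1000/example\n"
    "3\tExample C\t102\t2012-11-02T17:45:00Z\tpmc\t7654321\n"
    "3\tExample C\t103\t2012-12-24T09:10:00Z\tpmc\t1111111\n"
)


def pmc_citations_per_year(tsv_file):
    """Count rows of type 'pmc', grouped by the year the citation first appeared."""
    reader = csv.DictReader(tsv_file, delimiter="\t")
    counts = Counter()
    for row in reader:
        if row["type"] != "pmc":
            continue  # skip DOIs, ISBNs and other identifier types
        year = int(row["timestamp"][:4])  # assumes an ISO-style timestamp
        counts[year] += 1
    return dict(sorted(counts.items()))


counts = pmc_citations_per_year(io.StringIO(SAMPLE_TSV))
print(counts)
```

A dictionary of year to count like this is easy to hand to whatever plotting tool you prefer.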

While trying to annotate this work I discovered that some fraction of the citations found by the utility were not correct (the IDs did not resolve as PMCIDs), which needs further investigation. The fraction may be very small or it may be non-negligible; I do not know yet.
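A cheap first step in that investigation could be a format check, sketched below. It assumes a PMCID is the prefix "PMC" followed by a positive integer with no leading zero, and that extracted IDs may arrive with or without the prefix. The function names are my own, and passing this check does not mean an ID actually resolves; that still needs a lookup against EuropePMC, which is not shown here.

```python
import re

# Assumed shape of a PMCID: "PMC" followed by digits, no leading zero.
PMCID_PATTERN = re.compile(r"^PMC[1-9][0-9]*$")


def normalise_pmcid(raw):
    """Return the ID in 'PMC123' form, or None if it cannot be a PMCID."""
    candidate = raw.strip().upper()
    if candidate.isdigit():
        candidate = "PMC" + candidate  # bare numbers get the prefix added
    return candidate if PMCID_PATTERN.match(candidate) else None


def split_candidates(raw_ids):
    """Separate plausible PMCIDs from strings that are clearly malformed."""
    plausible, malformed = [], []
    for raw in raw_ids:
        pmcid = normalise_pmcid(raw)
        if pmcid is None:
            malformed.append(raw)
        else:
            plausible.append(pmcid)
    return plausible, malformed


# Made-up inputs for illustration.
plausible, malformed = split_candidates(["1234567", " pmc42 ", "PMC", "12a4"])
print(plausible, malformed)
```

Anything in the malformed list can be counted straight away, which leaves only the plausible IDs to be checked properly.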

I have also been writing a script using pywikibot to push data held by EuropePMC into a Wikibase repository (Wikibase is the software that runs Wikidata).
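The rough shape of such a script is sketched below. This is not the actual script, just an illustration under assumptions: the record fields ("title", "pmcid") are hypothetical, the property ID for the PMCID has to be supplied because it depends on the target repository, and pywikibot must already be installed and configured for that repository.

```python
def build_labels(record, language="en"):
    """Turn one record into the labels mapping passed to editLabels."""
    return {language: record["title"].strip()}


def push_record(repo, record, pmcid_property):
    """Create a new item on a Wikibase repository and attach the PMCID to it."""
    import pywikibot  # imported lazily so build_labels works without pywikibot

    item = pywikibot.ItemPage(repo)  # no ID given, so the first edit creates the item
    item.editLabels(labels=build_labels(record),
                    summary="Add label from EuropePMC record")
    claim = pywikibot.Claim(repo, pmcid_property)  # property ID is repository-specific
    claim.setTarget(record["pmcid"])
    item.addClaim(claim, summary="Add PMCID from EuropePMC record")
    return item


# Hypothetical record; the field names are assumptions, not the EuropePMC schema.
record = {"title": "  An example article title ", "pmcid": "1234567"}
labels = build_labels(record)
print(labels)
```

With a configured site, the repository object would come from something like pywikibot.Site().data_repository(), and push_record would then be called once per record. It is worth trying this on a test repository first, since every call writes to the wiki.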

A Classic First Post

As you’d imagine for any new blog, I’m publishing that most stereotypical of posts: the first post.

Here I discuss what this blog is about.

I’ll be talking here about a variety of things, but principally about the work I am currently undertaking at the EBI, where I am an intern/trainee in the EuropePMC group run by Jo McEntyre.

I’m mainly looking at making links and collaborations between EuropePMC and the Wikimedia movement. In particular, I am looking at ways to make some of the data held by EuropePMC accessible through Wikidata, and at analysing the prevalence of papers cited in Wikipedia that are held by EuropePMC.