Plans for the first week of december

This week I will be my 4th week at EuropePMC and I hope to achieve a huge amount.

I have two strands to the work I’ll be doing; 1) taking data from wikimedia community about EuropePMC and the papers contained within it. 2) Taking data from EuropePMC and try to make it more available to the wikimedia community.

I shall be further analysing the mwcites data; particularly trying to resolve the ID’s created therein. This will be done by revamping the crude epmclib utility I wrote earlier this month so that by default it caches data downloaded and then using it to resolve everything found. It may even be integrated into mwcites to do this automatically on analysing the dumps. I’ll have to have a think about it.

Hopefully I can then make a good judgement as to if the majority of the citations found are legitimate. If so I’ll then make some nice annotated plot.ly graphs of which citations were first cited when on wikipedia.

I’ll also be trying to get a working, continually updated sparql endpoint for librarybase working so that it can be quickly queried. If this is possible I should be able to finish work on the pywikibot script such that it can start putting all PMCIDs  and PMIDs that appear on wikipedia (according to mwcites) into librarybase.

Finally after discussions on Friday with Joe Wass from crossref I hope to perhaps roll out a live feed of citations from wikipedia recent changes that contain PMIDs/PMCIDs.

Leave a Reply

Your email address will not be published. Required fields are marked *