RSS in, RSS out. Experimenting with WordPress for scholarly publishing

My presentation for the RSP event: Doing it differently. No slides, just a live demo using the outline below.

1. WordPress is an excellent feed generator:

http://joss.dev.lincoln.ac.uk/2009/04/15/addicted-to-feeds/

2. It's also an excellent, personal, scholarly CMS

http://joss.dev.lincoln.ac.uk/2009/08/25/scholarly-publishing-with-wordpress/

3. If you have an RSS feed, you can create other document types, too

http://joss.dev.lincoln.ac.uk/2010/01/04/creating-a-pdf-or-ebook-from-an-rss-feed/

4. We conceived a WordPress site as a document (and a WordPress
Multisite install as a personal/team/dept/institutional multi-document
authoring environment)

http://jiscpress.dev.lincoln.ac.uk
http://jiscpress.org

5. Here's my MA Dissertation as a WordPress site using digress.it

http://tait.josswinn.org/

6. WordPress allows you to perform certain actions on feeds, such as
reversing the post/section order

http://tait.josswinn.org/feed/?orderby=post_date&order=ASC

7. EPrints allows you to 'capture' data from a URI

http://eprints.lincoln.ac.uk/2004/

8. Suck it into your feed reader, for storage/reading - it's searchable
there, too.

https://www.google.com/reader/view/feed/http://eprints.lincoln.ac.uk/2004/2/index.html%253Forderby%253Dpost_date%2526order%253DASC

9. And use another service to create an ebook or PDF version

http://www.feedbooks.com/news

10. RSS. Loosely joined services:

Author: WordPress -->
                   Preserve: EPrints -->
                                        Read: GReader
                                              Feedbooks
                                              etc...

11. p.s. How about using EPrints to drive a WordPress site, too? Why extend a perfectly good preservation and storage application to include web 2.0 features, when it can be used to populate a cutting edge CMS with repo data?

Total ReCal (or Calendar Combiner C^2). A proposed JISC Rapid Innovation project

I’ve just submitted this funding proposal to JISC, under their Flexible Service Delivery programme call. As usual, I’m keen to share bids sooner rather than later, whether they are successful or not. Go here for the full bid or just read the summary below. Comments always welcome. Thanks.

Building on a university-wide initiative to improve collaborative, undergraduate research, this student-driven project will discuss, document and develop API plugins for a number of common corporate applications in the HE sector. The plugins will expose space-time data in an open, standardised format that can then be queried and aggregated by a student-centred calendaring service, which will also be developed during the course of the project.

The work undertaken by the project will improve the student experience by providing end-users with a cutting-edge, centrally supported calendaring service driven by existing aggregate services at the University of Lincoln. The plugins, full documentation and further libraries and code examples for the service will be offered to the JISC community for use by their own institutions.

UPDATE: I’m pleased to say that this funding application was successful. 🙂

Short URLs for the university

Nick Jackson, a computing student and part-time member of ICT staff, has developed lncn.eu, a University of Lincoln branded service for shortening links. Read all about it here. Be sure to read the comments to get a good idea of how he intends to improve the service.

This little project was in response to reading Bernie’s blog post about how a student wanted to use short links in his dissertation.

I see that Nick’s in contact with the Internet Archive’s 301Works to guarantee long-term sustainability of the links, too.

ePub downloads from EPrints

I’m at JISC’s #dev8D conference. There’s no end of developer challenges but I’m not a developer. Still, here’s an idea that maybe someone will pick up and run with:

The use of eBook readers is on the rise. Anyone with an iPhone, Android phone, as well as Kindles and Sony Readers, has an eBook reader.

Institutional Repositories provide scholarly articles in PDF format, which eBook readers don’t handle very well at all, especially the phone versions.

Why not provide a Word-to-PDF conversion facility in your repository? EPrints currently offers Word-to-PDF conversion durinng the deposit process. Why not Word-to-ePub format, too?

Why not provide an ePub file as an alternative to the PDF download? ePub is a free, open, standards-based (XHTML/CSS) file format for eBook Readers. There are many advantages for the reader to having an ePub version rather than a PDF version when using an e-Book reader. i.e. better page navigation, search, bookmarks, variable font sizing.

There are PDF-to-ePub converters on the web, so technically it’s possible. They are  a bit hit and miss, but so are the Word-to-PDF converters.

Anyone interested? I’d be keen to help if required.

Displaying a dynamic publications list from a repository on a staff profile page

I repeat this to people all the time. If I write it down here, then I only have to share a link 😉

RSS feeds are a very popular way of syndicating content from one source website to another subscribing website.

Some university websites, such as the Institutional Repository or University blogs, produce RSS feeds but not all university websites can easily subscribe to them. However, by using ‘feed2js’, any website can display a syndicated news feed in just a few steps. This way, you can embed your blog or publication list in Blackboard or on your personal web profile, for example.

Creating a publications list from the repository

We use EPrints as our Institutional Repository. EPrints provides news feeds (RSS, RSS2, Atom) for every search query. Therefore you can create a news feed of publications by Faculty, School, Department, Research Team or Staff member. Having created the news feed, you can then display that list of publications on any web page of your choice.

An example staff profile using Feed2JS
Click on the image to see a real example

The advantage of this is that every time you deposit something new in the repository, the list will automatically update on your chosen web page. You never need to edit your publications list again.

Steps to embedding your feed

Create your publications list. Use the Advanced Search page to construct your publications list. If you want a personal publications list, simply search for your name. If you have a common name, your search may return publications that belong to someone else. In that case, you should keyword all your repository items with a unique ‘key’, such as ‘q73g’. You can then search for that keyword and your name and only your items will be returned by the search.

Search results

Copy your feed URL. Typically, you need to right-click on the orange RSS 2.0 icon on the search results page and copy the link.

Go to http://feed2js.org/index.php?s=build and paste your link into the URL box. If you are a member of the University of Lincoln, contact me for a better link, hosted at the university.


From this point on, you can click the ‘Preview Feed’ button at any time to see what your feed will look like. Read the listed options carefully. They allow you to choose whether you wish to display the title of the feed; whether you wish to show the full content of the feed or just the titles; whether you wish to show images or video content in the feed (if there is any in the original source), etc. Experiment by previewing the feed to see what looks best for you.

Previewing a feed
Previewing a feed

When you are happy with your feed, click the ‘Generate Javascript’ button. Copy everything inside the Get Your Code Here box. Note how the box scrolls. Copy it all!

Example generated javascript
Example generated javascript

Paste the javascript into the appropriate place in your website’s HTML code. Save your web page and examine your work. The embedded feed should fit in well with your existing web site design and use the colour scheme you have chosen for your site. If you wish to make the publications list stand out from your web page, you should read the page about dressing up your output.

There is no more you need to do. The feed will automatically update every hour or so with any new content from the source website.

Feeding WordPress with EPrints: A Social Repo?

I’ve just knocked up a ‘Social Repo‘ site and would be keen to get some feedback on the general idea.

It’s a WordPress site in microblog mode driven by feeds from our repo via the FeedWordPress plugin. Just an experiment in automating something similar to our Post2Blog plugin.

As a way of making EPrints content more ‘social’, I thought that specific subject feeds from different IRs could be aggregated into a single subject site where interested people could follow and comment on the research outputs.

I’m a fan of aaaargh.org which is a site where people share hard-to-obtain texts, mostly academic level material and largely related to critical, social theory. There’s a discussion board attached to it, too. No-one really controls it and it’s a great way of finding hard to obtain texts 🙂

Along loosely similar lines, I was thinking earlier that IRs could aggregate their feeds into a site, like my example, that provided a way to search, filter and discuss the source research outputs. If there was a site that aggregated feeds from IRs around the world, pulling in only content relating to critical, social theory, for example, had a twitter account attached, too, as well as useful RSS feeds of its own, I’d be keen to follow it and contribute to the discussion of work as it appeared and looked of interest.

I can imagine that some texts could spark quite detailed threaded discussions.

One way to improve my quick example would be to show the EPrints abstract in the post content below the citation. Alas, that’s not in the source EPrints feed right now. I would also make a few tweaks to the theme so that the permalinks didn’t all point to the source record, but that the source link was clearly provided.

The plugin that we created for the JISCPress project could provide a background service to create semantic tags and do term extraction on the abstract, to automate keywords for each item. Crikey! we could even use the other Linked Data plugin we developed and push the RDF to the Talis Platform, aggregating Linked Data around subject feeds from Institutional Repositories.

I’m sure I can think of more improvements, but as a 30 min exercise, I’ve found it interesting. I think that once a Repo record becomes joined to a WordPress record, it’s got a lot more going for it in terms of added levels of interaction and malleability. Any thoughts?

OPACPress: Our Talis Incubator proposal

Yesterday, I submitted a proposal to Talis under their Incubator fund. If successful, I would have the pleasure of working with Paul Stainthorp, E-Resources Librarian at the University of Lincoln, and Casey Bisson,  Information Architect at Plymouth State University. The bid is to develop an idea which I’ve posted about before, based on Casey’s work on Scriblio and our adventures with WordPress MU, in particular, JISCPress.

Anyway, rather than re-iterating the bid here. You can read it in full by clicking here.

Comments are very welcome. Thanks.

UPDATE: We made it into the second round of judging but were unsuccessful in the end. Here’s the useful and fair feedback we received.

  • like the idea and how, like the Moodle repository, it can help open up existing content through data sharing. The same question as for others remains of how and why institutions would subscribe to the service.
  • I like this but I think it significantly underestimates the IP issues around library catalogue records which has been a major stumbling block for other activities in this area. That said, I think it is worth taking forward at this stage. The team looks very strong.
  • Ambitious in scope and technology, but /feels/ right for the innovative approach of this fund.
  • “Imagine that a significant number of UK universities and colleges… chose to make use of such a platform.” This type of language frightens me, indicating that they have no partnerships established, where other proposals already do. The point on issues with catalog records (above) should not be overlooked.
  • The use cases won me over. Not without risks ( as they say) and some major challenges
  • this one strikes me as particularly promising, because it has such strong ties to UK institutions and could connect to things Talis does

Creating a PDF or eBook from an RSS feed

This morning, I found myself on Baseline Scenario, a well-known site which discusses the economic crisis. I noticed that the authors of the site had laboured over producing a PDF version for each month of their archive, by copying and pasting to Word and producing a PDF. There’s a nicer way of doing this, I think. When you’ve done it once, it should take you no more than ten minutes to go through the whole process any other time.

  1. WordPress provides a way to filter content by date. In our example, we’ll grab the RSS feed from the first month of publications: http://baselinescenario.com/2008/09/feed The permalink structure is clear enough on WordPress. For Blogger, it’s nowhere near as intuitive.
  2. The feed will display the articles in descending date order. When you are reading the PDF or eBook version, you don’t want to read the last article first, as you would on the website. To reverse the order of the feed, use Yahoo Pipes (or for WordPress, see @mhawksey’s comment below). You can clone my example. If you’ve not used Yahoo Pipes before, don’t worry. You just need a Yahoo account. The example I give is as simple a pipe as you will see and should make sense as soon as you look at it.
  3. Once you’ve created the pipe of the feed in ascending order, save and run the pipe. Look for the RSS icon and copy the pipe’s RSS link, which should look like this: http://pipes.yahoo.com/pipes/pipe.run?_id=cb438b51b2819eb1f4f5ec6f10daf09e&_render=rss
  4. Next, go to FeedBooks. Sign up for an account if you don’t already have one. Now, we create a Newspaper.
  5. Click on News in the menu and then on Create a newspaper. Give it a name and tag it. In our example, we’ll call it Baseline Scenario Archive.
  6. Click on ‘Add a RSS feed’. Give it a name (in our case ‘September 2008’)  and paste your RSS feed into the box. Once it’s found and accepted your feed, click ‘Publish’.
  7. You can now click on the name of the specific feed and you’ll be presented with a page that offers an ePub, Kindle and PDF versions of your feed. Here’s the Baseline Scenario September 2008 example.
  8. That’s it. You can do it with whole sites, too, if you like. Here’s one I did earlier (Blogger). The only thing you need to remember is to ensure that the RSS feed contains all the items you’re looking for. For the Blogger site, the source feed looks like this: http://www.blogger.com/feeds/27481991/posts/default?max-results=1000 A thousand items is more than enough to capture this site for quite some time. For WordPress, the site owner has to change their Reading Settings to include sufficient items. For the Baseline Scenario, they need to set this at a number high enough to ensure that a month’s worth of posts are included. I would just set it at 3000 and then forget about it. It would mean the entire site could be captured this way for the next year or so.

Having problems? Got a question? Leave a comment.