fringelogo-2017-justlogo

 

tl; dr

Repository Fringe is a gathering for repository managers and others interested in research data repositories and publication repositories.

I collected an archive of #rfringe17, containing 1118 Tweet IDs. I then analysed the text in the tweets with Voyant Tools to identify most frequent terms and manually refined the results to 230 terms.

I collected an archive of #rfringe17 tweets using TAGS. The key stats from the archive:

Number of Tweets in Archive 1,118
Number of usernames in Archive 215
First Tweet Collected 26/07/2017 14:58:12
Last Tweet Collected 05/08/2017 08:00:06

From http://www.repositoryfringe.org/:

Repository Fringe is a gathering for repository managers and others interested in research data repositories and publication repositories. Participation is a key element – the event is designed to encourage all attendees to share their repository experiences and expertise.

2017 marks the 10th Repo Fringe where we will be celebrating progress we have made over the last 10 years to share content beyond borders and debating future trends and challenges.

It took place in Edinburgh,  3 – 4 August 2017.

If you are not new to this blog you will then guess that I could not resist running the text of the tweets collected through Voyant Tools to obtain the term counts in the corpus with their Terms tool. As usual I applied the English stop words filter which I customised to include Twitter-specific terms (such as https, t.co, etc.) and the list of usernames.

I then manually refined the resulting data to remove smileys and any remaining usernames (some might have survived as it’s hard to disambiguate sometimes normal terms from usernames). I limited the results to 230 top terms.

Do take the counts with a pinch of salt as I did not clean the export from TAGS so Tweet duplicates and perhaps even some spam (who knows) might have remained.

Term Count
research 109
open 106
data 104
wikidata 75
oa 72
openscience 66
repository 63
repofringe 56
repositories 53
libraries 51
openresleeds 49
copyright 46
just 43
science 42
good 41
impact 41
thanks 41
day 39
access 38
poster 36
work 35
openaccess 34
talk 34
edinburgh 30
today 30
great 29
ucl 29
sherpa 28
read 27
want 27
event 26
project 26
really 26
time 26
cool 25
fringe 25
policy 24
metadata 23
publishers 23
publishing 23
says 23
colleague 22
policies 22
wikipedia 22
workflow 22
guide 21
millar 21
useful 21
comprehensive 20
content 20
fascinating 20
interesting 20
liveblogs 20
rdm 20
institutional 19
issue 19
it’s 19
liveblog 19
look 19
new 19
think 19
workshop 19
check 18
citizen 18
events 18
group 18
ip 18
management 18
need 18
outputs 18
presentation 18
rescue 18
session 18
trump 18
casrai 17
cycle 17
excellent 17
journal 17
lots 17
promotion 17
query 17
resource 17
uk 17
best 16
future 16
press 16
stuff 16
gallery 15
i’m 15
key 15
ref 15
showing 15
successful 15
support 15
thank 15
working 15
art 14
come 14
core 14
fun 14
miss 14
nice 14
process 14
provide 14
reminding 14
university 14
using 14
way 14
add 13
beautiful 13
demo 13
deposit 13
eprints 13
forward 13
funders 13
importance 13
keynote 13
looking 13
paper 13
phd 13
researchers 13
vote 13
e.g 12
era 12
especially 12
feedback 12
generation 12
got 12
let 12
needed 12
observation 12
recent 12
report 12
review 12
showcase 12
site2cite 12
star 12
theses 12
try 12
we’re 12
weirdness 12
advises 11
attendees 11
boat 11
broken 11
coar 11
control 11
criteria 11
exposure 11
global 11
institutions 11
like 11
model 11
prof 11
scholarly 11
survey 11
trek 11
use 11
years 11
articles 10
award 10
case 10
excited 10
exposing 10
figshare 10
gifts 10
hear 10
highlighted 10
important 10
initiative 10
integrating 10
introducing 10
live 10
opening 10
platform 10
ref2021 10
spend 10
vision 10
week 10
won 10
workshops 10
altmetric 9
colleagues 9
current 9
discussion 9
evidence 9
field 9
getting 9
i’ll 9
infrastructure 9
inspiring 9
library 9
link 9
list 9
local 9
long 9
make 9
meeting 9
peer 9
post 9
practice 9
preservation 9
problem 9
role 9
service 9
shoutout 9
shows 9
slides 9
sure 9
team 9
thought 9
touch 9
tweets 9
works 9
added 8
based 8
believe 8
better 8
change 8
conference 8
contributing 8
days 8
european 8
example 8
far 8
favourite 8
fully 8
here’s 8
image 8
included 8

Logically sharing this data as an HTML table is not the best way of doing it but hey. I have the source data if anyone is interested; Twitter developer guidelines allow the sharing of tweet IDs. In this case the source data is composed by the dataset of 1118 tweet ID strings (id_str).

Maybe I missed it but in the list above I could not find ‘bepress’ or ‘elsevier‘, by the way…

Advertisements