Skip to content

Commit

Permalink
post: couperin update
Browse files Browse the repository at this point in the history
  • Loading branch information
cbroschinski committed Oct 15, 2024
1 parent 7884993 commit b88ae39
Show file tree
Hide file tree
Showing 4 changed files with 230 additions and 0 deletions.
124 changes: 124 additions & 0 deletions Rmd/2024-10-15-couperin.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,124 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: Couperin.org provides additional APC data for 14 institutions
date: 2024-10-15 06:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo = FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
comment = "#>",
collapse = TRUE,
warning = FALSE,
message = FALSE,
echo = FALSE,
fig.width = 9,
fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
prettyNum(x, big.mark=",")
})
```

The [Couperin consortium](https://couperin.org) collects data on APC expenditures from universities and other research institutions in France.

APC expenditures for 14 different institutions have now been made available to OpenAPC, with one institution being included for the first time.

Contact Persons for Couperin are [Valerie Larroque](mailto:valerie.larroque@couperin.org) and [Sandrine Malotaux](mailto:sandrine.malotaux@inp-toulouse.fr).

## Cost data

```{r, cache.lazy = TRUE}
#' Download APC spreadsheet from github which requires to Curl installed
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv"){
if(is.null(path)) {
path <- c("https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv")
}
dir.create(dir)
download.file(url = path, destfile = paste(dir, file, sep = "/"), method = "curl")
read.csv(paste(dir, file, sep = "/"), header = T,sep =",")
}
my.apc <- download_apc(c("https://raw.githubusercontent.com/OpenAPC/openapc-de/refs/tags/v4.133.1-0-1/data/apc_de.csv"))
my.apc <- my.apc[226991:227201,]
my.apc <- my.apc[my.apc$institution != "",]
my.apc <- droplevels(my.apc)
```

The new data set provided by Couperin covers publication fees for `r format(nrow(my.apc), big.mark =",")` articles, total expenditure amounts to `r sum(my.apc$euro)`€ and the average fee is `r sum(my.apc$euro)/nrow(my.apc)`€.

The data originates from the following institutions:

- Université de Bourgogne
- Ecole centrale de Lyon
- ENAC - Ecole Nationale de l'Aviation Civile
- Toulouse INP - Institut National Polytechnique de Toulouse
- IRSN - Institut de Radioprotection et de Sûreté Nucléaire
- Le Mans Université
- Mines Paris - Ecole nationale supérieure des Mines de Paris
- Université d'Orléans
- Université de Pau et des Pays de l'Adour - UPPA
- La Rochelle Université
- Sciences Po Paris - Institut d'Etudes Politiques de Paris
- Université du Littoral Côte d'Opale
- Université Paris-Est Créteil Val de Marne - Paris XII
- Université Toulouse - Jean Jaurès (**new**)



## Overview

A detailed analysis of the contributed data sets provides the following overview:

### Breakdown by publisher

```{r}
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc$publisher <- factor(my.apc$publisher, levels = d_frame$Publisher)
df.summary <-cbind(tapply(my.apc$euro, my.apc$publisher, length),
tapply(my.apc$euro, my.apc$publisher, sum),
tapply(my.apc$euro, my.apc$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

### Fees paid per publisher (in EURO)

```{r tree_couperin_2024_10_15_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```

### Average costs per publisher (in EURO)

```{r box_couperin_2024_10_15_full, echo = FALSE, message = FALSE}
require(ggplot2)
require(tidyverse)
my.apc <- my.apc %>%
mutate(publisher = str_replace(publisher, ".+\\((\\w+)\\)", "\\1"))
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
publishers = as.character(d_frame$Publisher[d_frame$Freq > 3])
my.apc_reduced = my.apc[my.apc$publisher %in% publishers,]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
q <- q + ylab("Fees paid (in EURO)") + theme(legend.position="top") + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher (> 3 articles)") + ylab("APC")
```
106 changes: 106 additions & 0 deletions _posts/2024-10-15-couperin.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,106 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: Couperin.org provides additional APC data for 14 institutions
date: 2024-10-15 06:00:00
summary:
categories: [general, openAPC]
comments: true
---




The [Couperin consortium](https://couperin.org) collects data on APC expenditures from universities and other research institutions in France.

APC expenditures for 14 different institutions have now been made available to OpenAPC, with one institution being included for the first time.

Contact Persons for Couperin are [Valerie Larroque](mailto:valerie.larroque@couperin.org) and [Sandrine Malotaux](mailto:sandrine.malotaux@inp-toulouse.fr).

## Cost data



The new data set provided by Couperin covers publication fees for 211 articles, total expenditure amounts to 426,677€ and the average fee is 2,022€.

The data originates from the following institutions:

- Université de Bourgogne
- Ecole centrale de Lyon
- ENAC - Ecole Nationale de l'Aviation Civile
- Toulouse INP - Institut National Polytechnique de Toulouse
- IRSN - Institut de Radioprotection et de Sûreté Nucléaire
- Le Mans Université
- Mines Paris - Ecole nationale supérieure des Mines de Paris
- Université d'Orléans
- Université de Pau et des Pays de l'Adour - UPPA
- La Rochelle Université
- Sciences Po Paris - Institut d'Etudes Politiques de Paris
- Université du Littoral Côte d'Opale
- Université Paris-Est Créteil Val de Marne - Paris XII
- Université Toulouse - Jean Jaurès (**new**)



## Overview

A detailed analysis of the contributed data sets provides the following overview:

### Breakdown by publisher



| | Articles| Fees paid in EURO| Mean Fee paid|
|:----------------------------------------------------------|--------:|-----------------:|-------------:|
|MDPI AG | 55| 100807| 1833|
|Elsevier BV | 33| 55867| 1693|
|Springer Nature | 27| 69732| 2583|
|Frontiers Media SA | 26| 61448| 2363|
|Copernicus GmbH | 8| 12669| 1584|
|Informa UK Limited | 8| 23045| 2881|
|Public Library of Science (PLoS) | 6| 10843| 1807|
|Institute of Electrical & Electronics Engineers (IEEE) | 4| 7035| 1759|
|IOP Publishing | 4| 9898| 2474|
|Oxford University Press (OUP) | 4| 9669| 2417|
|Wiley-Blackwell | 3| 7798| 2600|
|American Chemical Society (ACS) | 2| 4365| 2183|
|Institution of Engineering and Technology (IET) | 2| 4710| 2355|
|SAGE Publications | 2| 4200| 2100|
|Acoustical Society of America (ASA) | 1| 1150| 1150|
|AIP Publishing | 1| 1263| 1263|
|American Association for the Advancement of Science (AAAS) | 1| 2017| 2017|
|American Physical Society (APS) | 1| 2402| 2402|
|American Society for Microbiology | 1| 3580| 3580|
|Cambridge University Press (CUP) | 1| 2940| 2940|
|Canadian Center of Science and Education | 1| 300| 300|
|CSIRO Publishing | 1| 946| 946|
|EDP Sciences | 1| 1070| 1070|
|Hindawi Publishing Corporation | 1| 1171| 1171|
|Hogrefe Publishing Group | 1| 2500| 2500|
|IGI Global | 1| 485| 485|
|Institute of Navigation | 1| 932| 932|
|Japan Laser Processing Society | 1| 339| 339|
|Life Science Alliance, LLC | 1| 1779| 1779|
|Magnus Med Club LLC - USA | 1| 497| 497|
|Mary Ann Liebert Inc | 1| 3600| 3600|
|Optica Publishing Group | 1| 1536| 1536|
|Proceedings of the National Academy of Sciences | 1| 2099| 2099|
|Royal Society of Chemistry (RSC) | 1| 2842| 2842|
|Science Publishing Group | 1| 276| 276|
|Scientific Societies | 1| 2134| 2134|
|University of California Press | 1| 896| 896|
|University of Uludag | 1| 1520| 1520|
|University of Wisconsin Press | 1| 2017| 2017|
|Walter de Gruyter GmbH | 1| 2000| 2000|
|Whioce Publishing Pte Ltd | 1| 2299| 2299|



### Fees paid per publisher (in EURO)

![plot of chunk tree_couperin_2024_10_15_full](/figure/tree_couperin_2024_10_15_full-1.png)

### Average costs per publisher (in EURO)

![plot of chunk box_couperin_2024_10_15_full](/figure/box_couperin_2024_10_15_full-1.png)
Binary file added figure/box_couperin_2024_10_15_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added figure/tree_couperin_2024_10_15_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit b88ae39

Please sign in to comment.