CfP #41: Big Scholarly Data – Große Datenmengen über wissenschaftliche Informationsprozesse

Posted in LIBREAS Call for Papers by Karsten Schuldt on 3. November 2021

[English version below]

Am 4. Mai 2021 verkündete Microsoft Research, eine Forschungsabteilung von Microsoft, das Ende von Microsoft Academics Services, ihrer Suchmaschine für wissenschaftliche Literatur. [1] Sie wurde gegründet, um, nach eigener Aussage, der Forschung den Zugriff auf große Datenmengen über wissenschaftliche Literatur und deren automatisierte Nachnutzung zu ermöglichen. Nun möchte sich Microsoft Research anderen Geschäftsfeldern widmen und verweist auf eine Vielzahl an Alternativen wie Crossref, Semantic Scholar oder Dimensions.

“We remain confident in open and community-driven alternatives to MAS and are pleased to see the recent momentum across the academic ecosystem.” [2]

Dem Echo auf diese Entscheidung nach zu urteilen, sind nicht alle ebenso von diesem Schritt überzeugt. Insbesondere der Service, der die Datengrundlage als Graph technisch verfügbar machte, wurde in den letzten Jahren sowohl in der Forschung, etwa in der quantitativen Wissenschaftsforschung, als auch durch Discovery- und Analysedienste [3] breit nachgenutzt. Forschende, die ihre Arbeit darauf ausgerichtet haben, sehen nun vor allem, dass ihnen ihr Bezugspunkt aufgrund einer Unternehmungsentscheidung mit knapper Frist verloren geht.

Wir möchten das Ende der Microsoft Academics Services zum Anlass nehmen, über den Stand und Perspektiven von so genannten Big Scholarly Data an Bibliotheken und weiteren Informationseinrichtungen zu reflektieren. Mit Big Scholarly Data sind Metadaten über das Gesamtaufkommen wissenschaftlicher Aktivität und ihrer Informationsprozesse gemeint. Sie umfassen bibliographische Informationen, Angaben über Autor*innenschaften oder Zitationsdaten und die Inhalte wissenschaftlicher Publikationen selbst. [4]

In praktischer Hinsicht fragen wir nach der gegenwärtigen Rolle von Big Scholarly Data bei der Weiterentwicklung von Informationsangeboten und Dienstleistungen. Welche Angebote werden wie eingesetzt und wie gelingt die Auswertung großer Datenmengen sowohl technisch als auch inhaltlich-organisatorisch? Wo an welchen Einrichtungen wird Big Scholarly Data angewandt und welche Kompetenzen bedarf es? Welche Fragestellungen lassen sich mit Rückgriff auf Big Scholarly Data wie beantworten?

Neben dem Bekanntmachen und Teilen praktischer Erfahrungen möchten wir Big Scholarly Data verstärkt als sozio-kulturelles Phänomen verstehen. Was bedeutet die Abhängigkeit von wenigen Anbietern, deren Daten in Informations- und Monitoringangeboten breit nachgenutzt werden? Entsteht eine Pfadabhängigkeit, die dazu führt, dass unterrepräsentierte Publikationspraxen noch stärker marginalisiert werden? Welchen Einfluss spielen Indexierungs- und Typologisierungsentscheidungen der Anbieter bei der Analyse von wissenschaftlichen Informationsressourcen im erweiterten Kontext des Wissenschaftsmanagements, etwa bei der Planung der Transformation des Zeitschriftenetats einer Bibliothek ins Open Access? Ist Big Scholarly Data die Lösung oder der Katalysator der gegenwärtigen “Auffindbarkeitskrise”? [5]

Einreichungsschluss ist der 31. März 2022.

Ihre / eure Redaktion LIBREAS. Library Ideas

(Aarhus, Berlin, Hannover, Lausanne, München)

Fussnoten:

[1] https://web.archive.org/web/20210902200418/https://www.microsoft.com/en-us/research/project/academic/articles/microsoft-academic-to-expand-horizons-with-community-driven-approach/#

[2] https://www.microsoft.com/en-us/research/project/academic/articles/microsoft-academic-to-expand-horizons-with-community-driven-approach/

[3] Siehe etwa den Open-Access-Datendienst Unpaywall. OurResearch, das Team dahinter, hat im Juni 2021 angekündigt, einen Nachfolger für den “Microsoft Academic Graph” zu entwickeln, vergleiche https://blog.ourresearch.org/were-building-a-replacement-for-microsoft-academic-graph/.

[4] Vgl. u.a. Feng Xia; Wei Wang; Teshome Megersa Bekele; Huan Liu: Big Scholarly Data: A Survey. In: IEEE Transactions on Big Data, vol. 3, no. 1, pp. 18-35, 1 March 2017, doi: 10.1109/TBDATA.2016.2641460. – “The term Big Scholarly Data is coined for the rapidly growing scholarly data, which contains information including millions of authors, papers, citations, figures, tables, as well as scholarly networks and digital libraries.“

[5] Kraker, P., Schramm, M., & Kittel, C. (2021). Discoverability in (a) Crisis. ABI Technik, 41(1), 3–12. https://doi.org/10.1515/abitech-2021-0003.

English version:

On May 4, 2021, Microsoft Research, a research division of Microsoft, announced the end of Microsoft Academics Services, their search engine for scientific literature. It was founded, according to their own statement, to enable research access to large amounts of data on scientific literature and to automate its subsequent use. Now Microsoft Research wants to devote itself to other business areas and refers to a variety of alternatives such as Crossref, Semantic Scholar or Dimensions.

“We remain confident in open and community-driven alternatives to MAS and are pleased to see the recent momentum across the academic ecosystem.” [6]

Judging by the response to this decision, not everyone is equally convinced by this step. In particular, the service that made the database technically available as a graph has been widely followed in recent years both in research, such as quantitative science research, and by discovery and analysis services. Researchers who have based their work on it are now facing the loss of their reference pointdue to an enterprise decision with a tight deadline.

We would like to take the end of Microsoft Academics Services as an occasion to reflect on the status quo and the prospects of so-called Big Scholarly Data at libraries and other information institutions. Big Scholarly Data refers to metadata about a high amount of scholarly activity and its information processes. It includes bibliographic information, authorship or citation data, and the content of scholarly publications themselves.

From a practical point of view, we ask about the current role of Big Scholarly Data in the further development of access to information and the related services. Which services are used and how, and how does the analysis of big data work both technically and in terms of content and organization? Where at which institutions is Big Scholarly Data applied and what competencies are required? Which questions can be answered with Big Scholarly Data and how?

In addition to publicizing and sharing practical experiences, we would like to understand Big Scholarly Data more as a socio-cultural phenomenon. What does dependence on a few providers whose data is widely reused in information and monitoring services mean? Does a path dependency emerge that leads to underrepresented publishing practices becoming even more marginalized? What influence do indexing and typologization decisions by vendors play in analyzing scholarly information resources in the broader context of science management, such as planning the transformation of a library’s journal budget to open access? Is Big Scholarly Data the solution or catalyst to the current „discoverability crisis“?

Deadline for submissions is March 31, 2022.

Your Editors LIBREAS. Library Ideas

(Aarhus, Berlin, Hannover, Lausanne, Munich)

Footnotes:

[6] https://www.microsoft.com/en-us/research/project/academic/articles/microsoft-academic-to-expand-horizons-with-community-driven-approach/

Tagged with: Big Scholarly Data, CfP, Forschungsdaten, Forschungsinfrastruktur

LIBREAS.Library Ideas