Skip to main content

Showing 1–50 of 65 results for author: Accomazzi, A

  1. arXiv:2409.19750  [pdf, other

    astro-ph.IM cs.CL

    AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy

    Authors: Rui Pan, Tuan Dung Nguyen, Hardik Arora, Alberto Accomazzi, Tirthankar Ghosal, Yuan-Sen Ting

    Abstract: Continual pretraining of large language models on domain-specific data has been proposed to enhance performance on downstream tasks. In astronomy, the previous absence of astronomy-focused benchmarks has hindered objective evaluation of these specialized LLM models. Leveraging a recent initiative to curate high-quality astronomical MCQs, this study aims to quantitatively assess specialized LLMs in… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: 10 pages, 1 figure, 1 table, accepted to AI4S: The 5th Workshop on Artificial Intelligence and Machine Learning for Scientific Applications at the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC24). Models will be released at https://huggingface.co/AstroMLab. AstroMLab homepage: https://astromlab.org/

  2. arXiv:2408.01556  [pdf, other

    astro-ph.IM cs.DL cs.IR

    pathfinder: A Semantic Framework for Literature Review and Knowledge Discovery in Astronomy

    Authors: Kartheik G. Iyer, Mikaeel Yunus, Charles O'Neill, Christine Ye, Alina Hyk, Kiera McCormick, Ioana Ciuca, John F. Wu, Alberto Accomazzi, Simone Astarita, Rishabh Chakrabarty, Jesse Cranney, Anjalie Field, Tirthankar Ghosal, Michele Ginolfi, Marc Huertas-Company, Maja Jablonska, Sandor Kruk, Huiling Liu, Gabriel Marchidan, Rohit Mistry, J. P. Naiman, J. E. G. Peek, Mugdha Polimera, Sergio J. Rodriguez , et al. (5 additional authors not shown)

    Abstract: The exponential growth of astronomical literature poses significant challenges for researchers navigating and synthesizing general insights or even domain-specific knowledge. We present Pathfinder, a machine learning framework designed to enable literature review and knowledge discovery in astronomy, focusing on semantic searching with natural language instead of syntactic searches with keywords.… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 25 pages, 9 figures, submitted to AAS jorunals. Comments are welcome, and the tools mentioned are available online at https://pfdr.app

  3. arXiv:2407.11194  [pdf, other

    astro-ph.IM astro-ph.EP astro-ph.GA astro-ph.SR cs.AI cs.CL

    AstroMLab 1: Who Wins Astronomy Jeopardy!?

    Authors: Yuan-Sen Ting, Tuan Dung Nguyen, Tirthankar Ghosal, Rui Pan, Hardik Arora, Zechang Sun, Tijmen de Haan, Nesar Ramachandra, Azton Wells, Sandeep Madireddy, Alberto Accomazzi

    Abstract: We present a comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset. This dataset comprises 4,425 multiple-choice questions curated from the Annual Review of Astronomy and Astrophysics, covering a broad range of astrophysical topics. Our analysis examines model performance across various astronomical subfields and asse… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 45 pages, 12 figures, 7 tables. Submitted to ApJ. Comments welcome. AstroMLab homepage: https://astromlab.org/

  4. arXiv:2405.10725  [pdf, other

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis , et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,… ▽ More

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  5. arXiv:2401.09685  [pdf, ps, other

    astro-ph.IM cs.DL

    Decades of Transformation: Evolution of the NASA Astrophysics Data System's Infrastructure

    Authors: Alberto Accomazzi

    Abstract: The NASA Astrophysics Data System (ADS) is the primary Digital Library portal for researchers in astronomy and astrophysics. Over the past 30 years, the ADS has gone from being an astronomy-focused bibliographic database to an open digital library system supporting research in space and (soon) earth sciences. This paper describes the evolution of the ADS system, its capabilities, and the technolog… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures, submitted to the ADASS 2023 proceedings

  6. arXiv:2312.17297  [pdf, ps, other

    astro-ph.IM

    Improving the visibility and citability of exoplanet research software

    Authors: Alice Allen, Alberto Accomazzi, Joe P. Renaud

    Abstract: The Astrophysics Source Code Library (ASCL) is a free online registry for source codes of interest to astronomers, astrophysicists, and planetary scientists. It lists, and in some cases houses, software that has been used in research appearing in or submitted to peer-reviewed publications. As of December 2023, it has over 3300 software entries and is indexed by NASA's Astrophysics Data System (ADS… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 3 figures

  7. arXiv:2312.14211  [pdf, ps, other

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33th annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  8. arXiv:2312.08579  [pdf, other

    cs.CL astro-ph.IM cs.LG

    Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

    Authors: Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

    Abstract: The automatic identification of planetary feature names in astronomy publications presents numerous challenges. These features include craters, defined as roughly circular depressions resulting from impact or volcanic activity; dorsas, which are elongate raised structures or wrinkle ridges; and lacus, small irregular patches of dark, smooth material on the Moon, referred to as "lake" (Planetary Na… ▽ More

    Submitted 17 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  9. arXiv:2311.04272  [pdf, other

    astro-ph.IM

    The Future of Astronomical Data Infrastructure: Meeting Report

    Authors: Michael R. Blanton, Janet D. Evans, Dara Norman, William O'Mullane, Adrian Price-Whelan, Luca Rizzi, Alberto Accomazzi, Megan Ansdell, Stephen Bailey, Paul Barrett, Steven Berukoff, Adam Bolton, Julian Borrill, Kelle Cruz, Julianne Dalcanton, Vandana Desai, Gregory P. Dubois-Felsmann, Frossie Economou, Henry Ferguson, Bryan Field, Dan Foreman-Mackey, Jaime Forero-Romero, Niall Gaffney, Kim Gillies, Matthew J. Graham , et al. (47 additional authors not shown)

    Abstract: The astronomical community is grappling with the increasing volume and complexity of data produced by modern telescopes, due to difficulties in reducing, accessing, analyzing, and combining archives of data. To address this challenge, we propose the establishment of a coordinating body, an "entity," with the specific mission of enhancing the interoperability, archiving, distribution, and productio… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 59 pages; please send comments and/or questions to foadi@googlegroups.com

  10. arXiv:2309.06126  [pdf, other

    astro-ph.IM astro-ph.CO astro-ph.GA astro-ph.HE cs.CL cs.LG

    AstroLLaMA: Towards Specialized Foundation Models in Astronomy

    Authors: Tuan Dung Nguyen, Yuan-Sen Ting, Ioana Ciucă, Charlie O'Neill, Ze-Chang Sun, Maja Jabłońska, Sandor Kruk, Ernest Perkowski, Jack Miller, Jason Li, Josh Peek, Kartheik Iyer, Tomasz Różański, Pranav Khetarpal, Sharaf Zaman, David Brodrick, Sergio J. Rodríguez Méndez, Thang Bui, Alyssa Goodman, Alberto Accomazzi, Jill Naiman, Jesse Cranney, Kevin Schawinski, UniverseTBD

    Abstract: Large language models excel in many human-language tasks but often falter in highly specialized domains like scholarly astronomy. To bridge this gap, we introduce AstroLLaMA, a 7-billion-parameter model fine-tuned from LLaMA-2 using over 300,000 astronomy abstracts from arXiv. Optimized for traditional causal language modeling, AstroLLaMA achieves a 30% lower perplexity than Llama-2, showing marke… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures, submitted to IJCNLP-AACL 2023. Comments are welcome. The model can be found on Hugging Face - https://huggingface.co/universeTBD/astrollama

  11. arXiv:2212.00744  [pdf, ps, other

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first… ▽ More

    Submitted 29 November, 2022; originally announced December 2022.

  12. arXiv:2202.00777  [pdf, ps, other

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  13. arXiv:2112.00590  [pdf, ps, other

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  14. Best Practices for Data Publication in the Astronomical Literature

    Authors: Tracy X. Chen, Marion Schmitz, Joseph M. Mazzarella, Xiuqin Wu, Julian C. van Eyken, Alberto Accomazzi, Rachel L. Akeson, Mark Allen, Rachael Beaton, G. Bruce Berriman, Andrew W. Boyle, Marianne Brouty, Ben Chan, Jessie L. Christiansen, David R. Ciardi, David Cook, Raffaele D'Abrusco, Rick Ebert, Cren Frayer, Benjamin J. Fulton, Christopher Gelino, George Helou, Calen B. Henderson, Justin Howell, Joyce Kim , et al. (20 additional authors not shown)

    Abstract: We present an overview of best practices for publishing data in astronomy and astrophysics journals. These recommendations are intended as a reference for authors to help prepare and publish data in a way that will better represent and support science results, enable better data sharing, improve reproducibility, and enhance the reusability of data. Observance of these guidelines will also help to… ▽ More

    Submitted 16 April, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: 19 pages, 1 figure, 3 tables, accepted for publication in ApJS

  15. arXiv:2009.14323  [pdf

    astro-ph.IM cs.DL

    Enabling Synergy: Improving the Information Infrastructure for Planetary Science

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: In this whitepaper we advocate that the Planetary Science (PS) community build a discipline-specific digital library, in collaboration with the existing astronomy digital library, ADS. We suggest that the PS data archives increase their level of curation to allow for direct linking between the archival data and the derived journal articles. And we suggest that a new component of the PS information… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

    Comments: 8 pages, submitted to the Planetary Science and Astrobiology Decadal Survey 2023-2032

  16. arXiv:2009.05048  [pdf, ps, other

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  17. arXiv:2007.10549  [pdf

    astro-ph.IM astro-ph.EP

    Enabling Effective Exoplanet / Planetary Collaborative Science

    Authors: Mark S. Marley, Chester Harman, Heidi B. Hammel, Paul Byrne, Jonathan Fortney, Alberto Accomazzi, Sarah E. Moran, Michael Way, Jessie Christiansen, Noam Izenberg, Timothy Holt, Sanaz Vahidinia, Erika Kohler, Karalee Brugman

    Abstract: The field of exoplanetary science has emerged over the past two decades, rising up alongside traditional solar system planetary science. Both fields focus on understanding the processes which form and sculpt planets through time, yet there has been less scientific exchange between the two communities than is ideal. This white paper explores some of the institutional and cultural barriers which imp… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 8 pages; white paper submitted to the Planetary Science and Astrobiology Decadal Survey 2023-2032

  18. arXiv:1911.00295  [pdf, other

    cs.DL

    Practice meets Principle: Tracking Software and Data Citations to Zenodo DOIs

    Authors: Stephanie van de Sandt, Lars Holm Nielsen, Alexandros Ioannidis, August Muench, Edwin Henneken, Alberto Accomazzi, Chiara Bigarella, Jose Benito Gonzalez Lopez, Sünje Dallmeier-Tiessen

    Abstract: Data and software citations are crucial for the transparency of research results and for the transmission of credit. But they are hard to track, because of the absence of a common citation standard. As a consequence, the FORCE11 recently proposed data and software citation principles as guidance for authors. Zenodo is recognized for the implementation of DOIs for software on a large scale. The min… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  19. arXiv:1903.06634  [pdf

    astro-ph.GA astro-ph.EP astro-ph.HE astro-ph.IM astro-ph.SR

    Increasing the Discovery Space in Astrophysics - A Collation of Six Submitted White Papers

    Authors: G. Fabbiano, M. Elvis, A. Accomazzi, G. B. Berriman, N. Brickhouse, S. Bose, D. Carrera, I. Chilingarian, F. Civano, B. Czerny, R. D'Abrusco, B. Diemer, J. Drake, R. Emami Meibody, J. R. Farah, G. G. Fazio, E. Feigelson, F. Fornasini, Jay Gallagher, J. Grindlay, L. Hernquist, D. J. James, M. Karovska, V. Kashyap, D. -W. Kim , et al. (24 additional authors not shown)

    Abstract: We write in response to the call from the 2020 Decadal Survey to submit white papers illustrating the most pressing scientific questions in astrophysics for the coming decade. We propose exploration as the central question for the Decadal Committee's discussions.The history of astronomy shows that paradigm changing discoveries are not driven by well formulated scientific questions, based on the kn… ▽ More

    Submitted 18 March, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

  20. arXiv:1903.00297  [pdf

    astro-ph.IM

    From Dark Energy to Exolife: Improving the Digital Information Infrastructure for Astrophysics

    Authors: Michael J. Kurtz, Alberto Accomazzi

    Abstract: Some of the most exciting and promising areas of Astronomy research today are found at the boundaries of the discipline: the search for Exoplanets and Multi-Messenger Astronomy. In order to achieve breakthroughs in these research fields over the next decade, innovation and expansion of the digital information infrastructure which supports this research is required. Astronomy has been well-served b… ▽ More

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: 6 pages, whitepaper submitted to Astro2020, the Astronomy and Astrophysics Decadal Survey

  21. arXiv:1901.05463  [pdf, ps, other

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a serviceoriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  22. arXiv:1803.03598  [pdf

    astro-ph.IM cs.DL physics.soc-ph

    Merging the Astrophysics and Planetary Science Information Systems

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin A. Henneken

    Abstract: Conceptually exoplanet research has one foot in the discipline of Astrophysics and the other foot in Planetary Science. Research strategies for exoplanets will require efficient access to data and information from both realms. Astrophysics has a sophisticated, well integrated, distributed information system with archives and data centers which are interlinked with the technical literature via the… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: Whitepaper submitted to the Committee on an Exoplanet Science Strategy

  23. arXiv:1801.01021  [pdf, other

    astro-ph.IM cs.DL

    The Unified Astronomy Thesaurus: Semantic Metadata for Astronomy and Astrophysics

    Authors: Katie Frey, Alberto Accomazzi

    Abstract: Several different controlled vocabularies have been developed and used by the astronomical community, each designed to serve a specific need and a specific group. The Unified Astronomy Thesaurus (UAT) attempts to provide a highly structured controlled vocabulary that will be relevant and useful across the entire discipline, regardless of content or platform. As two major use cases for the UAT incl… ▽ More

    Submitted 3 January, 2018; originally announced January 2018.

    Comments: Submitted to the Astrophysical Journal Supplements, 10 pages, 3 tables

  24. arXiv:1712.06704  [pdf, ps, other

    stat.ML cs.CL cs.IR

    Multilingual Topic Models

    Authors: Kriste Krstovski, Michael J. Kurtz, David A. Smith, Alberto Accomazzi

    Abstract: Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document… ▽ More

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: 18 pages, 9 figures

  25. New ADS Functionality for the Curator

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Steven McDonald, Taylor J. Shaulis, Sergi Blanco-Cuaresma, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton

    Abstract: In this paper we provide an update concerning the operations of the NASA Astrophysics Data System (ADS), its services and user interface, and the content currently indexed in its database. As the primary information system used by researchers in Astronomy, the ADS aims to provide a comprehensive index of all scholarly resources appearing in the literature. With the current effort in our community… ▽ More

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: Submitted to the Proceedings of Library and Information Services in Astronomy VIII, Strasbourg, France

  26. arXiv:1709.09566  [pdf, ps, other

    astro-ph.IM

    NASA's Long-Term Astrophysics Data Archives

    Authors: L. M. Rebull, V. Desai, H. Teplitz, S. Groom, R. Akeson, G. B. Berriman, G. Helou, D. Imel, J. M. Mazzarella, A. Accomazzi, T. McGlynn, A. Smale, R. White

    Abstract: NASA regards data handling and archiving as an integral part of space missions, and has a strong track record of serving astrophysics data to the public, beginning with the the IRAS satellite in 1983. Archives enable a major science return on the significant investment required to develop a space mission. In fact, the presence and accessibility of an archive can more than double the number of pape… ▽ More

    Submitted 27 September, 2017; originally announced September 2017.

    Comments: To appear in ADASS 2016 conference proceedings (invited talk)

  27. arXiv:1601.07858  [pdf, ps, other

    astro-ph.IM cs.DL

    Aggregation and Linking of Observational Metadata in the ADS

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Alexandra Holachek, Jonathan Elliott

    Abstract: We discuss current efforts behind the curation of observing proposals, archive bibliographies, and data links in the NASA Astrophysics Data System (ADS). The primary data in the ADS is the bibliographic content from scholarly articles in Astronomy and Physics, which ADS aggregates from publishers, arXiv and conference proceeding sites. This core bibliographic information is then further enriched b… ▽ More

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 4 pages, Proceedings of the ADASS XXV conference

  28. arXiv:1503.05881  [pdf, other

    cs.DL

    ADS 2.0: new architecture, API and services

    Authors: Roman Chyla, Alberto Accomazzi, Alexandra Holachek, Carolyn S. Grant, Jonathan Elliott, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray, Vladimir Sudilovsky

    Abstract: The ADS platform is undergoing the biggest rewrite of its 20-year history. While several components have been added to its architecture over the past couple of years, this talk will concentrate on the underpinnings of ADS's search layer and its API. To illustrate the design of the components in the new system, we will show how the new ADS user interface is built exclusively on top of the API using… ▽ More

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: ADASS Conference 2014

  29. arXiv:1503.04194  [pdf, other

    astro-ph.IM cs.DL

    ADS: The Next Generation Search Platform

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Roman Chyla, James Luker, Carolyn S. Grant, Donna M. Thompson, Alexandra Holachek, Rahul Dave, Stephen S. Murray

    Abstract: Four years after the last LISA meeting, the NASA Astrophysics Data System (ADS) finds itself in the middle of major changes to the infrastructure and contents of its database. In this paper we highlight a number of features of great importance to librarians and discuss the additional functionality that we are currently developing. Starting in 2011, the ADS started to systematically collect, parse… ▽ More

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: Submitted to Library and Information Services in Astronomy VII, Naples, Italy

  30. arXiv:1406.4542  [pdf, ps, other

    cs.DL astro-ph.IM

    Computing and Using Metrics in the ADS

    Authors: Edwin A. Henneken, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Donna Thompson, Jay Luker, Roman Chyla, Alexandra Holachek, Stephen S. Murray

    Abstract: Finding measures for research impact, be it for individuals, institutions, instruments or projects, has gained a lot of popularity. More papers than ever are being written on new impact measures, and problems with existing measures are being pointed out on a regular basis. Funding agencies require impact statistics in their reports, job candidates incorporate them in their resumes, and publication… ▽ More

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: to appear in proceedings of LISA VII conference, Naples, Italy

  31. arXiv:1403.6656  [pdf, other

    astro-ph.IM cs.DL

    The Unified Astronomy Thesaurus

    Authors: Alberto Accomazzi, Norman Gray, Chris Erdmann, Chris Biemesderfer, Katie Frey, Justin Soles

    Abstract: The Unified Astronomy Thesaurus (UAT) is an open, interoperable and community-supported thesaurus which unifies the existing divergent and isolated Astronomy & Astrophysics vocabularies into a single high-quality, freely-available open thesaurus formalizing astronomical concepts and their inter-relationships. The UAT builds upon the existing IAU Thesaurus with major contributions from the astronom… ▽ More

    Submitted 26 March, 2014; originally announced March 2014.

    Comments: 4 pages, 1 figure, to appear in Proceedings of Astronomical Data Analysis Software and Systems XXIII, which took place September 29-October 3, 2013

  32. arXiv:1210.8030  [pdf, other

    astro-ph.IM cs.DL

    Astronomy and Computing: a New Journal for the Astronomical Computing Community

    Authors: Alberto Accomazzi, Tamás Budavári, Christopher Fluke, Norman Gray, Robert G Mann, William O'Mullane, Andreas Wicenec, Michael Wise

    Abstract: We introduce \emph{Astronomy and Computing}, a new journal for the growing population of people working in the domain where astronomy overlaps with computer science and information technology. The journal aims to provide a new communication channel within that community, which is not well served by current journals, and to help secure recognition of its true importance within modern astronomy. In… ▽ More

    Submitted 30 October, 2012; originally announced October 2012.

    Comments: 5 pages, no figures; editorial for first edition of journal

  33. arXiv:1206.6352  [pdf, other

    astro-ph.IM cs.DL

    Telescope Bibliographies: an Essential Component of Archival Data Management and Operations

    Authors: Alberto Accomazzi, Edwin Henneken, Christopher Erdmann, Arnold Rots

    Abstract: Assessing the impact of astronomical facilities rests upon an evaluation of the scientific discoveries which their data have enabled. Telescope bibliographies, which link data products with the literature, provide a way to use bibliometrics as an impact measure for the underlying data. In this paper we argue that the creation and maintenance of telescope bibliographies should be considered an inte… ▽ More

    Submitted 30 July, 2012; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: 10 pages, 3 figures, to appear in SPIE Astronomical Telescopes and Instrumentation, SPIE Conference Series 8448

  34. arXiv:1112.1688  [pdf, ps, other

    astro-ph.IM cs.DL

    Why don't we already have an Integrated Framework for the Publication and Preservation of all Data Products?

    Authors: Alberto Accomazzi, Sebastien Derriere, Chris Biemesderfer, Norman Gray

    Abstract: Astronomy has long had a working network of archives supporting the curation of publications and data. The discipline has already created many of the features which perplex other areas of science: (1) data repositories: (supra)national institutes, dedicated to large projects; a culture of user-contributed data; practical experience of long-term data preservation; (2) dataset identifiers: the commu… ▽ More

    Submitted 7 December, 2011; originally announced December 2011.

    Comments: 4 pages, submitted to the ADASS XXI proceedings

  35. arXiv:1111.3618  [pdf, ps, other

    cs.DL astro-ph.IM

    Linking to Data - Effect on Citation Rates in Astronomy

    Authors: Edwin A. Henneken, Alberto Accomazzi

    Abstract: Is there a difference in citation rates between articles that were published with links to data and articles that were not? Besides being interesting from a purely academic point of view, this question is also highly relevant for the process of furthering science. Data sharing not only helps the process of verification of claims, but also the discovery of new findings in archival data. However, li… ▽ More

    Submitted 15 November, 2011; originally announced November 2011.

    Comments: 4 pages, 3 figures, will appear proceedings of ADASS XXI

  36. arXiv:1106.5644  [pdf, ps, other

    astro-ph.IM cs.DL

    The ADS in the Information Age - Impact on Discovery

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi

    Abstract: The SAO/NASA Astrophysics Data System (ADS) grew up with and has been riding the waves of the Information Age, closely monitoring and anticipating the needs of its end-users. By now, all professional astronomers are using the ADS on a daily basis, and a substantial fraction have been using it for their entire professional career. In addition to being an indispensable tool for professional scientis… ▽ More

    Submitted 28 June, 2011; originally announced June 2011.

    Comments: 10 pages, 5 figures, to appear in "Organizations, People and Strategies in Astronomy (OPSA)", volume 8

  37. arXiv:1103.5958  [pdf, other

    astro-ph.IM cs.DL

    Semantic Interlinking of Resources in the Virtual Observatory Era

    Authors: Alberto Accomazzi, Rahul Dave

    Abstract: In the coming era of data-intensive science, it will be increasingly important to be able to seamlessly move between scientific results, the data analyzed in them, and the processes used to produce them. As observations, derived data products, publications, and object metadata are curated by different projects and archived in different locations, establishing the proper linkages between these reso… ▽ More

    Submitted 30 March, 2011; originally announced March 2011.

    Comments: 10 pages, 3 figures, to appear in: ASPC 442 (2011), Proceedings of Astronomical Data Analysis Software and Systems XX

  38. Linking Literature and Data: Status Report and Future Efforts

    Authors: Alberto Accomazzi

    Abstract: In the current era of data-intensive science, it is increasingly important for researchers to be able to have access to published results, the supporting data, and the processes used to produce them. Six years ago, recognizing this need, the American Astronomical Society and the Astrophysics Data Centers Executive Committee (ADEC) sponsored an effort to facilitate the annotation and linking of dat… ▽ More

    Submitted 22 March, 2011; originally announced March 2011.

    Comments: 9 pages, 2 figures, to appear in: Future Professional Communication in Astronomy II (FPCA-II)

  39. arXiv:1006.0670  [pdf, ps, other

    astro-ph.IM cs.DL

    Astronomy 3.0 Style

    Authors: Alberto Accomazzi

    Abstract: Over the next decade we will witness the development of a new infrastructure in support of data-intensive scientific research, which includes Astronomy. This new networked environment will offer both challenges and opportunities to our community and has the potential to transform the way data are described, curated and preserved. Based on the lessons learned during the development and management o… ▽ More

    Submitted 3 June, 2010; originally announced June 2010.

    Comments: 9 pages, 2 figures, to appear in Library and Information Services in Astronomy VI, ASP Conference Proceedings

  40. Finding Your Literature Match -- A Recommender System

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Elizabeth Bohlen, Giovanni Di Milia, Jay Luker, Stephen S. Murray

    Abstract: The universe of potentially interesting, searchable literature is expanding continuously. Besides the normal expansion, there is an additional influx of literature because of interdisciplinary boundaries becoming more and more diffuse. Hence, the need for accurate, efficient and intelligent search tools is bigger than ever. Even with a sophisticated search engine, looking for information can still… ▽ More

    Submitted 13 May, 2010; originally announced May 2010.

    Comments: Contribution to the proceedings of the colloquium Future Professional Communication in Astronomy II, 13-14 April 2010, Cambridge, Massachusetts. 11 pages, 4 figures.

  41. arXiv:1005.1886  [pdf, other

    astro-ph.IM

    Towards a Resource-Centric Data Network for Astronomy

    Authors: Alberto Accomazzi, Michael J. Kurtz, Stephen S. Murray

    Abstract: Over the past decade, astronomers have been using an increasingly larger number of web-based applications and archives to conduct their research. However, despite the early success in creating links across projects and data centers, the promise of a single integrated digital library environment supporting e-science in astronomy has proven elusive. While some of the issues hampering progress in t… ▽ More

    Submitted 11 May, 2010; originally announced May 2010.

    Comments: 6 pages, 1 figure, proceedings of IAU Special Session 5, "Accelerating the Rate of Astronomical Discovery." To be published in Proceedings of Science

  42. arXiv:0912.5235  [pdf, ps, other

    astro-ph.IM cs.DL cs.IR physics.soc-ph

    Using Multipartite Graphs for Recommendation and Discovery

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin Henneken, Giovanni Di Milia, Carolyn S. Grant

    Abstract: The Smithsonian/NASA Astrophysics Data System exists at the nexus of a dense system of interacting and interlinked information networks. The syntactic and the semantic content of this multipartite graph structure can be combined to provide very specific research recommendations to the scientist/user.

    Submitted 30 December, 2009; originally announced December 2009.

    Comments: To appear in ADASS XIX, ASP Conf Proc

  43. arXiv:0909.4789  [pdf

    cs.DL physics.soc-ph

    The Bibliometric Properties of Article Readership Information

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray, Nathalie Martimbeau, Barbara Elwell

    Abstract: The NASA Astrophysics Data System (ADS), along with astronomy's journals and data centers (a collaboration dubbed URANIA), has developed a distributed on-line digital library which has become the dominant means by which astronomers search, access and read their technical literature. Digital libraries such as the NASA Astrophysics Data System permit the easy accumulation of a new type of bibliome… ▽ More

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56..111K This is the second paper (the first is Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library) from the original article The NASA Astrophysics Data System: Sociology, Bibliometrics, and Impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 111 (2005)

  44. arXiv:0909.4786  [pdf

    cs.DL physics.soc-ph

    Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Stephen S. Murray

    Abstract: By combining data from the text, citation, and reference databases with data from the ADS readership logs we have been able to create Second Order Bibliometric Operators, a customizable class of collaborative filters which permits substantially improved accuracy in literature queries. Using the ADS usage logs along with membership statistics from the International Astronomical Union and data o… ▽ More

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56...36K This is a portion (The bibliometric properties of article readership information is the other part) of the article: The NASA Astrophysics Data System: Sociology, bibliometrics and impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 36. (2005)

  45. arXiv:0903.3228  [pdf

    astro-ph.IM cs.DL

    The Smithsonian/NASA Astrophysics Data System (ADS) Decennial Report

    Authors: Michael J. Kurtz, Alberto Accomazzi, Stephen S. Murray

    Abstract: Eight years after the ADS first appeared the last decadal survey wrote: "NASA's initiative for the Astrophysics Data System has vastly increased the accessibility of the scientific literature for astronomers. NASA deserves credit for this valuable initiative and is urged to continue it." Here we summarize some of the changes concerning the ADS which have occurred in the past ten years, and we de… ▽ More

    Submitted 18 March, 2009; originally announced March 2009.

    Comments: 6 pages, whitepaper submitted to the National Research Council Astronomy and Astrophysics Decadal Survey

  46. Use of Astronomical Literature - A Report on Usage Patterns

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: In this paper we present a number of metrics for usage of the SAO/NASA Astrophysics Data System (ADS). Since the ADS is used by the entire astronomical community, these are indicative of how the astronomical literature is used. We will show how the use of the ADS has changed both quantitatively and qualitatively. We will also show that different types of users access the system in different ways… ▽ More

    Submitted 3 October, 2008; v1 submitted 1 August, 2008; originally announced August 2008.

    Comments: 12 pages, 8 figures, 2 tables. Accepted by Journal of Informetrics

  47. arXiv:cs/0701035  [pdf, ps, other

    cs.DL astro-ph

    Finding Astronomical Communities Through Co-readership Analysis

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Whenever a large group of people are engaged in an activity, communities will form. The nature of these communities depends on the relationship considered. In the group of people who regularly use scholarly literature, a relationship like ``person i and person j have cited the same paper'' might reveal communities of people working in a particular field. On this poster, we will investigate the r… ▽ More

    Submitted 5 January, 2007; originally announced January 2007.

    Comments: poster presented at the 209th AAS Meeting, 7 pages, 4 figures

  48. arXiv:astro-ph/0611575  [pdf, ps, other

    astro-ph

    Closing the loop: Linking Datasets to Publications and Back

    Authors: Alberto Accomazzi, Guenther Eichhorn, Arnold Rots

    Abstract: With the mainstream adoption of references to datasets in astronomical manuscripts, researchers today are able to provide direct links from their papers to the original data that were used in their study. Following a process similar to the verification of references in manuscripts, publishers have been working with the NASA Astrophysics Data System (ADS) to validate and maintain links to these d… ▽ More

    Submitted 17 November, 2006; originally announced November 2006.

    Comments: 4 pages, submitted to the proceedings of the Astronomical Data Analysis Software & Systems XVI

  49. arXiv:cs/0610030  [pdf, ps, other

    cs.DL cs.HC

    Paper to Screen: Processing Historical Scans in the ADS

    Authors: Donna M. Thompson, Alberto Accomazzi, Guenther Eichhorn, Carolyn Grant, Edwin Henneken, Michael J. Kurtz, Elizabeth Bohlen, Stephen S. Murray

    Abstract: The NASA Astrophysics Data System in conjunction with the Wolbach Library at the Harvard-Smithsonian Center for Astrophysics is working on a project to microfilm historical observatory publications. The microfilm is then scanned for inclusion in the ADS. The ADS currently contains over 700,000 scanned pages of volumes of historical literature. Many of these volumes lack clear pagination or other… ▽ More

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of Library and Information Services in Astronomy; to be published in the ASP Conference Series

  50. arXiv:cs/0610029  [pdf, ps, other

    cs.DL cs.DB

    Data in the ADS -- Understanding How to Use it Better

    Authors: Carolyn S. Grant, Alberto Accomazzi, Donna Thompson, Edwin Henneken, Guenther Eichhorn, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA ADS Abstract Service contains a wealth of data for astronomers and librarians alike, yet the vast majority of usage consists of rudimentary searches. Hints on how to obtain more focused search results by using more of the various capabilities of the ADS are presented, including searching by affiliation. We also discuss the classification of articles by content and by referee… ▽ More

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of the Library and Information Services in Astronomy V; to be published by ASP Conference Proceedings