31 items found

Organisations: RISIS2OpenData Groups: Datacite Open Access ZENODO Tags: ACM.ComputingMethodologies_DOCUMENTANDTEXTPROCESSING

Filter Results
  • dataset

    Replication Package: A Study on the Accuracy of OCR Engines for Source Code T...

    The replication package of the paper "A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming Screencasts" including the dataset,...
  • dataset

    Medical_Abstracts_Fin

    ~140 medical abstracts in XML format (standard format downloaded from PMC)
  • dataset

    OCR17: GT for 17th French prints

    Machine learning starts with machine teaching: with our dataset, we distribute the data that we have gathered and created to train reliable OCR models for 17th c. French prints.
  • dataset

    In01038 Tagare Grant Of Bhogivarman. Sanskrit Xml File

    IN01038 Tagare Grant of Bhogivarman. Sanskrit XML file. Sanskrit XML file (without metadata).
  • dataset

    Database schema quality analysis dataset

    Database schema quality analysis dataset
  • dataset

    Graphing/Markup Data: TNA SC8 Petitionary Texts and Community Narrative

    Cleaned CSV files of data samples taken from the British National Archives database; TNA petition XML/TEI data model for the purpose of digitizing and cataloguing documents...
  • dataset

    Survey Questions In Pdf-What Industry Wants From Academia In Software Testing...

    Survey questions in PDF-What industry wants from academia in software testing research
  • dataset

    ArCo Knowledge Graph

    The ArCo knowledge graph contains the ontology network and the data about the cultural properties catalogued by the Italian Institute of the General Catalogue and...
  • dataset

    Contextual Documentation Referencing on Stack Overflow — Supplementary Material

    Supplementary material for our paper "Contextual Documentation Referencing on Stack Overflow".
  • dataset

    dedup_wf_001--5b7a4fce39e3a541b035fa5cba26c33c

    A corpus of 471,085,690 English sentences extracted from the ClueWeb12 Web Crawl. The sentences were sampled from a larger corpus to achieve a level of sentence complexity...
  • dataset

    TEI files in Recogito/Archivos TEI en REcogito

    Video tutorial about the TEI/XML download options of Recogito. Bilingual video. Subtitles in Spanish.
  • dataset

    TEI files in Recogito / Archivos TEI en Recogito

    Video tutorial about the TEI/XML upload options and file sharing using Recogito. Bilingual video. Subtitles in Spanish.
  • dataset

    Bioimages Release 2016-02-25

    Included missing thomas.rdf file in the RDF zip archive
  • dataset

    beast2-paper bModelTest files

    beast2-paper.xml BEAST 2 XML file + outputs of 4 runs of the XML file. beast2-paper/beast2-paper.xml beast2-paper/run/yang.log trace log beast2-paper/run/yang.trees tree log...
  • dataset

    Source codes for preprint Newmark algorithm for dynamic analysis with general...

    Snapshot of a GitLab repository, containing Python source codes of results presented in the preprint by the same authors.
  • publication

    Deep Learning Approaches to Text Production

    Text production is a key component of many NLP applications. In data-driven approaches, it is used for instance, to generate dialogue turns from dialogue moves, to verbalise the...
  • publication

    dedup_wf_001--38d1ce1c3c9bfdd2103c7e7ceb5b1804

    The heydays of XML have come to a close, and XML finds itself confronted with competitors that represent—in some respects—steps backwards.  In this paper we argue that at this...
  • publication

    #MIILS2013 Practical: Bioclipse for RDF minting

    Slides to go with a practical about creating RDF in Bioclipse.
  • publication

    The odd couple: Contrasting openness in innovation and science

    This is a pre-print of a manuscript which is currently under peer review.