<?xml version='1.0' encoding='utf-8'?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-05-18T00:53:07Z</responseDate><request verb="GetRecord" metadataPrefix="oai_dc" identifier="oai:www.bilketa.eus:ark:/27020/Artxiker-hal02553655">https://www.bilketa.eus/in/rest/oai</request><GetRecord><record><header><identifier>oai:www.bilketa.eus:ark:/27020/Artxiker-hal02553655</identifier><setSpec>ALL</setSpec><datestamp>2026-04-05T17:00:08Z</datestamp></header><metadata> <oai_dc:dc xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><dc:identifier>https://www.bilketa.eus/ark:/27020/Artxiker-hal02553655</dc:identifier><dc:contributor>Universidad del País Vasco / Euskal Herriko Unibertsitatea (UPV / EHU)</dc:contributor><dc:contributor>Centre de recherche sur la langue et les textes basques (IKER)</dc:contributor><dc:contributor>Ekaitz Santazilia</dc:contributor><dc:contributor>Dorota Krajewska</dc:contributor><dc:contributor>Eneko Zuloaga</dc:contributor><dc:contributor>Borja Ariztimuño</dc:contributor><dc:creator>Estarrona, Ainara</dc:creator><dc:creator>Etxeberria, Izaskun</dc:creator><dc:creator>Etxepare, Ricardo (1968-....)</dc:creator><dc:creator>Padilla-Moyano, Manuel (19..-....)</dc:creator><dc:creator>Soraluze, Ander</dc:creator><dc:source>IKER, artxibo-hal02553655</dc:source><dc:source>IKER, hal-02553655</dc:source><dc:date>2020</dc:date><dc:description>Lan honetan morfologikoki eta sintaktikoki etiketatutako euskararen corpus historikoaren proiektua aurkezten dugu. Corpusak XV-XVIII.
mende bitarteko euskalki guztietako ekoizpen idatzi esanguratsuena besarkatuko du, haren tamaina milioi bat hitz ingurukoa izanik. Etiketatze morfosintaktikoak hainbat konplexutasun-mailatako bilaketa sistematikoak ahalbidetuko ditu: lemaren, formaren, kategoria gramatikalaren, tasun morfosintaktikoaren, bai eta zenbait egitura sintaktikoren arabera ere. Halaber, corpusa metadatu-sorta batekin hornituko da, irizpide sozio-historikoen araberako bilaketak ere posible egiteko. Euskararen lehen corpus historiko anotatua sortzeaz gainera, proiektu honi esker hizkuntzaren prozesamenduko tresnak gaur egungo euskara batutik urruntzen diren barietateekin trebatuko dira. Harago, proiektu honek etorkizuneko euskara historikoaren corpusgintzarako oinarriak finkatu nahi lituzke.</dc:description><dc:description>International audience</dc:description><dc:description>In this paper we present an ongoing project to build a morphosyntactically annotated historical corpus of Basque. The corpus will have around one million words, encompassing the most significant written production of Basque between the 15th and 18th centuries. Morphosyntactic tagging will allow for systematic searches at different levels of complexity: lemma, form, part of speech, morphosyntactic feature, and also a number of syntactic constructions. In addition, a set of metadata will enable searches based on socio-historical criteria too. Beyond being the first annotated historical corpus of Basque, through this project tools for language processing will be improved by analysing Basque historical varieties more or less distant from present-day standard Basque. 
Moreover, this project aims to establish a model for further works in historical corpora of Basque.</dc:description><dc:identifier>https://hal.science/hal-02553655</dc:identifier><dc:identifier>https://univ-pau.hal.science/hal-02553655v1/file/fontes50urte.15.pdf</dc:identifier><dc:format>Chapitre de livre | Liburu zatia</dc:format><dc:relation>vignette : https://www.bilketa.eus/in/rest/Thumb/image?id=ark:/27020/Artxiker-hal02553655&amp;mat=articleNum</dc:relation><dc:language>baq</dc:language><dc:rights>https://hal.science/licences/copyright/Archive ouverte HAL | HAL artxibo irekia</dc:rights><dc:subject>Digital Humanities</dc:subject><dc:subject>historical corpus</dc:subject><dc:subject>Natural Language Processing (NLP)</dc:subject><dc:subject>diachronic syntax</dc:subject><dc:subject>sintaxi diakronikoa</dc:subject><dc:subject>Humanitate Digitalak</dc:subject><dc:subject>corpus historikoa</dc:subject><dc:subject>Hizkuntzaren Prozesamendua (hp)</dc:subject><dc:subject>Linguistika</dc:subject><dc:subject>Linguistique</dc:subject><dc:title>Sintaktikoki etiketatutako euskarazko corpus historikoa eraikitzen</dc:title><dc:title>Building a syntactically annotated historical corpus of Basque</dc:title></oai_dc:dc></metadata></record></GetRecord></OAI-PMH>