You are browsing the archive for linked data.

Dados Conectados

- November 20, 2017 in dados, Dados Abertos, Destaque, Internet, linked data, W3C

Por Thiago Ávila* Vamos começar este artigo analisando a figura 01, extraída do site DataPortals.org [1], que mostra a ocorrência de catálogos de dados abertos no mundo:

Figura 01 – Distribuição dos catálogos de dados abertos governamentais no mundo [1]

Hummm … 200 catálogos na Europa, 140 na América do Norte, 22 na América do Sul, 23 na África, 21 na Ásia e 15 na Oceania. Tudo bem. E como fazer para responder algumas questões relevantes como:
  • Quais os dados disponíveis sobre ocorrência de doenças no hemisfério sul ?
  • Ou ainda, quais as ocorrências de determinado tipo de crime nestes países ?
  • Quantas escolas foram abertas desde 2008 em cada país que possua um catálogo de dados ?
Como se tratam de catálogos de dados abertos, provavelmente, para tentar responder a pelo menos uma dessas perguntas será preciso acessar os 424 catálogos, buscar em cada um deles o dado desejado, fazer o download de cada conjunto de dado, padronizar o formato de dados, metadados, levar para uma ferramenta de extração, tratamento de carga de dados (ETL), construir uma consulta para depois ter um resultado. Muito esforço, não? E deixando a coisa um pouco mais complexa, se a licença de uso de um conjunto de dados impedir que esse dado seja cruzado com outro dado, ou ainda, se o formato disponibilizado seja proprietário ou um formato de baixa qualidade, como o PDF? Provavelmente você ficará sem responder as suas perguntas. A web que conhecemos atualmente é a web dos documentos onde são priorizados e disponibilizados páginas HTML, arquivos de diversos formatos, como planilhas, documentos de texto, mapas, coordenadas geográficas, animações, conteúdo multimídia, etc. Acontece que os dados, mesmo que estejam disponíveis em formatos abertos, para serem acessíveis primeiro é preciso encontrar o arquivo que armazena os dados, para aí sim, acessar cada dado, pois, em sua maioria são formatos não estruturados e são adequados para facilitar o acesso e leitura para humanos e não são compreensíveis por máquina [2]. Considerando situações corriqueiras como essa, o World Wide Web Consortium – W3C tem desenvolvido muitos esforços para não apenas estabelecer os padrões da internet global, mas para a oferta de dados na Web, como já apresentamos no post anterior. E como seria se pudéssemos acessar diretamente os dados disponíveis na web, mediante consultas a servidores de dados? Consultas que acessem dados de diversas origens, espalhados ao longo do mundo e ainda, obtendo não apenas os dados, mas a semântica relacionada a eles?  Buscando construir esta web dos dados que, dentre outras muitas coisas, resolvem aos problemas corriqueiros do início do artigo que ao longo destes esforços e pesquisas desenvolvidas pelo W3C, Tim Berners-Lee (ele mesmo, o mesmo cara que inventou a Web) propôs um conceito muito promissor que são os Dados Conectados, do termo em inglês, Linked Data [3]. Em definição, Linked Data se resume ao conjunto de boas práticas para a publicação de dados na web. Linked Data define princípios para a publicação e consumo dos dados e os classificam de acordo com sua disponibilidade, acesso, estruturação e conexão [2]. Assim como a web do hipertexto, a web dos dados é construída a partir de documentos na web, porém, diferentemente da web do hipertexto, onde os links são âncoras que relacionam uma página web a outra (ou a um arquivo), na web dos dados, os links são apontados para os dados que são descritos por um framework de recursos, conhecido como RDF (Resource Description Framework). Além disso, cada dado é identificado por um identificador universal – URI (Universal Resource Identifier) e ainda, podem ser acessados mediante uma linguagem de consulta que é o SPARQL (SPARQL Protocol and RDF Query Language). Para um dado ser conectado, ele precisa obedecer aos quatro princípios para publicação [4]:
  1. Use URIs para definir coisas;
  2. Use HTTP URIs para que os dados possam ser encontrados por humanos e agentes na web;
  3. Quando um dado for solicitado através de HTTP URIs, fornecer todas as informações sobre o mesmo, em um formato de dados estruturados utilizando padrões como RDF e SPARQL;
  4. Incluir links para outras fontes de dados relacionados (usando URIs) para que seja possível obter mais informações.
A partir do conceito de Dados Conectados, algumas nações globais já estão considerando este novo paradigma e incentivando a sua produção e oferta. Países como o Reino Unido e os Estados Unidos da América já possuem uma boa oferta de dados em formato RDF nos seus catálogos de dados governamentais. Além disso, grandes projetos em escala global tem crescido a cada ano, como a DBPedia[5], que é a base de dados conectada a partir da Wikipedia ou a LODSpringer[6], que visa ofertar dados conectados sobre artigos, periódicos e conferências científicas editorados pela Springer. Enfim, sobre o Reino Unido já é possível responder a terceira pergunta do início deste artigo “Quantas escolas foram abertas desde 2008 em cada país que possua um catálogo de dados ?”. Basta executar a seguinte consulta SPARQL abaixo: PREFIX sch-ont: <http://education.data.gov.uk/ontology/school#> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> SELECT ?school ?name ?date ?easting ?northing WHERE { ?school a sch-ont:School; sch-ont:establishmentName ?name; sch-ont:openDate ?date ; sch-ont:easting ?easting ; sch-ont:northing ?northing . FILTER (?date > “2008-01-01″^^xsd:date && ?date < “2009-01-01″^^xsd:date) } Nos próximos artigos, continuaremos apresentando o potencial, casos de uso, vantagens, limitações e muito mais sobre o universo dos Dados Conectados. Até a próxima!!!
  • Thiago Ávila é conselheiro consultivo da Open Knowledge Brasil.
  •  Estes artigos são oriundos de pesquisas científicas desenvolvidas no Núcleo de Excelência em Tecnologias Sociais (NEES), do Instituto de Computação da Universidade Federal de Alagoas (UFAL) e contam com a contribuição direta dos pesquisadores Dr. Ig Ibert Bittencourt (UFAL), Dr. Seiji Isotani (USP), e Armando Barbosa, Danila Oliveira, Judson Bandeira, Thiago Ávila e Williams Alcântara (UFAL).
[1] DataPortals. (2015). A Comprehensive List of Open Data Portals from Around the World. Open Knowledge Foundation. Acesso em: jul. 2015. Disponível em: http://www.dataportals.org [2] Bandeira, Judson; Alcantara; Williams;  Barbosa, Armando; Ávila, Thiago; Oliveira, Danila; Bittencourt, I. & Isotani, S. (2014). Dados Abertos Conectados. Jornada de Atualização em Tecnologia da Informação. Anais do III Simpósio Brasileiro de Tecnologia da Informação – SBTI 2014. [3]Berners-Lee, Tim (2006). Linked Data. W3C. Acesso em: jul. 2015. Disponível em: http://www.w3.org/DesignIssues/LinkedData.html [4] Bizer, Christian; Hheath, Tom; Berners-Lee, Tim (2009). Linked data – the story so far. International Journal On Semantic Web And Information Systems, v. 5, n. 3, p. 1-22. [5] DBPedia – http://www.dbpedia.org [6] Springer Linked Open Data – http://lod.springer.com Texto publicado no site Thiago Ávila. Ele faz parte da série de artigos Dados abertos conectados. Flattr this!

Fourth Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications

- December 18, 2014 in ldl, ldl-2015, linguistics, linked data, linked data in linguistics, llod, natural language processing, ontologies, open linguistics, Semantic Web, WG Linguistics

We are very happy to announce the next instantiation of the OWLG’s Linked Data in Linguistics (LDL) workshop series. The OWLG’s fourth Workshop on Linked Data in Linguistics is becoming increasingly international, and, for the first time, will be held outside of Europe: on June 31st, 2015, in Beijing, China, collocated with ACL-IJCNLP 2015.

See you in Beijing!


 

4th Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications
Beijing, June 31st, 2015, http://ldl2015.linguistic-lod.org, collocated with ACL-IJCNLP 2015

 

Workshop Description

The substantial growth in the quantity, diversity and complexity of linguistic data accessible on the Web has led to many new and interesting research areas in Natural Language Processing (NLP) and linguistics. However, resource interoperability represents a major challenge that still needs to be addressed, in particular if information from different sources is combined. With its fourth instantiation, the Linked Data in Linguistics workshop continues to provide a major forum to discuss the creation of linguistic resources on the web using linked data principles, as well as issues of interoperability, distribution protocols, access and integration of language resources and natural language processing pipelines developed on this basis.

As a result of the preceding workshops, a considerable number of resources is now available in the Linguistic Linked Open Data (LLOD) cloud [1]. LDL-2015 will thus specifically welcome papers addressing the usage aspect of Linked Data and related technologies in NLP, linguistics and neighboring fields, such as Digital Humanities.

Organized by the interdisciplinary Open Linguistics Working Group (OWLG) [2], the LDL workshop series is open to researchers from a wide range of disciplines, including (computational) linguistics and NLP, but also the Semantic Web, linguistic typology, corpus linguistics, terminology and lexicography. In 2015, we plan to increase the involvement of the LIDER project [3] and the W3C Community Group on Linked Data for Language Technology (LD4LT) [4], to build on their efforts to facilitate the use of linked data and language resources for commercial applications, and to continue the success of LIDER‘s roadmapping workshop series in engagement with enterprise.

[1] http://linguistics.okfn.org/resources/llod/
[2] http://linguistics.okfn.org/
[3] http://www.lider-project.eu/
[4] http://www.w3.org/community/ld4lt/

Topics of Interest

We invite presentations of algorithms, methodologies, experiments, use cases, project proposals and position papers regarding the creation, publication or application of linguistic data collections and their linking with other resources, as well as descriptions of such data. This includes, but is not limited to, the following:

A. Resources

  • Modelling linguistic data and metadata with OWL and/or RDF.
  • Ontologies for linguistic data and metadata collections as well as cross-lingual retrieval.
  • Descriptions of data sets following Linked Data principles.
  • Legal and social aspects of Linguistic Linked Open Data.
  • Best practices for the publication and linking of multilingual knowledge resources.

B. Applications

  • Applications of such data, other ontologies or linked data from any subdiscipline of linguistics or NLP.
  • The role of (Linguistic) Linked Open Data to address challenges of multilinguality and interoperability.
  • Application and applicability of (Linguistic) Linked Open Data for knowledge extraction, machine translation and other NLP tasks.
  • NLP contributions to (Linguistic) Linked Open Data.

We invite both long (8 pages and 2 pages of references, formatted according to the ACL-IJCNLP guidelines) and short papers (4 pages and 2 pages of references) representing original research, innovative approaches and resource types, use cases or in-depth discussions. Short papers may also represent project proposals, work in progress or data set descriptions.

Dataset Description Papers

In addition to full papers and regular short papers, authors may submit short papers with a dataset descriptions describing a resource’s availability, published location and key statistics (such as size). Such papers do not need to show a novel method for the creation or publishing of the data but *instead* will be judged on the quality, usefulness and clarity of description given in the paper.

For contact information, submission details and last-minute updates, please consult our website under http://ldl2015.linguistic-lod.org

Important Dates

  • May 8th, 2015: Paper submission
  • June 5th, 2015: Notification of Acceptance
  • June 21st, 2015: Camera-Ready Copy
  • June 31st, 2015: Workshop

Organizing Committee

  • Christian Chiarcos (Goethe University Frankfurt, Germany)
  • Philipp Cimiano (Bielefeld University, Germany)
  • Nancy Ide (Vassar College, USA)
  • John P. McCrae (Bielefeld University, Germany)
  • Petya Osenova (Bulgarian Academy of Sciences, Bulgaria)

Program Committee

  • Eneko Agirre (University of the Basque Country, Spain)
  • Guadalupe Aguado (Universidad Politécnica de Madrid, Spain)
  • Claire Bonial (University of Colorado at Boulder, USA)
  • Peter Bouda (Interdisciplinary Centre for Social and Language Documentation, Portugal)
  • Antonio Branco (University of Lisbon, Portugal)
  • Martin Brümmer (University of Leipzig, Germany)
  • Paul Buitelaar (INSIGHT, NUIG Galway, Ireland)
  • Steve Cassidy (Macquarie University, Australia)
  • Nicoletta Calzolari (ILC-CNR, Italy)
  • Thierry Declerck (DFKI, Germany)
  • Ernesto William De Luca (University of Applied Sciences Potsdam, Germany)
  • Gerard de Melo (University of California at Berkeley)
  • Judith Eckle-Kohler (Technische Universität Darmstadt, Germany)
  • Francesca Frontini (ILC-CNR, Italy)
  • Jeff Good (University at Buffalo)
  • Asunción Gómez Pérez (Universidad Politécnica de Madrid, Spain)
  • Jorge Gracia (Universidad Politécnica de Madrid, Spain)
  • Yoshihiko Hayashi (Waseda University, Japan)
  • Fahad Khan (ILC-CNR, Italy)
  • Seiji Koide (National Institute of Informatics, Japan)
  • Lutz Maicher (Universität Leipzig, Germany)
  • Elena Montiel-Ponsoda (Universidad Politécnica de Madrid, Spain)
  • Steven Moran (Universität Zürich, Switzerland)
  • Sebastian Nordhoff (Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany)
  • Antonio Pareja-Lora (Universidad Complutense Madrid, Spain)
  • Maciej Piasecki (Wroclaw University of Technology, Poland)
  • Francesca Quattri (Hong Kong Polytechnic University, Hong Kong)
  • Laurent Romary (INRIA, France)
  • Felix Sasaki (Deutsches Forschungszentrum für Künstliche Intelligenz, Germany)
  • Andrea Schalley (Griffith University, Australia)
  • Gilles Sérraset (Joseph Fourier University, France)
  • Kiril Simov (Bulgarian Academy of Sciences, Sofia, Bulgaria)
  • Milena Slavcheva (JRC-Brussels, Belgium)
  • Armando Stellato (University of Rome, Tor Vergata, Italy)
  • Marco Tadic (University of Zagreb, Croatia)
  • Marieke van Erp (VU University Amsterdam, The Netherlands)
  • Daniel Vila (Universidad Politécnica de Madrid)
  • Cristina Vertan (University of Hamburg, Germany)
  • Walther v. Hahn (University of Hamburg, Germany)
  • Menzo Windhouwer (Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands)

Fourth Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications

- December 18, 2014 in ldl, ldl-2015, linguistics, linked data, linked data in linguistics, llod, natural language processing, ontologies, open linguistics, Semantic Web, WG Linguistics

We are very happy to announce the next instantiation of the OWLG’s Linked Data in Linguistics (LDL) workshop series. The OWLG’s fourth Workshop on Linked Data in Linguistics is becoming increasingly international, and, for the first time, will be held outside of Europe: on June 31st, 2015, in Beijing, China, collocated with ACL-IJCNLP 2015. See you in Beijing!
  4th Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications
Beijing, June 31st, 2015, http://ldl2015.linguistic-lod.org, collocated with ACL-IJCNLP 2015   Workshop Description The substantial growth in the quantity, diversity and complexity of linguistic data accessible on the Web has led to many new and interesting research areas in Natural Language Processing (NLP) and linguistics. However, resource interoperability represents a major challenge that still needs to be addressed, in particular if information from different sources is combined. With its fourth instantiation, the Linked Data in Linguistics workshop continues to provide a major forum to discuss the creation of linguistic resources on the web using linked data principles, as well as issues of interoperability, distribution protocols, access and integration of language resources and natural language processing pipelines developed on this basis. As a result of the preceding workshops, a considerable number of resources is now available in the Linguistic Linked Open Data (LLOD) cloud [1]. LDL-2015 will thus specifically welcome papers addressing the usage aspect of Linked Data and related technologies in NLP, linguistics and neighboring fields, such as Digital Humanities. Organized by the interdisciplinary Open Linguistics Working Group (OWLG) [2], the LDL workshop series is open to researchers from a wide range of disciplines, including (computational) linguistics and NLP, but also the Semantic Web, linguistic typology, corpus linguistics, terminology and lexicography. In 2015, we plan to increase the involvement of the LIDER project [3] and the W3C Community Group on Linked Data for Language Technology (LD4LT) [4], to build on their efforts to facilitate the use of linked data and language resources for commercial applications, and to continue the success of LIDER‘s roadmapping workshop series in engagement with enterprise. [1] http://linguistics.okfn.org/resources/llod/
[2] http://linguistics.okfn.org/
[3] http://www.lider-project.eu/
[4] http://www.w3.org/community/ld4lt/ Topics of Interest
We invite presentations of algorithms, methodologies, experiments, use cases, project proposals and position papers regarding the creation, publication or application of linguistic data collections and their linking with other resources, as well as descriptions of such data. This includes, but is not limited to, the following: A. Resources
  • Modelling linguistic data and metadata with OWL and/or RDF.
  • Ontologies for linguistic data and metadata collections as well as cross-lingual retrieval.
  • Descriptions of data sets following Linked Data principles.
  • Legal and social aspects of Linguistic Linked Open Data.
  • Best practices for the publication and linking of multilingual knowledge resources.
B. Applications
  • Applications of such data, other ontologies or linked data from any subdiscipline of linguistics or NLP.
  • The role of (Linguistic) Linked Open Data to address challenges of multilinguality and interoperability.
  • Application and applicability of (Linguistic) Linked Open Data for knowledge extraction, machine translation and other NLP tasks.
  • NLP contributions to (Linguistic) Linked Open Data.
We invite both long (8 pages and 2 pages of references, formatted according to the ACL-IJCNLP guidelines) and short papers (4 pages and 2 pages of references) representing original research, innovative approaches and resource types, use cases or in-depth discussions. Short papers may also represent project proposals, work in progress or data set descriptions. Dataset Description Papers In addition to full papers and regular short papers, authors may submit short papers with a dataset descriptions describing a resource’s availability, published location and key statistics (such as size). Such papers do not need to show a novel method for the creation or publishing of the data but instead will be judged on the quality, usefulness and clarity of description given in the paper. For contact information, submission details and last-minute updates, please consult our website under http://ldl2015.linguistic-lod.org Important Dates
  • May 8th, 2015: Paper submission
  • June 5th, 2015: Notification of Acceptance
  • June 21st, 2015: Camera-Ready Copy
  • June 31st, 2015: Workshop
Organizing Committee
  • Christian Chiarcos (Goethe University Frankfurt, Germany)
  • Philipp Cimiano (Bielefeld University, Germany)
  • Nancy Ide (Vassar College, USA)
  • John P. McCrae (Bielefeld University, Germany)
  • Petya Osenova (Bulgarian Academy of Sciences, Bulgaria)
Program Committee
  • Eneko Agirre (University of the Basque Country, Spain)
  • Guadalupe Aguado (Universidad Politécnica de Madrid, Spain)
  • Claire Bonial (University of Colorado at Boulder, USA)
  • Peter Bouda (Interdisciplinary Centre for Social and Language Documentation, Portugal)
  • Antonio Branco (University of Lisbon, Portugal)
  • Martin Brümmer (University of Leipzig, Germany)
  • Paul Buitelaar (INSIGHT, NUIG Galway, Ireland)
  • Steve Cassidy (Macquarie University, Australia)
  • Nicoletta Calzolari (ILC-CNR, Italy)
  • Thierry Declerck (DFKI, Germany)
  • Ernesto William De Luca (University of Applied Sciences Potsdam, Germany)
  • Gerard de Melo (University of California at Berkeley)
  • Judith Eckle-Kohler (Technische Universität Darmstadt, Germany)
  • Francesca Frontini (ILC-CNR, Italy)
  • Jeff Good (University at Buffalo)
  • Asunción Gómez Pérez (Universidad Politécnica de Madrid, Spain)
  • Jorge Gracia (Universidad Politécnica de Madrid, Spain)
  • Yoshihiko Hayashi (Waseda University, Japan)
  • Fahad Khan (ILC-CNR, Italy)
  • Seiji Koide (National Institute of Informatics, Japan)
  • Lutz Maicher (Universität Leipzig, Germany)
  • Elena Montiel-Ponsoda (Universidad Politécnica de Madrid, Spain)
  • Steven Moran (Universität Zürich, Switzerland)
  • Sebastian Nordhoff (Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany)
  • Antonio Pareja-Lora (Universidad Complutense Madrid, Spain)
  • Maciej Piasecki (Wroclaw University of Technology, Poland)
  • Francesca Quattri (Hong Kong Polytechnic University, Hong Kong)
  • Laurent Romary (INRIA, France)
  • Felix Sasaki (Deutsches Forschungszentrum für Künstliche Intelligenz, Germany)
  • Andrea Schalley (Griffith University, Australia)
  • Gilles Sérraset (Joseph Fourier University, France)
  • Kiril Simov (Bulgarian Academy of Sciences, Sofia, Bulgaria)
  • Milena Slavcheva (JRC-Brussels, Belgium)
  • Armando Stellato (University of Rome, Tor Vergata, Italy)
  • Marco Tadic (University of Zagreb, Croatia)
  • Marieke van Erp (VU University Amsterdam, The Netherlands)
  • Daniel Vila (Universidad Politécnica de Madrid)
  • Cristina Vertan (University of Hamburg, Germany)
  • Walther v. Hahn (University of Hamburg, Germany)
  • Menzo Windhouwer (Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands)

Fourth Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications

- December 18, 2014 in ldl, ldl-2015, linguistics, linked data, linked data in linguistics, llod, natural language processing, ontologies, open linguistics, Semantic Web, WG Linguistics

We are very happy to announce the next instantiation of the OWLG’s Linked Data in Linguistics (LDL) workshop series. The OWLG’s fourth Workshop on Linked Data in Linguistics is becoming increasingly international, and, for the first time, will be held outside of Europe: on June 31st, 2015, in Beijing, China, collocated with ACL-IJCNLP 2015. See you in Beijing!
  4th Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications Beijing, June 31st, 2015, http://ldl2015.linguistic-lod.org, collocated with ACL-IJCNLP 2015   Workshop Description The substantial growth in the quantity, diversity and complexity of linguistic data accessible on the Web has led to many new and interesting research areas in Natural Language Processing (NLP) and linguistics. However, resource interoperability represents a major challenge that still needs to be addressed, in particular if information from different sources is combined. With its fourth instantiation, the Linked Data in Linguistics workshop continues to provide a major forum to discuss the creation of linguistic resources on the web using linked data principles, as well as issues of interoperability, distribution protocols, access and integration of language resources and natural language processing pipelines developed on this basis. As a result of the preceding workshops, a considerable number of resources is now available in the Linguistic Linked Open Data (LLOD) cloud [1]. LDL-2015 will thus specifically welcome papers addressing the usage aspect of Linked Data and related technologies in NLP, linguistics and neighboring fields, such as Digital Humanities. Organized by the interdisciplinary Open Linguistics Working Group (OWLG) [2], the LDL workshop series is open to researchers from a wide range of disciplines, including (computational) linguistics and NLP, but also the Semantic Web, linguistic typology, corpus linguistics, terminology and lexicography. In 2015, we plan to increase the involvement of the LIDER project [3] and the W3C Community Group on Linked Data for Language Technology (LD4LT) [4], to build on their efforts to facilitate the use of linked data and language resources for commercial applications, and to continue the success of LIDER‘s roadmapping workshop series in engagement with enterprise. [1] http://linguistics.okfn.org/resources/llod/ [2] http://linguistics.okfn.org/ [3] http://www.lider-project.eu/ [4] http://www.w3.org/community/ld4lt/ Topics of Interest We invite presentations of algorithms, methodologies, experiments, use cases, project proposals and position papers regarding the creation, publication or application of linguistic data collections and their linking with other resources, as well as descriptions of such data. This includes, but is not limited to, the following: A. Resources
  • Modelling linguistic data and metadata with OWL and/or RDF.
  • Ontologies for linguistic data and metadata collections as well as cross-lingual retrieval.
  • Descriptions of data sets following Linked Data principles.
  • Legal and social aspects of Linguistic Linked Open Data.
  • Best practices for the publication and linking of multilingual knowledge resources.
B. Applications
  • Applications of such data, other ontologies or linked data from any subdiscipline of linguistics or NLP.
  • The role of (Linguistic) Linked Open Data to address challenges of multilinguality and interoperability.
  • Application and applicability of (Linguistic) Linked Open Data for knowledge extraction, machine translation and other NLP tasks.
  • NLP contributions to (Linguistic) Linked Open Data.
We invite both long (8 pages and 2 pages of references, formatted according to the ACL-IJCNLP guidelines) and short papers (4 pages and 2 pages of references) representing original research, innovative approaches and resource types, use cases or in-depth discussions. Short papers may also represent project proposals, work in progress or data set descriptions. Dataset Description Papers In addition to full papers and regular short papers, authors may submit short papers with a dataset descriptions describing a resource’s availability, published location and key statistics (such as size). Such papers do not need to show a novel method for the creation or publishing of the data but instead will be judged on the quality, usefulness and clarity of description given in the paper. For contact information, submission details and last-minute updates, please consult our website under http://ldl2015.linguistic-lod.org Important Dates
  • May 8th, 2015: Paper submission
  • June 5th, 2015: Notification of Acceptance
  • June 21st, 2015: Camera-Ready Copy
  • June 31st, 2015: Workshop
Organizing Committee
  • Christian Chiarcos (Goethe University Frankfurt, Germany)
  • Philipp Cimiano (Bielefeld University, Germany)
  • Nancy Ide (Vassar College, USA)
  • John P. McCrae (Bielefeld University, Germany)
  • Petya Osenova (Bulgarian Academy of Sciences, Bulgaria)
Program Committee
  • Eneko Agirre (University of the Basque Country, Spain)
  • Guadalupe Aguado (Universidad Politécnica de Madrid, Spain)
  • Claire Bonial (University of Colorado at Boulder, USA)
  • Peter Bouda (Interdisciplinary Centre for Social and Language Documentation, Portugal)
  • Antonio Branco (University of Lisbon, Portugal)
  • Martin Brümmer (University of Leipzig, Germany)
  • Paul Buitelaar (INSIGHT, NUIG Galway, Ireland)
  • Steve Cassidy (Macquarie University, Australia)
  • Nicoletta Calzolari (ILC-CNR, Italy)
  • Thierry Declerck (DFKI, Germany)
  • Ernesto William De Luca (University of Applied Sciences Potsdam, Germany)
  • Gerard de Melo (University of California at Berkeley)
  • Judith Eckle-Kohler (Technische Universität Darmstadt, Germany)
  • Francesca Frontini (ILC-CNR, Italy)
  • Jeff Good (University at Buffalo)
  • Asunción Gómez Pérez (Universidad Politécnica de Madrid, Spain)
  • Jorge Gracia (Universidad Politécnica de Madrid, Spain)
  • Yoshihiko Hayashi (Waseda University, Japan)
  • Fahad Khan (ILC-CNR, Italy)
  • Seiji Koide (National Institute of Informatics, Japan)
  • Lutz Maicher (Universität Leipzig, Germany)
  • Elena Montiel-Ponsoda (Universidad Politécnica de Madrid, Spain)
  • Steven Moran (Universität Zürich, Switzerland)
  • Sebastian Nordhoff (Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany)
  • Antonio Pareja-Lora (Universidad Complutense Madrid, Spain)
  • Maciej Piasecki (Wroclaw University of Technology, Poland)
  • Francesca Quattri (Hong Kong Polytechnic University, Hong Kong)
  • Laurent Romary (INRIA, France)
  • Felix Sasaki (Deutsches Forschungszentrum für Künstliche Intelligenz, Germany)
  • Andrea Schalley (Griffith University, Australia)
  • Gilles Sérraset (Joseph Fourier University, France)
  • Kiril Simov (Bulgarian Academy of Sciences, Sofia, Bulgaria)
  • Milena Slavcheva (JRC-Brussels, Belgium)
  • Armando Stellato (University of Rome, Tor Vergata, Italy)
  • Marco Tadic (University of Zagreb, Croatia)
  • Marieke van Erp (VU University Amsterdam, The Netherlands)
  • Daniel Vila (Universidad Politécnica de Madrid)
  • Cristina Vertan (University of Hamburg, Germany)
  • Walther v. Hahn (University of Hamburg, Germany)
  • Menzo Windhouwer (Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands)

Fourth Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications

- December 18, 2014 in ldl, ldl-2015, linguistics, linked data, linked data in linguistics, llod, natural language processing, ontologies, open linguistics, Semantic Web, WG Linguistics

We are very happy to announce the next instantiation of the OWLG’s Linked Data in Linguistics (LDL) workshop series. The OWLG’s fourth Workshop on Linked Data in Linguistics is becoming increasingly international, and, for the first time, will be held outside of Europe: on June 31st, 2015, in Beijing, China, collocated with ACL-IJCNLP 2015. See you in Beijing!
  4th Workshop on Linked Data in Linguistics (LDL-2015): Resources and Applications Beijing, June 31st, 2015, http://ldl2015.linguistic-lod.org, collocated with ACL-IJCNLP 2015   Workshop Description The substantial growth in the quantity, diversity and complexity of linguistic data accessible on the Web has led to many new and interesting research areas in Natural Language Processing (NLP) and linguistics. However, resource interoperability represents a major challenge that still needs to be addressed, in particular if information from different sources is combined. With its fourth instantiation, the Linked Data in Linguistics workshop continues to provide a major forum to discuss the creation of linguistic resources on the web using linked data principles, as well as issues of interoperability, distribution protocols, access and integration of language resources and natural language processing pipelines developed on this basis. As a result of the preceding workshops, a considerable number of resources is now available in the Linguistic Linked Open Data (LLOD) cloud [1]. LDL-2015 will thus specifically welcome papers addressing the usage aspect of Linked Data and related technologies in NLP, linguistics and neighboring fields, such as Digital Humanities. Organized by the interdisciplinary Open Linguistics Working Group (OWLG) [2], the LDL workshop series is open to researchers from a wide range of disciplines, including (computational) linguistics and NLP, but also the Semantic Web, linguistic typology, corpus linguistics, terminology and lexicography. In 2015, we plan to increase the involvement of the LIDER project [3] and the W3C Community Group on Linked Data for Language Technology (LD4LT) [4], to build on their efforts to facilitate the use of linked data and language resources for commercial applications, and to continue the success of LIDER‘s roadmapping workshop series in engagement with enterprise. [1] http://linguistics.okfn.org/resources/llod/ [2] http://linguistics.okfn.org/ [3] http://www.lider-project.eu/ [4] http://www.w3.org/community/ld4lt/ Topics of Interest We invite presentations of algorithms, methodologies, experiments, use cases, project proposals and position papers regarding the creation, publication or application of linguistic data collections and their linking with other resources, as well as descriptions of such data. This includes, but is not limited to, the following: A. Resources
  • Modelling linguistic data and metadata with OWL and/or RDF.
  • Ontologies for linguistic data and metadata collections as well as cross-lingual retrieval.
  • Descriptions of data sets following Linked Data principles.
  • Legal and social aspects of Linguistic Linked Open Data.
  • Best practices for the publication and linking of multilingual knowledge resources.
B. Applications
  • Applications of such data, other ontologies or linked data from any subdiscipline of linguistics or NLP.
  • The role of (Linguistic) Linked Open Data to address challenges of multilinguality and interoperability.
  • Application and applicability of (Linguistic) Linked Open Data for knowledge extraction, machine translation and other NLP tasks.
  • NLP contributions to (Linguistic) Linked Open Data.
We invite both long (8 pages and 2 pages of references, formatted according to the ACL-IJCNLP guidelines) and short papers (4 pages and 2 pages of references) representing original research, innovative approaches and resource types, use cases or in-depth discussions. Short papers may also represent project proposals, work in progress or data set descriptions. Dataset Description Papers In addition to full papers and regular short papers, authors may submit short papers with a dataset descriptions describing a resource’s availability, published location and key statistics (such as size). Such papers do not need to show a novel method for the creation or publishing of the data but instead will be judged on the quality, usefulness and clarity of description given in the paper. For contact information, submission details and last-minute updates, please consult our website under http://ldl2015.linguistic-lod.org Important Dates
  • May 8th, 2015: Paper submission
  • June 5th, 2015: Notification of Acceptance
  • June 21st, 2015: Camera-Ready Copy
  • June 31st, 2015: Workshop
Organizing Committee
  • Christian Chiarcos (Goethe University Frankfurt, Germany)
  • Philipp Cimiano (Bielefeld University, Germany)
  • Nancy Ide (Vassar College, USA)
  • John P. McCrae (Bielefeld University, Germany)
  • Petya Osenova (Bulgarian Academy of Sciences, Bulgaria)
Program Committee
  • Eneko Agirre (University of the Basque Country, Spain)
  • Guadalupe Aguado (Universidad Politécnica de Madrid, Spain)
  • Claire Bonial (University of Colorado at Boulder, USA)
  • Peter Bouda (Interdisciplinary Centre for Social and Language Documentation, Portugal)
  • Antonio Branco (University of Lisbon, Portugal)
  • Martin Brümmer (University of Leipzig, Germany)
  • Paul Buitelaar (INSIGHT, NUIG Galway, Ireland)
  • Steve Cassidy (Macquarie University, Australia)
  • Nicoletta Calzolari (ILC-CNR, Italy)
  • Thierry Declerck (DFKI, Germany)
  • Ernesto William De Luca (University of Applied Sciences Potsdam, Germany)
  • Gerard de Melo (University of California at Berkeley)
  • Judith Eckle-Kohler (Technische Universität Darmstadt, Germany)
  • Francesca Frontini (ILC-CNR, Italy)
  • Jeff Good (University at Buffalo)
  • Asunción Gómez Pérez (Universidad Politécnica de Madrid, Spain)
  • Jorge Gracia (Universidad Politécnica de Madrid, Spain)
  • Yoshihiko Hayashi (Waseda University, Japan)
  • Fahad Khan (ILC-CNR, Italy)
  • Seiji Koide (National Institute of Informatics, Japan)
  • Lutz Maicher (Universität Leipzig, Germany)
  • Elena Montiel-Ponsoda (Universidad Politécnica de Madrid, Spain)
  • Steven Moran (Universität Zürich, Switzerland)
  • Sebastian Nordhoff (Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany)
  • Antonio Pareja-Lora (Universidad Complutense Madrid, Spain)
  • Maciej Piasecki (Wroclaw University of Technology, Poland)
  • Francesca Quattri (Hong Kong Polytechnic University, Hong Kong)
  • Laurent Romary (INRIA, France)
  • Felix Sasaki (Deutsches Forschungszentrum für Künstliche Intelligenz, Germany)
  • Andrea Schalley (Griffith University, Australia)
  • Gilles Sérraset (Joseph Fourier University, France)
  • Kiril Simov (Bulgarian Academy of Sciences, Sofia, Bulgaria)
  • Milena Slavcheva (JRC-Brussels, Belgium)
  • Armando Stellato (University of Rome, Tor Vergata, Italy)
  • Marco Tadic (University of Zagreb, Croatia)
  • Marieke van Erp (VU University Amsterdam, The Netherlands)
  • Daniel Vila (Universidad Politécnica de Madrid)
  • Cristina Vertan (University of Hamburg, Germany)
  • Walther v. Hahn (University of Hamburg, Germany)
  • Menzo Windhouwer (Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands)

Semantic Web views from EKAW 2014

- December 4, 2014 in Bibliographic, Events, linked data, tools

Last week (24th to 28th of November 2014) I attended the the 19th International Conference on Knowledge Engineering and Knowledge Management (EKAW2014). Set in Linköping, a charming (and chilly) city in Sweden, the conference had a strong focus on Semantic Web related areas, such as Description Logics, OWL and Linked Data. For tweets about the more »

Semantic Web views from EKAW 2014

- December 4, 2014 in Bibliographic, Events, linked data, tools

Last week (24th to 28th of November 2014) I attended the the 19th International Conference on Knowledge Engineering and Knowledge Management (EKAW2014). Set in Linköping, a charming (and chilly) city in Sweden, the conference had a strong focus on Semantic Web related areas, such as Description Logics, OWL and Linked Data. For tweets about the more »

Semantic Web views from EKAW 2014

- December 4, 2014 in Bibliographic, Events, linked data, tools

Last week (24th to 28th of November 2014) I attended the the 19th International Conference on Knowledge Engineering and Knowledge Management (EKAW2014). Set in Linköping, a charming (and chilly) city in Sweden, the conference had a strong focus on Semantic Web related areas, such as Description Logics, OWL and Linked Data. For tweets about the more »

Semantic Web views from EKAW 2014

- December 4, 2014 in Bibliographic, Events, linked data, tools

Last week (24th to 28th of November 2014) I attended the the 19th International Conference on Knowledge Engineering and Knowledge Management (EKAW2014). Set in Linköping, a charming (and chilly) city in Sweden, the conference had a strong focus on Semantic Web related areas, such as Description Logics, OWL and Linked Data. For tweets about the more »

Scottish Linked Data Interest Group workshop

- November 9, 2014 in Events, linked data, Open Data

The School of Informatics hosted the 3rd Scottish Linked Data Interest Group Workshop on the 10th September 2014, following on from two successful workshops earlier in the year organised by Jeff Pan in the University of Aberdeen. The event brought together a diverse set of about 30 people with different interests and perspectives on Linked more »