You are browsing the archive for academia.

Escola de Dados realiza Coda.Br nos dias 10 e 11 de novembro

- November 7, 2018 in academia, Coda.Br, conferência, dados, Destaque, Escola de Dados, Evento, Jornalismo de dados, Métodos Digitais, métricas, monitoramento

Mais de 40 palestrantes nacionais e internacionais estão confirmados para a terceira edição da Conferência Brasileira de Jornalismo de Dados e Métodos Digitais, Coda.Br, organizada pela Escola de Dados. O evento ocorre em São Paulo nos dias 10 e 11 de novembro, na ESPM. Confira a programação completa e inscreva-se no site: coda.escoladedados.org. O Coda.Br recebe Jeremy Merril da ProPublica, premiadíssimo veículo norte-americano de jornalismo investigativo; Alberto Cairo, referência internacional quando o assunto é visualização de informações; Fernanda Viegas, pesquisadora sênior do Google; Neale El-Dash, cientista político fundador do Polling Data; entre outras dezenas de especialistas já confirmados.   Como conseguir evidências ou boas histórias utilizando bases de dados públicas? Que tal criar mapas para visualizar sua informação espacialmente? Ou usar a linguagem R para analisar políticas governamentais? São mais de 60 horas de workshops, apresentações e oficinas práticas com convidados nacionais e internacionais. Não sabe por onde começar? O Coda.Br vai ter momentos de debates abertos e atividades introdutórias para ensinar a lidar com bases de dados massivas, usando Python ou SQL, por exemplo. Mas se você quer aprofundar, vale conferir as atividades sobre Machine Learning ou quem sabe se aventurar nos workshops de Estatística Avançada ou Processamento de Linguagem Natural aplicados ao jornalismo. Além das mesas e dos workshops práticos, o evento vai ter espaços de networking sobre dados abertos, bootcamps de 6h para quem deseja aprender a programar em Python ou R e sessões de mentoria no estilo “Traga seu problema”, onde os participantes vão poder tirar dúvidas de projetos em andamento ou já realizados. Ainda há ingressos disponíveis. Garanta já o seu aqui. (Dica: junte um grupo de quatro ou mais amigos, colegas de trabalho ou da faculdade, para conseguir descontos. Basta entrar em contato no e-mail de contato da Escola de Dados) Se você já está mais certo que as planilhas, confirme também no evento no Facebook e convide amigos por lá. Ajude a notícia a chegar em possíveis interessados e ampliar a comunidade de jornalistas e cientistas de dados no Brasil. Esperamos você lá! Flattr this!

CKAN for Research Data Management workshop

- February 27, 2013 in academia, CKAN4RDM, Events, jiscMRD, orbitalMRD, research data

The ‘researcher’ user group

On 18 February, over 40 people participated in a workshop in London which focused on the use of CKAN for ‘research data management’. The event was led by the Orbital project at the University of Lincoln. After some initial presentations, participants broke into groups to gather a list of requirements for CKAN – or any data management system – in a research setting. You can read all about the day, and what we produced, over on the Orbital project blog. There are also write-ups from the DCC, Datapool, data.bris and LSE. We hope that the workshop marked the beginning of a CKAN user and developer community within the higher education and research sectors. If you’re interested in joining us, feel free to add yourself to the new CKAN4RDM mailing list. The results of the workshop will also form part of the research for a paper I am presenting at the IASSIST conference in May. The abstract is as follows:
This paper offers a full and critical evaluation of the open source CKAN software <> for use as a Research Data Management (RDM) tool within a university environment. It presents a case study of CKAN’s implementation and use at the University of Lincoln, UK, and highlights its strengths and current weaknesses as an institutional Research Data Management tool. The author draws on his prior experience of implementing a mixed media Digital Asset Management system (DAM), Institutional Repository (IR) and institutional Web Content Management System (CMS), to offer an outline proposal for how CKAN can be used effectively for data analysis, storage and publishing in academia. This will be of interest to researchers, data librarians, and developers, who are responsible for the implementation of institutional RDM infrastructure. This paper is presented as part of the dissemination activities of the JISC-funded Orbital project <http://orbital.blogs.lincoln.ac.uk>.
I hope to see some of you there.

First Open Economics International Workshop Recap

- January 25, 2013 in academia, Advisory Panel, collaboration, Economic Publishing, economics profession, Events, Featured, Open Access, Open Data, Open Economics, Open Research, Open Tools, research, Workshop

The first Open Economics International Workshop gathered 40 academic economists, data publishers and funders of economics research, researchers and practitioners to a two-day event at Emmanuel College in Cambridge, UK. The aim of the workshop was to build an understanding around the value of open data and open tools for the Economics profession and the obstacles to opening up information, as well as the role of greater openness of the academy. This event was organised by the Open Knowledge Foundation and the Centre for Intellectual Property and Information Law and was supported by the Alfred P. Sloan Foundation. Audio and slides are available at the event’s webpage. Open Economics Workshop

Setting the Scene

The Setting the Scene session was about giving a bit of context to “Open Economics” in the knowledge society, seeing also examples from outside of the discipline and discussing reproducible research. Rufus Pollock (Open Knowledge Foundation) emphasised that there is necessary change and substantial potential for economics: 1) open “core” economic data outside the academy, 2) open as default for data in the academy, 3) a real growth in citizen economics and outside participation. Daniel Goroff (Alfred P. Sloan Foundation) drew attention to the work of the Alfred P. Sloan Foundation in emphasising the importance of knowledge and its use for making decisions and data and knowledge as a non-rival, non-excludable public good. Tim Hubbard (Wellcome Trust Sanger Institute) spoke about the potential of large-scale data collection around individuals for improving healthcare and how centralised global repositories work in the field of bioinformatics. Victoria Stodden (Columbia University / RunMyCode) stressed the importance of reproducibility for economic research and as an essential part of scientific methodology and presented the RunMyCode project.

Open Data in Economics

The Open Data in Economics session was chaired by Christian Zimmermann (Federal Reserve Bank of St. Louis / RePEc) and was about several projects and ideas from various institutions. The session examined examples of open data in Economics and sought to discover whether these examples are sustainable and can be implemented in other contexts: whether the right incentives exist. Paul David (Stanford University / SIEPR) characterised the open science system as a system which is better than any other in the rapid accumulation of reliable knowledge, whereas the proprietary systems are very good in extracting the rent from the existing knowledge. A balance between these two systems should be established so that they can work within the same organisational system since separately they are distinctly suboptimal. Johannes Kiess (World Bank) underlined that having the data available is often not enough: “It is really important to teach people how to understand these datasets: data journalists, NGOs, citizens, coders, etc.”. The World Bank has implemented projects to incentivise the use of the data and is helping countries to open up their data. For economists, he mentioned, having a valuable dataset to publish on is an important asset, there are therefore not sufficient incentives for sharing. Eustáquio J. Reis (Institute of Applied Economic Research – Ipea) related his experience on establishing the Ipea statistical database and other projects for historical data series and data digitalisation in Brazil. He shared that the culture of the economics community is not a culture of collaboration where people willingly share or support and encourage data curation. Sven Vlaeminck (ZBW – Leibniz Information Centre for Economics) spoke about the EDaWaX project which conducted a study of the data-availability of economics journals and will establish publication-related data archive for an economics journal in Germany.

Legal, Cultural and other Barriers to Information Sharing in Economics

The session presented different impediments to the disclosure of data in economics from the perspective of two lawyers and two economists. Lionel Bently (University of Cambridge / CIPIL) drew attention to the fact that there is a whole range of different legal mechanism which operate to restrict the dissemination of information, yet on the other hand there is also a range of mechanism which help to make information available. Lionel questioned whether the open data standard would be always the optimal way to produce high quality economic research or whether there is also a place for modulated/intermediate positions where data is available only on conditions, or only in certain part or for certain forms of use. Mireille van Eechoud (Institute for Information Law) described the EU Public Sector Information Directive – the most generic document related to open government data and progress made for opening up information published by the government. Mireille also pointed out that legal norms have only limited value if you don’t have the internalised, cultural attitudes and structures in place that really make more access to information work. David Newbery (University of Cambridge) presented an example from the electricity markets and insisted that for a good supply of data, informed demand is needed, coming from regulators who are charged to monitor markets, detect abuse, uphold fair competition and defend consumers. John Rust (Georgetown University) said that the government is an important provider of data which is otherwise too costly to collect, yet a number of issues exist including confidentiality, excessive bureaucratic caution and the public finance crisis. There are a lot of opportunities for research also in the private sector where some part of the data can be made available (redacting confidential information) and the public non-profit sector also can have a tremendous role as force to organise markets for the better, set standards and focus of targeted domains.

Current Data Deposits and Releases – Mandating Open Data?

The session was chaired by Daniel Goroff (Alfred P. Sloan Foundation) and brought together funders and publishers to discuss their role in requiring data from economic research to be publicly available and the importance of dissemination for publishing. Albert Bravo-Biosca (NESTA) emphasised that mandating open data begins much earlier in the process where funders can encourage the collection of particular data by the government which is the basis for research and can also act as an intermediary for the release of open data by the private sector. Open data is interesting but it is even more interesting when it is appropriately linked and combined with other data and the there is a value in examples and case studies for demonstrating benefits. There should be however caution as opening up some data might result in less data being collected. Toby Green (OECD Publishing) made a point of the different between posting and publishing, where making content available does not always mean that it would be accessible, discoverable, usable and understandable. In his view, the challenge is to build up an audience by putting content where people would find it, which is very costly as proper dissemination is expensive. Nancy Lutz (National Science Foundation) explained the scope and workings of the NSF and the data management plans required from all economists who are applying for funding. Creating and maintaining data infrastructure and compliance with the data management policy might eventually mean that there would be less funding for other economic research.

Trends of Greater Participation and Growing Horizons in Economics

Chris Taggart (OpenCorporates) chaired the session which introduced different ways of participating and using data, different audiences and contributors. He stressed that data is being collected in new ways and by different communities, that access to data can be an enormous privilege and can generate data gravities with very unequal access and power to make use of and to generate more data and sometimes analysis is being done in new and unexpected ways and by unexpected contributors. Michael McDonald (George Mason University) related how the highly politicised process of drawing up district lines in the U.S. (also called Gerrymandering) could be done in a much more transparent way through an open-source re-districting process with meaningful participation allowing for an open conversation about public policy. Michael also underlined the importance of common data formats and told a cautionary tale about a group of academics misusing open data with a political agenda to encourage a storyline that a candidate would win a particular state. Hans-Peter Brunner (Asian Development Bank) shared a vision about how open data and open analysis can aid in decision-making about investments in infrastructure, connectivity and policy. Simulated models about investments can demonstrate different scenarios according to investment priorities and crowd-sourced ideas. Hans-Peter asked for feedback and input on how to make data and code available. Perry Walker (new economics foundation) spoke about the conversation and that a good conversation has to be designed as it usually doesn’t happen by accident. Rufus Pollock (Open Knowledge Foundation) concluded with examples about citizen economics and the growth of contributions from the wider public, particularly through volunteering computing and volunteer thinking as a way of getting engaged in research. During two sessions, the workshop participants also worked on Statement on the Open Economics principles will be revised with further input from the community and will be made public on the second Open Economics workshop taking place on 11-12 June in Cambridge, MA.

Open Research Data Handbook Sprint

- January 17, 2013 in academia, community, Events, Featured, Open Data, Open Economics, Open Research, Open Tools, research, Sprint

On February 15-16, the Open Research Data Handbook Sprint will happen at the Open Data Institute, 65 Clifton Street, London EC2A 4JE. The Open Research Data Handbook aims to provide an introduction to the processes, tools and other areas that researchers need to consider to make their research data openly available. Join us for a book sprint to develop the current draft, and explore ways to remix it for different disciplines and contexts.

Who it is for:

  • Researchers interested in carrying out their work in more open ways
  • Experts on sharing research and research data
  • Writers and copy editors
  • Web developers and designers to help present the handbook online
  • Anyone else interested in taking part in an intense and collaborative weekend of action

Register at Eventbrite

What will happen:

The main sprint will take place on Friday and Saturday. After initial discussions we’ll divide into open space groups to focus on research, writing and editing for different chapters of the handbook, developing a range of content including How To guidance, stories of impact, collections of links and decision tools. A group will also look at digital tools for presenting the handbook online, including ways to easily tag content for different audiences and remix the guide for different contexts.

Agenda:

Week before & after:
  • Calling for online contributions and reviews
Friday:
  • 12.00 – 14:00: Seminar or bring your own lunch on open research data
  • 14:00 – 17:30: planning and initial work in the handbook in small teams
Saturday:
  • 10.00 – 10:30: Arrive and coffee
  • 10.30 – 11.30: Introducing open research – lightning talks
  • 11.30 – 13:30: Forming teams and starting sprint. Groups on:
    • Writing chapters
    • Decision tools
    • Building website & framework for book
    • Remixing guide for particular contexts
  • 13.30 – 14:30: Lunch
  • 14.30 – 16:30: Working in teams
  • 17.30 – 18:30: Report back
  • 18:30 – …… : Pub

Partners:

OKF Open Science Working Group – creators of the current Open Research Data Handbook
OKF Open Economic Working Group – exploring economics aspects of open research
Open Data Research Network - exploring a remix of the handbook to support open social science
research in a new global research network, focussed on research in the Global South.
Open Data Institute – hosting the event