You are browsing the archive for data literacy.

Apply Now! School of Data’s 2018 Fellowship Programme

Cédric Lombion - April 17, 2018 in data literacy, School of Data, School of Data Fellows, training

This blog has been reposted from the School of Data blog School of Data is inviting journalists, data scientists, civil society advocates and anyone interested in advancing data literacy to apply for its 2018 Fellowship Programme, which will run from May 2018 to January 2019. 8 positions are open, 1 in each of the following countries: Bolivia, Guatemala, Ghana, Indonesia, Kenya, Malawi, Tanzania, The Philippines. The application deadline is set on Sunday, May 6th of 2018. If you would like to sponsor a fellowship, please get in touch with School of Data. Apply for the Fellowship Programme

The Fellowship

School of Data works to empower civil society organisations, journalists and citizens with the skills they need to use data effectively in their efforts to create more equitable and effective societies. Fellowships are nine-month placements with School of Data for data-literacy practitioners or enthusiasts. During this time, Fellows work alongside School of Data to build an individual programme that will make use of both the collective experience of School of Data’s network to help Fellows gain new skills, and the knowledge that Fellows bring along with them, be it about a topic, a community or specific data literacy challenges. Similarly to previous years, our aim with the Fellowship programme is to increase awareness of data literacy and build communities who together, can use data literacy skills to make the change they want to see in the world. The 2018 Fellowship will continue the work in the thematic approach pioneered by the 2016 class. As a result, we will be prioritising candidates who:
  • possess experience in, and enthusiasm for, a specific area of data literacy training
  • can demonstrate links with an organisation practising in this defined area and/or links with an established network operating in the field
We are looking for engaged individuals who already have in-depth knowledge of a given sector or specific skillsets that can be applied to this year’s focus topics.. This will help Fellows get off to a running start and achieve the most during their time with School of Data: nine months fly by! Read More about the Fellowship Programme

The areas of focus in 2018

We have partnered with Hivos and NRGI to work on the following themes: Procurement and data in the extractives industry (oil, mining, gas). These amazing partner organisations will provide Fellows with guidance, mentorship and expertise in their respective domains.

2018 Fellowship Positions

Bolivia The Fellowship in Bolivia will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: Experience with and interest in community building, experience with the implementation of civic projects with a data or technical component, storytelling skills, and experience with promoting data or technical stories to a wide audience, basic understanding of the public procurement process Guatemala The Fellowship in Guatemala will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: Experience in the planning, coordination and implementation of projects with civil society organisations, the ability to advise and train organisations on working with data and delivering technical projects, basic understanding of the public procurement process Ghana The Fellowship in Ghana with be focused on extractives Data through the Media Development Programme at NRGI. For this position, School of Data is looking for someone with: an interest in supporting or working within the civil society sector, experience working with financial (or related) data for analysis experience as a trainer and/or community builder, interest and/or experience in the extractives sector, demonstrated skills as a data storyteller or journalist Malawi The Fellowship in Malawi will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: experience with delivering technical and data-driven projects, experience with facilitating training activities, experience with data collection projects, basic understanding of the public procurement process Indonesia The Fellowship in Indonesia will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: experience with delivering technical and data-driven projects, experience with facilitating training activities, experience with working with government systems or data. Candidates with the following optional interests and experience will be appreciated: experience with explaining complex topics to varied audiences, experience with user design methodologies, experience with community development The Philippines The Fellowship in The Philippines will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: experience with user-centric research and design methodologies, experience with community-building activities, experience with data storytelling. Candidates with the following optional interests and experience will be appreciated: graphic design skills, experience with delivering trainings Kenya The Fellowship in Kenya will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: experience with delivering data-driven projects, experience with user research and data storytelling, experience with explaining complex topics to varied audiences. Candidates with the following optional interests and experience will be appreciated: interest in or experience with supporting civic projects and civil society organisations, experience with facilitating training activities. Tanzania The Fellowship in Tanzania will be focused on public procurement data through the Open Contracting Programme. For this position, School of Data is looking for someone with: experience with delivering data-driven projects, experience with facilitating training activities, experience with explaining complex topics to varied audiences. Candidates with the following optional interests and experience will be appreciated: experience working with journalists or as a journalist, interest in or experience with supporting civic projects and civil society organisations, experience with writing pedagogical content

9 months to make an impact

The programme will run from May to January 2019, and entail up to 10 days a month of time. Fellows will receive a monthly stipend of $1,000 USD a month to cover for their work. What are you waiting for? Read more about School of Data’s Fellowship or Apply now

Key Information: Fellowship

  • Available positions: up to 8 fellows, 1 in each of the following countries: Bolivia, Guatemala, Ghana, Indonesia, Kenya, Malawi, Tanzania, The Philippines
  • Application deadline: May 6th, 2018, midnight GMT+0
  • Duration: From May 14th, 2018 to January 31st, 2019
  • Level of activity: 10 days per month
  • Stipend: $1000 USD per month

Key links

About diversity and inclusivity

School of Data is committed to being inclusive in its recruitment practices. Inclusiveness means excluding no one because of race, age, religion, cultural appearance, sexual orientation, ethnicity or gender. We proactively seek to recruit individuals who differ from one another in these characteristics, in the belief that diversity enriches all that we do.

New edition of Data Journalism Handbook to explore journalistic interventions in the data society

Jonathan Gray - January 12, 2018 in Data Journalism, data journalism handbook, data literacy, journalism, Open Access

This blog has been reposted from http://jonathangray.org/2017/12/20/new-edition-data-journalism-handbook/ The first edition of The Data Journalism Handbook has been widely used and widely cited by students, practitioners and researchers alike, serving as both textbook and sourcebook for an emerging field. It has been translated into over 12 languages – including Arabic, Chinese, Czech, French, Georgian, Greek, Italian, Macedonian, Portuguese, Russian, Spanish and Ukrainian – and is used for teaching at many leading universities, as well as teaching and training centres around the world. A huge amount has happened in the field since the first edition in 2012. The Panama Papers project undertook an unprecedented international collaboration around a major database of leaked information about tax havens and offshore financial activity. Projects such as The Migrants Files, The Guardian’s The Counted and ProPublica’s Electionland have shown how journalists are not just using and presenting data, but also creating and assembling it themselves in order to improve data journalistic coverage of issues they are reporting on.

The Migrants’ Files saw journalists in 15 countries work together to create a database of people who died in their attempt to reach or stay in Europe.

Changes in digital technologies have enabled the development of formats for storytelling, interactivity and engagement with the assistance of drones, crowdsourcing tools, satellite data, social media data and bespoke software tools for data collection, analysis, visualisation and exploration. Data journalists are not simply using data as a source, they are also increasingly investigating, interrogating and intervening around the practices, platforms, algorithms and devices through which it is created, circulated and put to work in the world. They are creatively developing techniques and approaches which are adapted to very different kinds of social, cultural, economic, technological and political settings and challenges. Five years after its publication, we are developing a revised second edition, which will be published as an open access book with an innovative academic press. The new edition will be significantly overhauled to reflect these developments. It will complement the first edition with an examination of the current state of data journalism which is at once practical and reflective, profiling emerging practices and projects as well as their broader consequences.

“The Infinite Campaign” by Sam Lavigne (New Inquiry) repurposes ad creation data in order to explore “the bizarre rubrics Twitter uses to render its users legible”.

Contributors to the first edition include representatives from some of the world’s best-known newsrooms data journalism organisations, including the Australian Broadcasting Corporation, the BBC, the Chicago Tribune, Deutsche Welle, The Guardian, the Financial Times, Helsingin Sanomat, La Nacion, the New York Times, ProPublica, the Washington Post, the Texas Tribune, Verdens Gang, Wales Online, Zeit Online and many others. The new edition will include contributions from both leading practitioners and leading researchers of data journalism, exploring a diverse constellation of projects, methods and techniques in this field from voices and initiatives around the world. We are working hard to ensure a good balance of gender, geography and themes. Our approach in the new edition draws on the notion of “critical technical practice” from Philip Agre, which he formulates as an attempt to have “one foot planted in the craft work of design and the other foot planted in the reflexive work of critique” (1997). Similarly, we wish to provide an introduction to a major new area of journalism practice which is at once critically reflective and practical. The book will offer reflection from leading practitioners on their experiments and experiences, as well as fresh perspectives on the practical considerations of research on the field from leading scholars. The structure of the book reflects different ways of seeing and understanding contemporary data journalism practices and projects. The introduction highlights the renewed relevance of a book on data journalism in the current so-called “post-truth” moment, examining the resurgence of interest in data journalism, fact-checking and strengthening the capacities of “facty” publics in response to fears about “alternative facts” and the speculation about a breakdown of trust in experts and institutions of science, policy, law, media and democracy. As well as reviewing a variety of critical responses to data journalism and associated forms of datafication, it looks at how this field may nevertheless constitute an interesting site of progressive social experimentation, participation and intervention. The first section on “data journalism in context” will review histories, geographies, economics and politics of data journalism – drawing on leading studies in these areas. The second section on “data journalism practices” will look at a variety of practices for assembling data, working with data, making sense with data and organising data journalism from around the world. This includes a wide variety of case studies – including the use of social media data, investigations into algorithms and fake news, the use of networks, open source coding practices and emerging forms of storytelling through news apps and data animations. Other chapters look at infrastructures for collaboration, as well as creative responses to disappearing data and limited connectivity. The third and final section on “what does data journalism do?”, examines the social life of data journalism projects, including everyday encounters with visualisations, organising collaborations across fields, the impacts of data projects in various settings, and how data journalism can constitute a form of “data activism”. As well as providing a rich account of the state of the field, the book is also intended to inspire and inform “experiments in participation” between journalists, researchers, civil society groups and their various publics. This aspiration is partly informed by approaches to participatory design and research from both science and technology studies as well as more recent digital methods research. Through the book we thus aim to explore not only what data journalism initiatives do, but how they might be done differently in order to facilitate vital public debates about both the future of the data society as well as the significant global challenges that we currently face.

2017: A Year to Remember for OK Nepal

Nikesh Balami - January 11, 2018 in community, Community Stories, data literacy, nepal, OK Nepal, Open Data, Open Data Day

This blog has been cross-posted from the OK Nepal blog as part of our blog series of Open Knowledge Network updates.
Best wishes for 2018 from OK Nepal to all of the Open Knowledge family and friends!! The year 2017 was one of the best years for Open Knowledge Nepal. We started our journey by registering Open Knowledge Nepal as a non-profit organization under the Nepal Government and as we start to reflect 2017, it has been “A Year to Remember”. We were able to achieve many things and we promise to continue our hard work to improve the State of Open Data in South Asia in 2018 also. Some of the key highlights of 2017 are:
  1. Organizing Open Data Day 2017
For the 5th time in a row, the Open Knowledge Nepal team led the effort of organizing International Open Data Day at Pokhara, Nepal. This year it was a collaborative effort of Kathmandu Living Labs and Open Knowledge Nepal. It was also the first official event of Open Knowledge Nepal that was held out of the Kathmandu Valley.  
  1. Launching Election Nepal Portal  
On 13th April 2017 (31st Chaitra 2073), a day before Nepalese New Year 2074, we officially released the  Election Nepal Portal in collaboration with Code for Nepal and made it open for contribution. Election Nepal is a crowdsourced citizen engagement portal that includes the Local Elections data. The portal will have three major focus areas; visualizations, datasets, and twitter feeds.
  1. Contributing to Global Open Data Index  
On May 2nd, 2017 Open Knowledge International launched the 4th edition of Global Open Data Index (GODI), a global assessment of open government data publication. Nepal has been part of this global assessment continuously for four years with lots of ups and downs. We have been leading it since the very beginning. With 20% of openness, Nepal was ranked 69 in 2016 Global Open Data Index. Also, this year we helped Open Knowledge International by coordinating for South Asia region and for the first time, we were able to get contributions from Bhutan and Afghanistan.
  1. Launching Local Boundaries   
To help journalists and researchers visualize the geographical data of Nepal in a map, we build Local Boundaries where we share the shapefile of Nepal federal structure and others. Local Boundaries brings the detailed geodata of administrative units or maps of all administrative boundaries defined by Nepal Government in an open and reusable format, free of cost. The local boundaries are available in two formats (TopoJSON and GeoJSON) and can be easily reused to map local authority data to OpenStreetMap, Google Map, Leaflet or MapBox interactively.
  1. Launching Open Data Handbook Nepali Version  
After the work of a year followed by a series of discussion and consultation, on 7 August 2017 Open Knowledge Nepal launched the first version of Nepali Open Data Handbook – An introductory guidebook used by governments and civil society organizations around the world as an introduction and blueprint for open data projects. The handbook was translated with the collaborative effort by volunteers and contributors.  Now the Nepali Handbook is available at http://handbook.oknp.org
  1. Developing Open Data Curriculum and Open Data Manual  
To organize the open data awareness program in a structured format and to generate resources which can be further use by civil society and institution, Open Knowledge Nepal prepared an Open Data Curriculum and Open Data Manual. It contains basic aspects of open data like an introduction, importance, principles, application areas as well as the technical aspects of open data like extraction, cleaning, analysis, and visualization of data. It works as a reference and a recommended guide for university students, private sectors, and civil society.
  1. Running Open Data Awareness Program
The Open Data Awareness Program was conducted in 11 colleges and 2 youth organization, reaching more than 335+ youths are first of its kind conducted in Nepal. Representatives of Open Knowledge Nepal visited 7 districts of Nepal with the Open Data Curriculum and the Open Data Manual to train youths about the importance and use of open data.
  1. Organizing Open Data Hackathon  
The Open Data Hackathon was organized with the theme “Use data to solve local problems faced by Nepali citizens” at Yalamaya Kendra (Dhokaima Cafe), Patan Dhoka on November 25th, 2017. In this hackathon, we brought students and youths from different backgrounds under the same roof to work collaboratively on different aspects of open data.
  1. Co-organizing Wiki Data-a-thon
On 30th November 2017, we co-organized a Wiki Data-a-thon with Wikimedians of Nepal at Nepal Connection, Thamel on the occasion of Global Legislative Openness Week (GLOW). During the event, we scraped the data of last CA election and pushed those data in WikiData.  
  1. Supporting Asian Regional Meeting  
On 2nd and 3rd December 2017, we supported Open Access Nepal to organize Asian Regional Meeting on Open Access, Open Education and Open Data with the theme “Open in Action: Bridging the Information Divide”. Delegates were from different countries like the USA, China, South Africa, India, Bangladesh, China, Nepal. We managed the Nepali delegates and participants.

2018 Planning

We are looking forward to a prosperous 2018, where we plan to outreach the whole of South Asia countries to improve the state of open data in the region by using focused open data training, research, and projects. For this, we will be collaborating with all possible CSOs working in Asia and will serve as an intermediary for different international organizations who want to promote or increase their activities in Asian countries. This will help the Open Knowledge Network in the long run, and we will also get opportunities to learn from each others’ successes and failures, promote each other’s activities, brainstorm collaborative projects and make the relationship between countries stronger. Besides this, we will continue also our work of data literacy like Open Data Awareness Program to make Nepalese citizens more data demanding and savvy, and launch a couple of new projects to help people to understand the available data. To be updated about our activities, please follow us at different medias:  

Data is a Team Sport

Dirk Slater - December 20, 2017 in announcement, Data Blog, data literacy, research, Team Sport

Data is a Team Sport is a series of online conversations held with data literacy practitioners in mid-2017 that explores the ever evolving data literacy eco-system. Our aim in producing ‘Data is a Team Sport’ was to surface learnings and present them in formats that would be accessible to data literacy practitioners. Thanks to the efforts of governments, organizations and agencies to make their information more transparent the amount of data entering the public domain has increased dramatically in recent years. As political and economic forces become more adept at using data, we enter a new era of data literacy were just being able to understand information is not enough. In this project, we aimed to engage data literacy practitioners to capture lessons learned and examine how their methodologies are shifting and adapting. We also wanted to better understand how they perceived the data literacy ecosystem: the diverse set of actors needed to enable and support the use of data in social change work.
Conversation Guests PodCast
Enabling Learning Rahul Bhargava & Lucy Chambers
Data Driven Journalism Eva Constantaras & Natalia Mazotte
One on One with Daniela Lepiz Daniela Lepiz
Advocacy Organisations Milena Marin & Sam Leon
One on One with Heather Leson Heather Leson
One on One with Friedhelm Weinberg Friedhelm Weinberg
Mentors, Mediators and Mad Skills Emma Prest & Tin Geber
Government Priorities and Incentives Ania Calderon & Tamara Puhovski
Read the rest of this entry →

Data is a Team Sport: Government Priorities and Incentives

Dirk Slater - August 13, 2017 in Ania Calderon, Data Blog, data literacy, Event report, Fabriders, Government, Open Data, research, Tamara Puhovski, Team Sport, The Open Data Charter

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. The conversation in this episode focuses on the challenges of getting governments to prioritise data literacy both externally and internally, and incentives to produce open-data and features:
  • Ania Calderon, Executive Director at the Open Data Charter, a collaboration between governments and organisations working to open up data based on a shared set of principles. For the past three years, she led the National Open Data Policy in Mexico, delivering a key presidential mandate. She established capacity building programs across more than 200 public institutions.
  • Tamara Puhovskia sociologist, innovator, public policy junky and an open government consultant. She describes herself as a time traveler journeying back to 19th and 20th century public policy centers and trying to bring them back to the future.

Notes from the conversation:

Access to government produced open-data is critical for healthy functioning democracies. It takes an eco-system that includes a critical thinking citizenry, knowledgeable civil servants, incentivised elected officials, and smart open-data advocates.  Everyone in the eco-system needs to be focused on long-term goals.
  • Elected officials needs incentivising beyond monetary arguments, as budgetary gains can take a long time to fruition.
  • Government’s capacities to produce open-data is an issue that needs greater attention.
  • We need to get past just making arguments for open-data, but be able to provide good solid stories and examples of its benefits.

Resources mentioned in the conversation:

Also, not mentioned, but be sure to check out Tamara’s work on Open Youth

View the full online conversation:

Flattr this!

Data is a Team Sport: Mentors Mediators and Mad Skills

Dirk Slater - August 7, 2017 in advocacy, community, Data Blog, data literacy, Data Maturity, DataKind UK, Emma Prest, Event report, Fabriders, Intermediaries, mentoring, Service Organisations, Team Sport, Tin Geber

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. This episode features:
  • Emma Prest oversees the running of DataKind UK, leading the community of volunteers and building understanding about what data science can do in the charitable sector. Emma sits on the Editorial Advisory Committee at the Bureau of Investigative Journalism. She was previously a programme coordinator at Tactical Tech, providing hands-on help for activists using data in campaigns. 
  • Tin Geber has been working on the intersection of technology, art and activism for most of the last decade. In his previous role as Design and Tech Lead for The Engine Room, he developed role-playing games for human rights activists; collaborated on augmented reality transmedia projects; and helped NGOs around the world to develop creative ways to combine technology and human rights.
In this episode we take a deep dive into how to get organisations beyond ‘data literacy’ and reach ‘data maturity’, where organisations understand what is good practice on running a data project.  Some main points:
  • A red flag that indicates a data project will end in failure is when the goal is implementation of a tool as opposed to a mission-critical goal.
  • Training in itself can be helpful with hard skills, such as how to do analysis, but in terms of running data projects, it takes a lot of hand-holding and mentorship is a more effective.
  • A critical role in and organisations is people who can champion tech and data work, and they need better support in that role.
  • Fake news and data-driven confusion has meant the need for understanding good data practice is even more important.

DataKind UK’s resources:

Tin’s resources:

Resources that are inspiring Emma’s Work:

Resources that are inspiring Tin’s work:

  • DataBasic.io – A a suite of easy-to-use web tools for beginners that introduce concepts of working with data
  • Media Manipulation and Disinformation Online – Report from Data and Society on how false or misleading information is having real and negative effects on the public consumption of news.
  • Raw Graphs – The missing link between spreadsheets and data visualization

View the full online conversation:

Flattr this!

Data is a Team Sport: Mentors Mediators and Mad Skills

Dirk Slater - August 7, 2017 in advocacy, community, Data Blog, data literacy, Data Maturity, DataKind UK, Emma Prest, Event report, Fabriders, Intermediaries, mentoring, Service Organisations, Team Sport, Tin Geber

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. This episode features:
  • Emma Prest oversees the running of DataKind UK, leading the community of volunteers and building understanding about what data science can do in the charitable sector. Emma sits on the Editorial Advisory Committee at the Bureau of Investigative Journalism. She was previously a programme coordinator at Tactical Tech, providing hands-on help for activists using data in campaigns. 
  • Tin Geber has been working on the intersection of technology, art and activism for most of the last decade. In his previous role as Design and Tech Lead for The Engine Room, he developed role-playing games for human rights activists; collaborated on augmented reality transmedia projects; and helped NGOs around the world to develop creative ways to combine technology and human rights.
In this episode we take a deep dive into how to get organisations beyond ‘data literacy’ and reach ‘data maturity’, where organisations understand what is good practice on running a data project.  Some main points:
  • A red flag that indicates a data project will end in failure is when the goal is implementation of a tool as opposed to a mission-critical goal.
  • Training in itself can be helpful with hard skills, such as how to do analysis, but in terms of running data projects, it takes a lot of hand-holding and mentorship is a more effective.
  • A critical role in and organisations is people who can champion tech and data work, and they need better support in that role.
  • Fake news and data-driven confusion has meant the need for understanding good data practice is even more important.

DataKind UK’s resources:

Tin’s resources:

Resources that are inspiring Emma’s Work:

Resources that are inspiring Tin’s work:

  • DataBasic.io – A a suite of easy-to-use web tools for beginners that introduce concepts of working with data
  • Media Manipulation and Disinformation Online – Report from Data and Society on how false or misleading information is having real and negative effects on the public consumption of news.
  • Raw Graphs – The missing link between spreadsheets and data visualization

View the full online conversation:

Flattr this!

Data is a Team Sport: One on One with Heather Leson

Dirk Slater - July 19, 2017 in community, Data Blog, data literacy, data protection, Event report, Heather Leson, Humanitarian Organisations, IFRC, research, Team Sport

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. This episode features a one on one conversation with Heather Leson, the Data Literacy Lead at International Federation of Red Cross and Red Crescent Societies. As a technologist, she strengthens community collaboration via humanitarian technologies and social entrepreneurship. She builds partnerships, curates digital spaces, fosters volunteer engagement and delivers training while inspiring systems for co-creation with maps, code and data. At the International Federation of Red Cross Red Crescent, her mandate includes global data advocacy, data literacy and data training programs in partnership with the 190 national societies and the 13 million volunteers. She is a past Board Member at the Humanitarian OpenStreetMap Team (4 years), Peace Geeks (1 year), and an Advisor for MapSwipe – using gamification systems to crowdsource disaster-based satellite imagery. Previously, she worked as Social Innovation Program Manager, Qatar Computing Research Institute (Qatar Foundation) Director of Community Engagement, Ushahidi, and Community Director, Open Knowledge (School of Data).

Main Points from the Conversation:

  • Data protection is the default setting for humanitarian organisations collecting data.
  • She’s found its critical to focus on people and what they are trying to accomplish, as opposed to focusing on tools.
  • She’s added ‘socialisation’ as the beginning step to the data pipeline.

Heather’s Resources

Blogs/websites Heather’s work The full online conversation:
Flattr this!

Data is a Team Sport: Advocacy Organisations

Dirk Slater - July 11, 2017 in Amnesty International, Data Blog, data literacy, Event report, Fabriders, global witness, research, Team Sport

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. In this episode we discussed data driven advocacy organisations with:
  • Milena Marin is Senior Innovation Campaigner at Amnesty International. She is currently leads Amnesty Decoders – an innovative project aiming to engage digital volunteers in documenting human right violations using new technologies. Previously she worked as programme manager of School of Data. She also worked for over 4 years with Transparency International where she supported TI’s global network to use technology in the fight against corruption.
  • Sam Leon, is Data Lead at Global Witness, focusing on the use of data to fight corruption and how to turn this information into change making stories. He is currently working with a coalition of data scientists, academics and investigative journalists to build analytical models and tools that enable anti-corruption campaigners to understand and identify corporate networks used for nefarious and corrupt practices.

Notes from the Conversation

In order to get their organisations to see the value and benefit of using data, they both have had to demonstrate results and have looked for opportunities where they could show effective impact. What data does for advocacy is to show the extent of the problem and it provides depths to qualitative and individual stories.  Milena credits the work of School of Data for the fact that journalists now expect their to be data accessible from Amnesty to back up their data.
  • They see gaps in the way that people working in advocacy see data and new technologies as bright shiny and easy answers to their challenges.
  • In today’s post-fact world, they find that it’s being used to more quickly discredit their work and as a result they need to work harder at presenting verifiable data.
  • Amnesty’s decoder project has involved 45,000 volunteers and along with being able to review a huge amount of video, they have also gotten training and a deeper understanding of what Amnesty does.
  • Global Witness has had a limited amount of data-sets they have released to the public and would like to be releasing more data that can be of use to communities. However there is a long way to go before their data can be open by default as there needs to be further learning and understanding of how data can make individuals more vulnerable.
  • Advocacy organisations need to use intermediaries and externals to cover the gaps in their own expertise around data.

More about their work

Milena

Sam

Dirk

Resources and Readings

View the Full Conversation:

  Flattr this!

Data is a Team Sport: One on One with Daniela Lepiz

Dirk Slater - July 3, 2017 in community, Data Blog, Data Journalism, data literacy, Event report, Fabriders, Nigeria, research, Team Sport, West Africa

Data is a Team Sport is our open-research project exploring the data literacy eco-system and how it is evolving in the wake of post-fact, fake news and data-driven confusion.  We are producing a series of videos, blog posts and podcasts based on a series of online conversations we are having with data literacy practitioners. To subscribe to the podcast series, cut and paste the following link into your podcast manager : http://feeds.soundcloud.com/users/soundcloud:users:311573348/sounds.rss or find us in the iTunes Store and Stitcher. This episode features a one on one episode with Daniela Lepiz, a Costa Rican data journalist and trainer, who is currently the Investigation Editor for CENOZO, a West African Investigative Journalism Project that aims to promote and support cross border data investigation and open data in the region. She has a masters degree in data journalism from the Rey Juan Carlos University in Madrid, Spain. Previously involved with OpenUP South Africa working with journalists to produce data driven stories.  Daniela is also a trainer for the Tanzania Media Foundation and has been involved in many other projects with South African Media, La Nacion in Costa Rica and other international organisations.

Notes from the conversation

Daniela spoke to us from Burkina Faso and reflected on the role of journalism and particularly data-driven journalism in functioning democracies.  The project she is working on empowering journalists working cross-border in western Africa to utilise data to expose corruption and violation of human rights.  To identify journalists to participate in the project, they have looked for individuals who are experienced, passionate and curious. The project engages existing media houses, such as Premium Times in Nigeria, to assure that there are places for their stories to appear. Important points Daniela raises:
  • Media is continually evolving and learning to evolve and Daniela can see that data literacy will be a required proficiency in the next five years.
  • The biggest barrier to achieving open-data in government are government officials who resist transparency
  • There is a real fear from journalists of having to be proficient in maths when they are considering improve their skills to produce data-driven stories.  They often fail to realise that its about working with others that have skills on statistics and data analysis.
  • Trust in media has declined in such a big way and it means journalists have to work that much harder, particularly in labelling things as opinion or being biased.

Resources she finds inspiring

Her blogs posts

The full online conversation:

Daniela’s bookmarks!

These are the resources she uses the most often. .Rddj – Resources for doing data journalism with RComparing Columns in Google Refine | OUseful.Info, the blog…Journalist datastores: where can you find them? A list. | Simon RogersAidInfoPlus – Mastering Aid Information for Change

Data skills

Mapping tip: how to convert and filter KML into a list with Open Refine | Online Journalism Blog
Mapbox + Weather Data
Encryption, Journalism and Free Expression | The Mozilla Blog
Data cleaning with Regular Expressions (NICAR) – Google Docs
NICAR 2016 Links and Tips – Google Docs
Teaching Data Journalism: A Survey & Model Curricula | Global Investigative Journalism Network
Data bulletproofing tips for NICAR 2016 – Google Docs
Using the command line tabula extractor tool · tabulapdf/tabula-extractor Wiki · GitHub
Talend Downloads

Github

Git Concepts – SmartGit (Latest/Preview) – Confluence
GitHub For Beginners: Don’t Get Scared, Get Started – ReadWrite
Kartograph.org
LittleSis – Profiling the powers that be

Tableau customized polygons

How can I create a filled map with custom polygons in Tableau given point data? – Stack Overflow
Using Shape Files for Boundaries in Tableau | The Last Data Bender
How to make custom Tableau maps
How to map geographies in Tableau that are not built in to the product (e.g. UK postcodes, sales areas) – Dabbling with Data
Alteryx Analytics Gallery | Public Gallery
TableauShapeMaker – Adding custom shapes to Tableau maps | Vishful thinking…
Creating Tableau Polygons from ArcGIS Shapefiles | Tableau Software
Creating Polygon-Shaded Maps | Tableau Software
Tool to Convert ArcGIS Shapefiles into Tableau Polygons | Tableau and Behold!
Polygon Maps | Tableau Software
Modeling April 2016
5 Tips for Making Your Tableau Public Viz Go Viral | Tableau Public
Google News Lab
HTML and CSS
Open Semantic Search: Your own search engine for documents, images, tables, files, intranet & news
Spatial Data Download | DIVA-GIS
Linkurious – Linkurious – Understand the connections in your data
Apache Solr –
Apache Tika – Apache Tika
Neo4j Graph Database: Unlock the Value of Data Relationships
SQL: Table Transformation | Codecademy
dc.js – Dimensional Charting Javascript Library
The People and the Technology Behind the Panama Papers | Global Investigative Journalism Network
How to convert XLS file to CSV in Command Line [Linux]
Intro to SQL (IRE 2016) · GitHub
Malik Singleton – SELECT needle FROM haystack;
Investigative Reporters and Editors | Tipsheets and links
Investigative Reporters and Editors | Tipsheets and Links

SQL_PYTHON

More data

2016-NICAR-Adv-SQL/SQL_queries.md at master · taggartk/2016-NICAR-Adv-SQL · GitHub
advanced-sql-nicar15/stats-functions.sql at master · anthonydb/advanced-sql-nicar15 · GitHub
2016-NICAR-Adv-SQL/SQL_queries.md at master · taggartk/2016-NICAR-Adv-SQL · GitHub
Malik Singleton – SELECT needle FROM haystack;
Statistical functions in MySQL • Code is poetry
Data Analysis Using SQL and Excel – Gordon S. Linoff – Google Books
Using PROC SQL to Find Uncommon Observations Between 2 Data Sets in SAS | The Chemical Statistician
mysql – Query to compare two subsets of data from the same table? – Database Administrators Stack Exchange
sql – How to add “weights” to a MySQL table and select random values according to these? – Stack Overflow
sql – Fast mysql random weighted choice on big database – Stack Overflow
php – MySQL: Select Random Entry, but Weight Towards Certain Entries – Stack Overflow
MySQL Moving average
Calculating descriptive statistics in MySQL | codediesel
Problem-Solving using Graph Traversals: Searching, Scoring, Ranking, …
R, MySQL, LM and quantreg
26318_AllText_Print.pdf
ddi-documentation-english-572 (1).pdf
Categorical Data — pandas 0.18.1+143.g3b75e03.dirty documentation
python – Loading STATA file: Categorial values must be unique – Stack Overflow
Using the CSV module in Python
14.1. csv — CSV File Reading and Writing — Python 3.5.2rc1 documentation
csvsql — csvkit 0.9.1 documentation
weight samples with python – Google Search
python – Weighted choice short and simple – Stack Overflow
7.1. string — Common string operations — Python v2.6.9 documentation
Introduction to Data Analysis with Python | Lynda.com
A Complete Tutorial to Learn Data Science with Python from Scratch
GitHub – fonnesbeck/statistical-analysis-python-tutorial: Statistical Data Analysis in Python
Verifying the email – Email Checker
A little tour of aleph, a data search tool for reporters – pudo.org (Friedrich Lindenberg)
Welcome – Investigative Dashboard Search
Investigative Dashboard
Working with CSVs on the Command Line
FiveThirtyEight’s data journalism workflow with R | useR! 2016 international R User conference | Channel 9
Six issue when installing package · Issue #3165 · pypa/pip · GitHub
python – Installing pip on Mac OS X – Stack Overflow
Source – Journalism Code, Context & Community – A project by Knight-Mozilla OpenNews
Introducing Kaggle’s Open Data Platform
NASA just made all the scientific research it funds available for free – ScienceAlert
District council code list | Statistics South Africa
How-to: Index Scanned PDFs at Scale Using Fewer Than 50 Lines of Code – Cloudera Engineering Blog
GitHub – gavinr/geojson-csv-join: A script to take a GeoJSON file, and JOIN data onto that file from a CSV file.
7 command-line tools for data science
Python Basics: Lists, Dictionaries, & Booleans
Jupyter Notebook Viewer

PYTHON FOR JOURNALISTS

New folder

Reshaping and Pivot Tables — pandas 0.18.1 documentation
Reshaping in Pandas – Pivot, Pivot-Table, Stack and Unstack explained with Pictures – Nikolay Grozev
Pandas Pivot-Table Example – YouTube
pandas.pivot_table — pandas 0.18.1 documentation
Pandas Pivot Table Explained – Practical Business Python
Pivot Tables In Pandas – Python
Pandas .groupby(), Lambda Functions, & Pivot Tables
Counting Values & Basic Plotting in Python
Creating Pandas DataFrames & Selecting Data
Filtering Data in Python with Boolean Indexes
Deriving New Columns & Defining Python Functions
Python Histograms, Box Plots, & Distributions
Resources for Further Learning
Python Methods, Functions, & Libraries
Python Basics: Lists, Dictionaries, & Booleans
Real-world Python for data-crunching journalists | TrendCT
Cookbook — agate 1.4.0 documentation
3. Power tools — csvkit 0.9.1 documentation
Tutorial — csvkit 0.9.1 documentation
4. Going elsewhere with your data — csvkit 0.9.1 documentation
2. Examining the data — csvkit 0.9.1 documentation
A Complete Tutorial to Learn Data Science with Python from Scratch
For Journalism
ProPublica Summer Data Institute
Percentage of vote change | CARTO
Data Science | Coursera
Data journalism training materials
Pythex: a Python regular expression editor
A secure whistleblowing platform for African media | afriLEAKS
PDFUnlock! – Unlock secured PDF files online for free.
The digital journalist’s toolbox: mapping | IJNet
Bulletproof Data Journalism – Course – LEARNO
Transpose columns across rows (grefine 2.5) ~ RefinePro Knowledge Base for OpenRefine
Installing NLTK — NLTK 3.0 documentation
1. Language Processing and Python
Visualize any Text as a Network – Textexture
10 tools that can help data journalists do better work, be more efficient – Poynter
Workshop Attendance
Clustering In Depth · OpenRefine/OpenRefine Wiki · GitHub
Regression analysis using Python
DataBasic.io
DataBasic.io
R for Every Survey Analysis – YouTube
Git – Book
NICAR17 Slides, Links & Tutorials #NICAR17 // Ricochet by Chrys Wu
Register for Anonymous VPN Services | PIA Services
The Bureau of Investigative Journalism
dtSearch – Text Retrieval / Full Text Search Engine
Investigation, Cybersecurity, Information Governance and eDiscovery Software | Nuix
How we built the Offshore Leaks Database | International Consortium of Investigative Journalists
Liz Telecom/Azimmo – Google Search
First Python Notebook — First Python Notebook 1.0 documentation
GitHub – JasonKessler/scattertext: Beautiful visualizations of how language differs among document types
  Flattr this!