Key Takeaways from the Data Visualization Society’s Outlier 2021 Conference

Data Visualization

The Data Visualization Society’s (DVS) first conference Outlier 2021 took place on 4th, 5th, and 7th February 2021. It was organized as an online conference, joined by about 1000 participants from their computer screens all over the world. 41 main talks, about 20min each, were presented, as well as dozens of smaller sessions.

Talks were distributed within a large time window, suitable (or not) for people in different time zones. I was only able to participate in full on sunday the 7th. But due to the presentations being prerecorded, and made available as videos immediately after each talk, I was able to see every talk.

To profit the most from this event, and process it in a structured way for myself, I shortly summarized the key takeaways from the talks. These summaries are listed below. The talks I found most interesting are summarized in more detail than the others. Unfortunately the very short summaries do not do justice to these also great talks. So, I encourage you to also see these video in full if their topics interests you.

To process the content in retrospect, it also made sense for me to regroup the talks into categories. Categories that emmerged are: general methodology, tools, history, data art and experimental case studies, and case studies. My interest mainly lay in the methodological talks, followed by presentations of tools. The talks on data visualization history and data art provided some lighter content in between. The enormous breadth of case studies greatly contributed to the diverse and international atmosphere of this event. The talks that I most recommend watching in full are marked with an asterisk* below.

The full list of talks is also available as a youtube playlist. The list contains a few additional talks not mentioned below on organizational issues of the Data Visualization Society as well as several 5-minute short so-called lighting talks.

Data Visualization General Methodology

How to Get Your Organization to Value Data Visualization – And You! (Steve Wexler)* (watch video)

Steve Wexler demonstrated how to convince people of the power of data visualization in company environments were people are still working with raw numbers in spreadsheets. By showing examples and asking questions people can experience for themselves that data visualizations allow to find answers much faster than tables. Dashboards can be made more attractive for people if they can see their own relative position in the data. Needless discussions about chart types and color choices can be avoided by having experiments at hand demonstrating your point, such as estimating the relative sizes of bubbles/circles versus bars.

Soft Landing, Firm Impact: Practical Tips on How to Give and Receive Meaningful Data Visualization Feedback (Candra McRae)* (watch video)

Candra McRae gave practical tips on how to give and receive feedback. When giving feedback one should be self-aware of one’s tone, body language, and biases. Personal opions should be voiced in the form of „I“ and „me“. It is better to give feedback in a one-on-one setting than in a group. One should first seek to understand why things were done in a certain way. The given feedback should be clear and honest but also kind. Dataviz experts‘ (Tufte, Few) stances should not be used in a discussion. When receiving feedback one shouldn’t shut down and be argumentative. One should ask engaging open questions. It is also important to act upon the given feedback.

Side Projects (Jan Willem Tulp)* (watch video)

Jan Willem Tulp elaborated what makes good side projects in data visualization. Such projects serve to learn something and to show something. For data visualization designers starting out, such projects usually serve to fill the portfolio. But they also make sense for seasoned professionals, because they can lead to paid projects. Side projects provide the opportunity to fully do you own thing, with your ideas, creativity, and skills. It is recommended to keep a notebook/spreadsheet of ideas and interesting datasets. Good side projects are relevant and original. Relevance can be achieved by using a well-known dataset, treating a current event, and by allowing people to find themselves in the data. Originality can be achieved by collecting one’s own data, redesigning an existing visualization, trying a new visualization concept, visualizing uncommon questions, and by creating engaging design people spend more time with. Mr. Tulp then discussed how his own and other people’s side projects meet the criteria of relevance and originality.

My Statistic Enemy, or Why Difficulties Make Better Data Visualization (Julie Brunet)* (watch video)

Julie Brunet explained how she cooperates with people with different skillsets. The basic idea is to manage that which you don’t know. People in the data visualization community have very different backgrounds. There is a temptation to try to learn to do everything by oneself. But a better approach is to cooperate with people that have the skills that one lacks for a project. People can thus alternately take the lead for different parts of a project.

Personal comment: The slides of this presentation were probably the most beautiful of the conference.

Data Viz, the UnEmpathetic Art (Mushon Zer-Aviv)* (watch video)

Mushon Zer-Aviv discussed how empathy can be achieved in data visualizations. Humans easily empathize with individuals but not with masses. Research has shown that people are willing to donate more than double the amount to save an individual (identifiable life) than to save the many (statistical lives). Even when the statistics are just shown aside the individual fates, the donations go down. This is called statistical numbing. Daniel Kahneman wrote about two systems of thinking. System 1 is fast, automatic, and involuntary, system 2 is slow, effortful and deliberating. Often system 2 rationalizes in retrospect, what system 1 has perceived. Empathy can be located in system 1. Or, speaking in data visualization terms, it can be called a preattentive attribute that focuses our attention. A good approach to reaching empathy with data visualization is thus to start with the individual fate and then zoom out to the bigger picture. But it is not enough to simply rouse people, there must also be a specific call to action. Not just the status-quo should be shown, but also the better situation that could be.

Personal comment: Especially in the Covid crisis, where statistical data represents thousands of deaths, this is a very pressing topic. Many great examples of empathic and unempathetic data visualizations have emerged in this context.

3 Languages, 3 Aesthetics, 1 Graphic: A Case Study of Visualization in a Multicultural Environment (Nilangika Fernando)* (watch video)

Nilangika Fernando explained how she takes three different cultural aesthetics in Sri Lanka into account when designing data visualizations. The official languages of Sri Lanka are English, Sinhala, and Tamil. When she published data visualizations from an English context, translated into Sinhala, they would get little traction in Sinhala media. Looking at newspaper frontpages she noticed that each language and culture has it’s own look and feel. Newspaper try to make their frontpage as attractive as possible to the given audience, so they can be used to determine wether an audience has a different design aesthetic. These specific aesthetics could also be seen in online-memes of the different cultures. To analyze an aesthetic one should look at the layout, color, font, images, and narrative. Icons need to match the cultural context. For instance, a savings box in the form of a pig would not be understood in Sri Lanka, or even be considered offensive. Also, the hair and eye color of icons should be appropriate. Then she explained how to bridge this visual gap. She creates the infographic in the language of the primary audience, and then translate them into the others. She works with collaborators who are based in the different cultures. Finally she explained how data visualization can be presented in a non-data culture. She advised to use serve infographics in small doses, give a finished product that is attractive to publish, and to use storytelling.

Mind Games: The Psychology Behind Designing Beautiful, Effective and Impactful Data Viz (Amy Alberts)* (watch video)

Amy Alberts talked about results of user research at Tableau. Using eye trackers she analyzed how people perceive dashboards. Such eye tracking studies are in themselves data visualizations because the results are shown and analyzed as gazeplots, heatmaps, and gaze opacity maps. Given 10 seconds people focused their attention especially on big numbers, high color contrast, pictures of humans, and maps. People also tended to read the dashboards starting in the upper left corner moving right and down. When the viewing duration was increased, the viewing patterns remained largely the same. But when a specific task was given when viewing a dashboard, the patterns fell apart. So humans are on the one side dumb monkeys, looking with little actual intent, but on the other side also very intelligent in navigating systems to reach a goal. These result are in line with UX research. The mentioned attention-getters can be used purposefully for designing dashboards, notably taking up corporate design elements. Priming can be also be used to focus attention, by saying or writing something related to what you want people to focus on before showing the dashboard.

Are Your Data Visualizations Excluding People? (Larene Le Gassick, Sarah Fossheim, Frank Elavsky) (watch video)

Larene Le Gassick, Sarah Fossheim and Frank Elavsky explained how data visualizations can be made more accesible to people with vision impairment and blind people. They argued how everyone, also people with good vision, profit from more accessible data visualizations.

Iron Quest: Lessons from the Community (Sarah Bartlett) (watch video)

Sarah Bartlett gave tips on how to succeed in the Tableau Ironviz challenge. She recommends to visualize what one loves, build own datasets, use an exploratory or declarative approach, and provide context to the shown data.

Data Designer: A Self Portrait (Valentina d‘Efilippo) (watch video)

Valentina d’Efilippo gave tips on working as a data designer she wishes she had known when she started out herself. She recommends to see design as a problem-solving mindset, not box oneself in and embrace the chaos, tap into other’s brains to create empathy, learn to say no, feed one’s brain with creative things, raise one’s own personal voice, and listed to one’s gut.

Labels Matter (Gaelan Smith) (watch video)

Gaelan Smith discussed how labels and categories used in data gathering can include and exclude people. He explained how adding categories can make room for diversity.

Beyond Word Clouds: Visualizing the Linguistic Patterns of Political Speeches (Riva Quiroga) (watch video)

Riva Quiroga presented her analysis of presidential speeches in Chile. Among other analyses, she showed how the punctuation of speeches with many exclamation marks indicate authoritarian presidencies.

Using Zipf‘s Law to Help Understand COVID-19 (Howard Wainer) (watch video)

Howard Wainer showed how Zip’s Law can be used for outlier detection. Many natural processes follow a distribution where the frequency of occurence of an observation is inversely proportional to its rank (according to frequency of occurrence). When a process follows this distribution, outliers can easily be detected that deviate from it.

An Odd Couple’s Journey Towards SciArt: Design Meets Science and Vice-Versa (Greta Carrete Vega, Estefania Casal) (watch video)

Greta Carrete Vega and Estefania Casal discussed how they work together as a scientist and designer. Among other things they showed a model by Min Basadur on roles required in creative problem solving: the generator, the conceptualizer, the optimizer, and the implementer. Casal, the designer, likes to generate ideas and concepts. Vega, the scientist, likes to get things done practically. Thus, they complement each other as collaborators.

Creative problem solving profile according to Min Basadur (Source: Greta Carrete Vega, Estefania Casal: Outlier 21 presentation)

Data Viz for Non-Profit (Guillermina Sutter Schneider, Luis Ahumada) (watch video)

Guillermina Sutter Schneider and Luis Ahumada explained how non-profit organizations can work with data visualization. They recommend to develop a style guide for an organization in order to give the created charts an uniform and recognizable look.

Data Visualization Tools

Going Beyond Matplotlib and Seaborn: A Survey of Python Data Visualization Tools (Stephanie Kirmer)* (watch video)

Stephanie Kirmer provided an overview of six Python data visualization libraries. She included the older standard libraries Mathplotlib (2003) and Seaborn (2012), and the newer libraries Bokeh (2012), Altair (2016), Plotnine (2017), and Plotly (2013). The target criteria she wanted libraries to meet are an easy learning curve, consistent grammar, flexibility, beautiful output, and interactivity. She tested each library with a set of standard charts, and then discussed how the target criteria were met. She advises against using the older libraries. In conclusion she showed for which individual target criterion which of the four newer libraries should be used. For an easy learning curve: Plotnine or Altair. For consistent grammar: Plotnine or Altair. For flexibility: Plotnine or Bokeh, For beautiful images: Altair or Bokeh. For interactivity: Plotly or Bokeh. Generally, Altair is only suitable for small datasets.

Srengths of different Python graphics libraries (Source: Stephanie Kirmer: Outlier 21 presentation)

Navigating the Wide World of Data Visualization Libraries (for the Web) (Krist Wongsuphasawat)* (watch video)

Krist Wongsuphasawat explained a framework for choosing data visualization libraries for the web, mainly Javascript libraries. He located libraries within a two-dimensional design space. The x-axis is the level of abstraction from 1 to 5. The y-axis are different categories of API design. Level of abstraction 1 is graphics libraries working on a low level. P5.js, Three.js, and Two.js fall into this category. Level 2 is low-level building blocks. D3, visx, cola, dagre, and others belong into this category. Level 3 is visualization grammars. Vega-lite, Chart Parts, Muze, and G2 are part of this category. Level 4 are high-level building block. Echarts, Highcharts, Plotly, Victory, React-Vis, and Semiotic belong into this category. Level 5 are chart templates. Chart.js and Nivo are part of this category. The other dimension, API design, consists of the categories JSON, JSON with callback, plain Javascript, and framework specific. He then showed how the different libraries are located within this dimension. He then explained how to choose a library. It should allow you to create what you need (custom, rare, or common data visualizations) within the time you have. Familiarity with a specific library plays a role here. Technical aspects that can be considered are performance, the used tech stack, and project lifespan (maintenance of the library in the long term).

Note: Krist Wongsuphasawat has also published a corresponding Nightingale article.

Design Space of data visualization libraries (Source: Krist Wongsuphasawat: Outlier 21 presentation)

ggplot Wizardy: My Favorite Tricks and Secrets for Beautiful Plots in R (Cédric Scherer)* (watch video)

Cédric Scherer explained how he creates print-ready charts entirely programmed in R with the ggplot2 library and extensions. He refined his R skills mainly within the weekly TidyTuesday challenge. The R community shares extension packages for a big variety of graphs and extra functionalities. He then demonstrated the capabilities of the extension packages he regularly uses in his work. The package ggtext provides improved text rendering. The package ggforce provides annotations. The package ggdist is useful for visualizing distributions and uncertainty. Then he showed several tips for improving charts within the ggplot2 library by changing default parameters. Plot-titles and plot-captions can be aligned with the outer margins. The legend can be placed at the top of the chart. The legend formatting can be improved. The axis labels can be placed closer to the axes. The clipping of elements that protrude beyond the borders of the chart, such as long labels, can be shut off. The outer margin between chart and border of the image can be enlarged. An image can be added to the plot to make it more illustrative. Finally he showed how the patchwork package can be used to combine and arrange several plots.

Data Visualization History

Otto and Gerd in the Chauvet Caves (Nigel Holmes)* (watch video)

Nigel Holmes explained how basic principles of information design can be traced back to early cave art. The earliest figurative cave art known to date is in Sulawesi from 45 500 years ago. Abstract marks from 70-100 000 years ago have been found in the Blombos cave. Such drawings might have been made by homo sapiens or other early homonids. Many of the known pictures of cave art are reproduced drawings, not actual photos of the art itself. Jumping forward to modern times, in the 1920s Otto Neurath and Gerd Arntz developed the Isotype graphic language to display statistical information. Neurath urged the artists to find the essence of the depicted object. Objects are shown in profile, from the side as a silhouette, omitting surface details. At first, icons were cut out from black cardboard, later they were printed as linocuts to obtain this simple appearance. A basic mechanism that is used in Isotype is to combine two icons into one. For instance, a waiter can be represented as a person with a coffee cup. The same principles of depicting the essential outline in sideview, and combining basic element into icons can be found in cave art. With combined elements, rhinos are shown wooly and with their summer coat. Thus it is valid to say that cave painter were the first information designers. “They were counting, recording, explaining, storytelling, while showing only the essentials.” Today the same principles can be found in roadsigns showing animal silhouettes, signs in airports, and emojis.

Florence Nightingale Is a Design Hero (RJ Andrews) (watch video)

RJ Andrews talked about the data visualization work of Florence Nightingale. Her charts were meant to be easily understandable and convince the army leadership of improving the medical care of soldiers. She worked together with several collaborators from different institutions.

Spotting Minard on the Corner Three (Senthil Natarajan) (watch video)

Senthil Natarajan demonstrated how he creates basketball data visualizations based on the styles of famous historic charts.

Data Art and Experimental Case Studies

3D Geo Dataviz: From Insight to Data Art (Craig Taylor)* (watch video)

Craig Taylor showed spectacular 3D visualizations of traffic data he develops at the company Ito. These cinematic visualizations serve to gather insight and for use as marketing material. He presented the project transit in motion which showed the change of patterns in public bus mobility during a Covid lockdown. He presented several possibilities of representing the data, some of which were quite experimental and artistic. Then he presented the project Europe’s quiet skies which shows the reduction of airplane flights in the Europe during the Covid crisis. In the Q&A session Craig Taylor explained that he uses QGIS and ESRI ArcMap for data preparation and visualizes the data using Houdini, Cinema 4D, and the Octane rendering engine.

Personal comment: This talk demonstrated the controversy around 3D data visualization and use of animations very well. On the one hand beautiful, spectacular images. On the other hand a way of presenting data that make it hard to derive deeper analytical insight.

Loud Numbers: Telling Stories with Data and Music (Miriam Quick, Duncan Geere) (watch video)

Miriam Quick and Duncan Geere gave an introduction to data sonification, which is the transformation of data into sounds. They also introduced their upcoming podcast Loud Numbers.

Data Through Design: Creating a Data Art Exhibition (Sara Eichner) (watch video)

Sara Eichner talked about the the Data Through Design exhibition taking place in New York. The exhibition shows data art based on New York open data. She discussed the challenges of exhibting data art in the corona crisis.

Using Data in a Fine Art Practice (Wilma Woolf) (watch video)

Wilma Wolf presented her physical data art and the processes and philosophy behind it. Her work focuses on women’s rights. It is important to her that high ethical standards are met during each manufacturing step of the art piece. She aims at the „death of the artist“, meaning that the final works stands for itself, without her as an artist being visible.

Step and Repeat: Visualizing Human Motion (Emma Margarite Erenst) (watch video)

Emma Margarita Erenst presented her physical data art works, mainly pieces of clothing, that deal with human motion and dance.

Coding with Fire: Cooking with Data (Ian Johnson, EJ Fox) (watch video)

Ian Johnson and EJ Fox talked about their streaming format twitch.tv/enjalot where they do live coding of Javascript in observable notebooks.

Data Visualization Case Studies

A Viral Map (Karim Douieb)* (watch video)

Karim Douieb showed how he developed an animated visualization of the results of the U.S. presidential election of 2016. This animation went viral on social media. The animation visualizes the fact that land doesn’t vote, people do, by transitioning each state area to a bubble proportional to the population of the state. He presented a detailed walkthrough of how he developed this animation in Javascript, using the Observable working environment and the D3 library. He used a D3 force layout to distribute the bubbles, and Flubber for the animated transitions. He published his result as a looping gif on social media. The attention that his work received when posted by others exceeded that of his own posting. He noted that a watermark should be added, to avoid one’s work being shared widely without attribution.

Mapping the Covid19 Research Landscape: The Power of Data Viz over Black Boxes (Caroline Goulard)* (watch video)

Caroline Goulard presented a tool for visualizing scientific papers about Covid. There currently exist more than 50 000 publications on this topic. This make it very difficult for researchers to find the relevant ones. “Dark knowledge” is a big problem. 50 % of publication on Covid are not cited, 6 % are not in English. The currently available tools such as Pubmed, Scopus, and Google Scholar only display search results as paginated lists. It is not transparent how these ranked lists were generated. Also the user needs to precisely specify what he is looking for. Caroline Goulard proposes spatial mapping as part of the solution. This helps get a mental representation of the data, helps interaction, and helps memorization. They developed two approaches. The first approach is a citations network graph, implemented via a force-directed graph. The second approach is a dimensions reduction map. Here publications that have similar keywords are located closer together in two-dimensional space. This replicates walking through a library and looking into the nearby shelves. This second approach was favored by interviewed users. Clusters of publications were created using hierarchical clustering. Each cluster was assigned a color. In the interface colors can also be assigned to years of publication, fields of study, and keywords. The interface also allows to look at the detailed metadata of each publication. In user testing it was found that people mainly use search functionalities, and then look at the map for confirmation. Users found using the tool a “disturbing experience”. So a sexy interface will not guarantee, that a tool will actually be used next time, instead of the standard tools. In the Q&A section Caroline Goulard explained that the application was programmed with WebGL and the HDBSCAN library.

How Do We Translate Cultural Experiences Into Data Stories? (Mick Yang, Isabella Chua) (watch video)

Mick Yang and Isabella Chua explained how they develop data stories at the Kontinentalist, a Singapore-based data journalism agency. The agency focuses on data stories dealing with asian culture. They advocate to have the courage to be niche and local in the data stories one tells.

Narrating a Nation Through Numbers – India in Pixels (Ashris Choudhury) (watch video)

Ashrin Choudhury presented his work on visualizing data on India for an Indian audience. He asks for feedback from several colleagues of diverse ethnical and regional backgrounds, in order to avoid cultural pitfalls.

Data Points Are People Too (Bronwen Robertson, Saja Hathman, Joachaim Mangalima, Zdenek Hynek) (watch video)

Bronwen Robertson, Saja Hathman, Joachaim Mangalima and Zdenek Hynek talked about their participation in different gloabal Data4change projects. They discussed how the covid crisis has impacted their work.

#BlackInDataWeek: Connecting and Celebrating Black People in Data Fields (Rith Agbakoba, Jarrett C. Hurms, Simone Webb) (watch video)

Rith Agbakoba, Jarrett C. Hurms, and Simone Webb presented initatives for black people working in data fields. They talked about the activities of BlackTides and BlackInData.

Visualizing the History of Mass Incarceration (Sarah Fawson) (watch video)

Sarah Fawson presented the results of her master’s thesis in which she visualized the history of mass incarceration in the USA. Her work shows that black men are disproportionately often imprisoned.

Visualizing Transgender Day of Remembrance: Lessons in Bearing Witness through Making Losses Visible and Visceral (Kelsey Campbell, Cathryn Ploehn) (watch video)

Kelsey Campbell and Cathryn Ploehn showed ongoing work where they visualize transgender people’s murderings.

Visualization of Violence in Colombia (Gustavo Ojeda) (watch video)

Gustavo Ojeda showed the data visualizations he is creating on violence in the Columbian society. Many people in Columbia do not have electricty and internet connection may be slow. He showed how data visualizations can be implemented technically to lower the amout of data transfered.

Are We Fine with Global warming? The Role of Nuclear Power & Low Carbon Energy (Harim Jung) (watch video)

Harim Jung discussed a dashboard where she showed CO2 emissions and electricity generation mix (renewable, fossil, nuclear) of different countries. Separating countries into four strata according to gross domestric product (GDP) shows that the countries with a high GDP emit a large share of global CO2.

Using DataViz to Re-sensitive the World to Animals (Karol Orzechowski) (watch video)

Karol Ozechowski demonstrated how he uses data visualizations to advocate for animals rights at Faunalytics. He identified three main problems in this field: problems of scale, problems of strategy, and problems of data opacity.

Shaping Data Viz through Student Newsrooms (Raeedah Wahid, Jessica Li) (watch video)

Raeedah Wahid and Jessica Li talked about their work at the university student newspaper Columbia Daily Specator. They explain how their newspaper built up data visualization expertise in the last years.

Becoming a Data Driven Learner (Aminah Aliu) (watch video)

Aminah Aliu showed how she determined the best time of day for her to study as a highschool student. She timed the durations she needed to solve problems of the card game Set at different times of day. Thus she could show that she performed better in the morning.

At the conference Jason Forrest and Mary Aviles announced that Nightingale, the online publication of the Data Visualization Society, will also appear as a printed magazine.

Thanks to DVS Events Director Mollie Pettit and the rest of the volunteering organization team for this event: Duncan Geere, Evelina Judeikyte, Gabrielle Merite, Lloyd Richards, Maxene Graze, Marília Ferreira da Cunha, Frederic Fery, Céline Genest, Katy Liang,Jennifer Li, Bill Tran, Yi Ning Wong Isabella Chua, Akshit Aggarwal, Nöelle Rakotondravony, Naomi Smulders

Blog article history

22.02.2021 First version published. Additional talk summaries added in the following days and weeks.

21.03.2021: Added links to the now public youtube videos.