This DataViz Makeover will critique and improve 2 visualizations which portray the results of a study conducted to understand the willingness of the public to take COVID19 vaccinations
| s/n | Comment | Suggestion |
|---|---|---|
| 1 | The legend title ‘Vac1’ is meaningless. | Rename title ‘Vac1’ to ‘Response’. |
| 2 | The ‘% of strongly agreed to vaccination’ chart is contained within the ‘Which country is more pro-vaccine’ chart. It can be reproduced by merely changing the way the latter chart is presented. | Do away with the ‘% of strongly agreed to vaccination’ chart as it does not concern anything new on top of the first chart. |
| 3 | Continuing from comment 2) above, it also isn’t obvious from the titles of both charts whether they are referring to the same survey question. The chart on the left has the title ‘Which country is more pro-vaccine’ while the chart on the right has the title ‘% of strongly agreed to vaccination’. On inspection and comparison of the bars, it is apparent that they are visualizations of the same question, but the titles alone are not clear about that. | If 2 charts are referring to the same survey question, their charts should be clear about it. |
| 4 | The visualization only shows the responses by proportion of the countries’ respondents but does not provide insight into the absolute number of respondents. If the sample size for a particular country is small, then the results for that country may not be properly representative. | It will help to add information on the absolute number of respondents and possibly add statistical measures like error bars. |
| 5 | Legend does not explain explicitly what responses 2, 3, 4 represent. | Label 2 as Slightly Agree, 3 as Neutral, 4 as Slightly Disagree |
| 6 | x-axis labelling unclear. For example, ‘% of Total Record’ is unclear and could mean % of all respondents to someone who doesn’t know the background of the data. | Rename ‘% of Total Record’ as ‘% of each country’s respondents’. |
| s/n | Comment | Suggestion |
|---|---|---|
| 7 | For ‘Which country is more pro-vaccine?’ chart, the responses are portrayed as ‘5’ on the left to ‘1’ on the right. This is the inverse way in which the legend is portrayed, which isn’t the most visually intuitive. | Portray the responses as 1 on the left to 5 on the right. |
| 8 | The x-axis of the left chart is in whole numbers while the x-axis of the right chart is to 1 decimal place. There is no need for there to be 1 decimal place precision as the x-axis intervals are already so wide. | Make both x-axis of the same whole number precision. |
| 9 | It is aesthetically odd for the ‘united kingdom’ bar to go beyond the maximum value of the x-axis. | Make all of the chart values to stay within the maximum bound of axes. |
| 10 | If x-axis already shows units is %, no need to label each tick with %. | Remove % from the x-axis tick marks. |

| s/n | Advantage |
|---|---|
| 1 | The Likert Scale is good to visualize attitude or belief items such as the willingness to receive the COvID19 vaccine in this study. By centering around 0, it is visually intuitive at one glance to see if respondents of a country lean more towards agreeing or disagreeing. |
| 2 | By allowing for filtering of different profiles such as gender, age group, household size bracket etc, the user is able to derive more pointed insights rather than a static view of the entire population |
| 3 | By showing the Error Bars in the Dot Plot, one is able to be visually informed how certain one can be of the study results based on the data of each group of respondents. It provides visual information regarding the variability of the data and is useful as predictors of the range of new samples. |
| 4 | Allowing for the user to select from a list of questions regarding public perception of the vaccine in different contexts allows for a variety of insights regarding the issue, rather than just from one perspective as provided in the original visualization. |
Click for link to Tableau Public post
We go through the Data Source Preparation steps first before going through the steps to build the 2 charts proper.
Download the data for the relevant countries from https://github.com/YouGov-Data/covid-19-tracker/tree/master/data
Open ‘australia.csv’ as the first Data Source.
Remove it as shown in the screengrab below.

Rename ‘Table Name’ field to ‘Country’.
Create aliases for all the survey questions’ responses (vac_1, vac_2, vac2_1, vac2_2, vac2_3, vac2_6, vac_3) as shown below. This is to facilitate building calculated Field later on where there is inequality in the formula based on the survey responses.

The ages are binned accordingly so that viewers can view results by meaningful distinct age groups.
The household sizes are binned accordingly so that viewers can view results by meaningful household size brackets
The number of children in each household is binned accordingly so that viewers can view results by meaningful brackets of the number of children in each household.
This reverses the order of the scores where 5-Strongly Disagree now has a score of 1, while 1-Strongly Agree has a score of 5. This is for the construction of the Likert Scale where responses below 3 (Neutral) are for negative responses. Create the same fields for all the other survey questions (vac_2, vac2_1, vac2_2, vac2_3, vac2_6, vac_3).
Make sure to label in the Response Legend that 1 corresponds to Strongly Disagree, 2 to Disagree, 3 to Neutral, 4 to Agree and 5 as Strongly Agree.
Drag Gantt Percent to Columns
Drag Country to Rows. Chart will first look as follows:
Drag Response to Detail under the Marks tab
Under Gantt Percent, select Compute using Response as follows:

Change Chart Type to Gantt Bar under Marks
Drag Response to Colour under the Marks tab
Drag Percentage to Size under the Marks tab. Chart will now look as follows:
Drag Score into Filters tab and uncheck Null so Null responses are disregarded throughout.
Show the Parameter ‘Select Question’. This shows up as a Dropdown Bar.
Change the Chart Title to be dynamic according to the question selected as follows:
Show Age Group as a Multiple Values Dropdown list
Show gender as a Multiple values list
Include Null as an allowed value although it accounts for an insignificant number of responses. User can uncheck it to phase it out if desired.
Show employment_status as a Multiple Values Dropdown list
Show Household Size Bracket as a Multiple Values Dropdown list
Show Number of Children as a Multiple Values Dropdown list
Likert Scale is as follows, before any formatting:
We next build Dot Plot with Error Bars. For each set of filters selected by the user, this Dot Plot will show the Proportion of Respondents from each Country with a ‘Strongly Agree’ response, accompanied by Error Bars showing the 95% Confidence Intervals. This is an improvement from the original ‘% of strongly agreed to vaccination’ chart, as it not only allows for filtering of profiles but also shows the confidence intervals based on the sample size of each country.
This is to mark each Strongly Agree response as 1 count. Create the same fields for the other 4 responses.
This is to count the proportion of respondents which selected Strongly Agree out of the total number of records. Create the same fields for the other 4 responses.
This is to calculate the standard error. Create the same fields for the other 4 responses.
This is the critical z-score for 95% confidence level
This is to calculate the upper 95% confidence level for the proportion of respondents who answered Strongly Agree. Create the same fields for the other 4 responses.
This is to calculate the lower 95% confidence level for the proportion of respondents who answered Strongly Agree. Create the same fields for the other 4 responses.
This is to generate the correct lower 95% level according to the type of response which the user wishes to see the lower 95% level for.
This is to generate the correct upper 95% level according to the type of response which the user wishes to see the upper 95% level for.
Drag ‘Select Response Type’ to Columns
Drag Country to Rows
Change Marks type to Circle. At this point chart looks as follows:

Create a new Dashboard
Drag sheets ‘Likert’ and ‘Final Error Bars’ into the dashboard side by side. It will look as follows:
Move the Select Question dropdown list to the top. This is so that the entire question can be shown and not truncated.
Move the select ‘Type of Response’ dropdown list below ‘Select Question’
Drag all the profile filters (gender, Age group, Household Size Bracket, Number of children and employment status) to the bottom of the dashboard.
Create a text box to direct the user to filter to view results based on different profile combinations.
Capitalize first letter of each profile filter group to tidy the look.
go to the Final Error Bars sheet. To show the number of records for each country for that particular question, drag ‘Number of Records’ into the All tab under Marks as follows:
Final product:
This is in line with scientific evidence that females are generally more risk-averse than males.
This is probably due to the fact that there has been published statistical evidence that the elderly are more susceptible to being more adversely affected and have a higher death rate when hit by COVID19 as compared to the young, who have a much higher and faster recovery rate.