Just ask Siri? A pilot study comparing smartphone digital assistants and laptop Google searches for smoking cessation advice

PLOS ONE, Mar 2018

Objective: To compare voice-activated internet searches by smartphone (two digital assistants) with laptop searches for information and advice related to smoking cessation.

Design: Responses to 80 questions on a range of topics related to smoking cessation (including the FAQ from an NHS website) were compared for quality.

Setting: Smartphone and internet searches as performed in New Zealand.

Main outcome measures: Ranked responses to the questions.

Results: Google laptop internet searches came first (or first equal) for best quality smoking cessation advice for 83% (66/80) of the responses. Voiced questions to Google Assistant (“OK Google”) came first/first equal 76% of the time vs Siri (Apple) at 28%. Google and Google Assistant were statistically significantly better than Siri searches (odds ratio 12.4 and 8.5 respectively, p<0.0001 in each comparison). When asked FAQs from the National Health Service website, or asked to find information on topics covered by Centers for Disease Control and Prevention videos, the best search results used expert sources 59% (31/52) of the time, “some expertise” (eg, Wikipedia) 18% of the time, but also magazines and other low quality sources 19% of the time. All three methods failed to find relevant information for 8% (6/80) of the questions, with Siri having the most failed responses (53% of the time).

Conclusion: Google internet searches and Google Assistant were found to be significantly superior to the Siri digital assistant for smoking cessation information. While expert content was returned over half the time, there is still substantial room for improvement in how these software systems deliver smoking cessation advice.



Matt Boyd 1☯*, Nick Wilson 2☯

1 Adapt Research Ltd, Reefton, New Zealand
2 Department of Public Health, University of Otago, Wellington, New Zealand

☯ These authors contributed equally to this work.
* matt@adaptresearchwriting.com

Editor: Albert Lee, The Chinese University of Hong Kong, HONG KONG

OPEN ACCESS

Data Availability Statement: All raw data are provided in the table in the supplementary file. An Excel file with all the results is available from the authors on request. The data contained in this paper and the Supporting Information file constitute the minimal underlying dataset.

Funding: The study was self-funded by the authors and no funder had any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The specific roles of these authors are articulated in the 'author contributions' section.

Competing interests: All authors have completed the Unified Competing Interest form (available on request from the corresponding author) and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work. MB is the owner and sole employee of Adapt Research Ltd; this does not alter our adherence to PLOS ONE policies on sharing data and materials.

Introduction

The internet is widely used for obtaining health-related information and advice. For example, in the United Kingdom, 41% of internet users report going online to find information for health-related issues, with about half of these (22% of all users) having done so in the previous week [1]. But many people are also wary of the information they find online and value trusted sources [2]. Improving search engine functionality offers a potential solution. For example, Google is cooperating with Mayo Clinic physicians to curate and check health data that is added to the database it uses for instant search results [3].
Similarly, National Health Service (NHS) England is working with Microsoft and Google to increase the visibility of NHS content online [4].

With increasing smartphone use there is also a particular case for studying health information obtainable with digital assistants on smartphones. Present literature on digital assistant use is very limited [5–7], and there appears to be no published research on the use of these tools in providing information or advice on smoking cessation. Therefore we aimed to assess the current situation using the digital assistants Siri and Google Assistant (GA) and to compare these with internet searches.

Methods

Selection of digital assistants

Siri (Apple) and GA (Google) were selected because they were in common use as personal digital assistants at the time of the pilot study in October 2017 [5, 6].

Selection of questions

The first set of questions (n = 35) was adapted from the most detailed “frequently asked questions (FAQ)” we could identify: that of the UK National Health Service (NHS) smokefree website [8]. The specific questions are listed in S1 Appendix, including slight modifications so they are relevant to an international audience. The next set of questions (n = 17) related to the most comprehensive list of short videos on smoking-related disease that we could identify: those produced by the Centers for Disease Control and Prevention (CDC) in the USA for the “Tips From Former Smokers” Campaign [9]. The final set of questions (n = 28) was devised by us to test responses to a range of features such as finding smoking-related pictures, diagrams, and instructional videos, and navigating to the nearest service/retailer for quitting-related products.

Data collection

Data were collected independently by both researchers on a pre-designed form and each independently conducted their own quality grading and rankings (internet search vs GA vs Siri).
For speaking into the smartphones, a maximum of three attempts was made per question by the two authors (both of whom had New Zealand accents). The smartphones used were an iPhone 5S and an iPhone 7, with settings for “English (New Zealand)”. For Google searches on laptops, the site used was that for New Zealand (https://www.Google.co.nz/), accessed using Google Chrome. Only the first non-advertisement link or information returned was considered in the analysis. All searches were conducted in October 2017 with both researchers being located in New Zealand (in the capital city and a small rural town, 250 km apart).

Hierarchy of information/advice quality

In independently grading the quality of the information and advice, we used the following hierarchy:

Grade A: Health agencies which had medical expertise whether local or international (eg, Ministry of Health, the national Quitline service, the NHS, CDC, universities, and hospitals).

Grade B: Sites with “some expertise”. Examples were Wikipedia and commercially orientated medical sites such as WebMD, or certified clinicians giving information directly.

Grade C: Online news items, online magazines and internet sites run by individuals and non-health organisations.

Analysis

Inter-rater agreement was calculated on the ratings of quality of the content and on which tools were best or equal best in answering each question. The frequency with which the three search tools provided the best information was compared using odds ratios.

Results

The tools frequently returned different search results to the two raters. On the 55 occasions that the best quality result was the same for both raters, there was 100% concordance of the raters' grading of quality of the information (grades: A, B or C). Cohen's kappa was calculated for the level of observer agreement for ranking which tool had returned the best or best equal information.
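As an illustration of the agreement statistic used here, the following is a minimal sketch of unweighted Cohen's kappa for two raters who each assign one categorical ranking per question. With three tools there are eight possible outcomes per question (one tool best alone, three pairs best equal, all three best equal, or none). The paper's per-question rankings are in S1 Appendix and are not reproduced here, so the labels and the six example rankings below are hypothetical, and the function name `cohens_kappa` is ours rather than the authors'.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n = len(rater_a)

    # Observed agreement: share of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of the raters' marginal label frequencies.
    freq_a = Counter(rater_a)
    freq_b = Counter(rater_b)
    labels = set(freq_a) | set(freq_b)
    expected = sum(freq_a[label] * freq_b[label] for label in labels) / (n * n)

    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical rankings for six questions (NOT the study data).
# Labels name the tool(s) judged best or best equal for that question.
rater_a = ["Google", "Google", "Google+GA", "none", "all", "Google"]
rater_b = ["Google", "Google+GA", "Google+GA", "none", "all", "all"]

print(f"kappa = {cohens_kappa(rater_a, rater_b):.2f}")
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it can be modest even when the raters agree on most questions.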
There were eight possible ranking choices for each question (one tool being best alone, or combinations of best equal, or none) and kappa was 0.45 when blinded, showing moderate agreement. This was likely lowered by instances where the search results returned were different between raters. When the analysis was restricted to instances where the content returned by the best-rated tool was the same for both raters, kappa rose to 0.56.

A laptop-based Google search provided the best or equal best information 83% (66/80) of the time (Table 1, see also S1 Appendix for specific results). GA was the better digital assistant, with 76% of the best (or best equal) responses, compared to Siri (28%). All three search approaches were classified as equally successful for only 18 questions (22%). The results for Google searches were not statistically significantly better than GA, but were considerably better than Siri, odds ratio (OR) = 12.4 (95% CI = 5.8–26.5, p<0.0001). GA was better than Siri with OR = 8.5 (95% CI = 4.2–17.3, p<0.0001). Google searches also had the lowest outright failure rate, providing no useful response for 9% (7/80) of the questions, compared to GA (14%, 12/80) and Siri (53%, 42/80). There was no significant difference between Google and GA, but Google was superior to Siri (p<0.0001), as was GA (p<0.0001). All three tools failed on only 8% (6/80) of the questions.

For assessing response quality, we considered just the questions relating to the NHS 35 FAQs and also those relating to the CDC's set of 17 videos on smoking cessation. Taking just the best result for each of these 52 questions, 59% (31/52) of the search questions were answered with a best answer that came from what we determined to be an expert source.
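The odds ratios reported in the Results can be checked from the "best or best equal" counts (66/80 for Google, 61/80 for GA, 22/80 for Siri). The sketch below assumes a Wald interval on the log odds ratio; the paper does not state which interval method or software was used, so this is one plausible reconstruction rather than the authors' procedure. The Google count of 66 is itself the mean of the two raters rounded up, as noted in Table 1. The function name `odds_ratio_wald` is ours.

```python
import math


def odds_ratio_wald(success_a, total_a, success_b, total_b, z=1.96):
    """Odds ratio of group A vs group B with a Wald confidence interval.

    z=1.96 gives an approximate 95% interval (an assumption; the paper
    does not say how its interval was computed).
    """
    a, b = success_a, total_a - success_a  # group A: successes, failures
    c, d = success_b, total_b - success_b  # group B: successes, failures
    if min(a, b, c, d) <= 0:
        raise ValueError("the Wald interval needs all four cells to be positive")

    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) is the root of the summed reciprocal cells.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    log_or = math.log(odds_ratio)
    lower = math.exp(log_or - z * se_log_or)
    upper = math.exp(log_or + z * se_log_or)
    return odds_ratio, lower, upper


# Counts of "best or best equal" responses out of 80 questions.
google_vs_siri = odds_ratio_wald(66, 80, 22, 80)
ga_vs_siri = odds_ratio_wald(61, 80, 22, 80)

for name, (or_, lo, hi) in [("Google vs Siri", google_vs_siri),
                            ("GA vs Siri", ga_vs_siri)]:
    print(f"{name}: OR = {or_:.1f} (95% CI {lo:.1f} to {hi:.1f})")
```

Under these assumptions the sketch gives an OR of about 12.4 (5.8 to 26.5) for Google vs Siri and about 8.5 (4.2 to 17.2) for GA vs Siri. The upper limit for GA differs slightly from the published 17.3, which may reflect a different interval method or rounding.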
The expert sources included the CDC (n = 10), Cancer.org (n = 6), NHS (n = 4), and a range of other medical expert-endorsed sites, eg, hospitals, specialist clinics, and universities. Around a fifth (18%, 10/52) of searches provided websites with “some expertise” such as Wikipedia articles and commercially orientated ones (eg, private medical clinics), and 19% of searches provided only news items or magazine articles.

Table 1

Measure                                     Typed Google search    Google Assistant    Siri
                                            on a laptop            (GA)
Best or best equal, all questions           83% (66#/80)           76% (61/80)         28% (22/80)
[NHS FAQ questions]                         90% (32/35)            79% (28/35)         49% (17/35)
[CDC video questions]                       79% (14/17)            85% (15/17)         0% (0/17)
[Authors' own questions]                    75% (21/28)            66% (19/28)         18% (5/28)
No useful response                          9% (7/80)              14% (12/80)         53% (42/80)
[label not recoverable]                     21% (17/80)            28% (22/80)         8% (6/80)
[label not recoverable]                     0.4 adverts            0.6 adverts         0.3 adverts
[Grade A sources, NHS and CDC questions]    52% (27/52)            49% (26/52)         24% (13/52)
[Grade B sources, NHS and CDC questions]    22% (12/52)            20% (11/52)         13% (7/52)
[Grade C sources, NHS and CDC questions]    22% (12/52)            24% (13/52)         13% (7/52)

Notes: # mean of two raters rounded up to next whole number; statistical tests compared GA to Siri: p<0.01, p<0.001. Row labels in square brackets are inferred from the denominators and the text.

Discussion

Main findings and interpretation

Our search results were encouraging in terms of the usefulness of the information provided, with nearly 60% of searches returning expert content on at least one tool, and Google and GA returning expert content about half the time. However, all search modalities could improve on the chances of finding expert information. Our results are nevertheless consistent with the only other reported health-related study, which was undertaken in 2015/2016 [7]. It found that Siri and other smartphone assistants sometimes trivialised important general health inquiries or failed to provide appropriate information. We found that all tools had trouble finding gay and lesbian-specific information, that Siri was poor when videos were requested by content, and that all three tools sometimes returned magazine or blog content instead of professional health advice. The responses sometimes included a useful Google summary box and/or a diagram.
The summary was often read out verbally by the digital assistants, which has obvious advantages for people with disabilities or in situations where the questioner is doing other activities.

There was notable variation in the search results between the two researchers. For example, when asked to find an antismoking advertisement, rater A was directed to a New Zealand public health campaign advertisement, while rater B was shown a YouTube video of the ‘top 40 scariest antismoking ads’ from around the world (S1 Appendix). This variation may reflect the impact of location, Google search history, demographics, and ongoing changes in website traffic and website links on search algorithms.

Study strengths and limitations

A strength is that this study is the first to consider smartphone digital assistants for the provision of smoking cessation information and advice. It also used questions derived from expert sources (NHS and CDC) and tested a wide range of smartphone functionalities with the two researchers collecting data independently. But a possible limitation is that our results might be better than those obtained for questions asked in the real world, since we used reasonably precise wording and terms, as opposed to slang words or colloquialisms that some of the public might use. On the other hand, we only considered the first result returned in each search list, and there were often superior sites listed after the initial sites.

Potential research implications

These pilot results demonstrate that a range of useful information is returned to users of digital assistants when asking for smoking cessation advice. This suggests that a larger study of actual smokers wanting to quit is warranted. The larger study could investigate the user experience as well as the quality of the information returned by digital assistants.
In the meantime, however, software designers and health authorities should continue to work together to improve search functionality, as is starting to happen in some localities [3, 4].

Conclusions

Google internet searches and Google Assistant were found in this pilot study to be significantly superior to the Siri digital assistant for sourcing smoking cessation content. While expert content was returned over half the time, there is still substantial room for improvement in how these software systems deliver smoking cessation advice.

Supporting information

S1 Appendix. Search results by question. (DOCX)

Author Contributions

Conceptualization: Nick Wilson.
Data curation: Matt Boyd, Nick Wilson.
Formal analysis: Matt Boyd.
Investigation: Matt Boyd.
Methodology: Matt Boyd, Nick Wilson.
Project administration: Matt Boyd, Nick Wilson.
Resources: Matt Boyd, Nick Wilson.
Supervision: Nick Wilson.
Validation: Nick Wilson.
Writing – original draft: Matt Boyd, Nick Wilson.
Writing – review & editing: Matt Boyd, Nick Wilson.

References

1. Ofcom. Adults' media use and attitudes. Report 2017. London: Ofcom, 2017. https://www.ofcom.org.uk/__data/assets/pdf_file/0020/102755/adults-media-use-attitudes-2017.pdf.
2. Higgins O, Sixsmith J, Barry M, Domegan C. A literature review on health information seeking behaviour on the web: a health consumer and health professional perspective. Stockholm: ECDC; 2011.
3. Gibbs S. Google to put health information directly into search results. The Guardian 2015 (10 February). https://www.theguardian.com/technology/2015/feb/10/Google-health-information-directly-into-searchresults.
4. Stevens L. NHS England working with internet giants to promote digital tools. Digital Health 2017 (9 March). https://www.digitalhealth.net/2017/03/nhs-england-working-with-us-internet-giants-topromote-digital-tools/.
5. Dunn J. We put Siri, Alexa, Google Assistant, and Cortana through a marathon of tests to see who's winning the virtual assistant race - here's what we found. Business Insider 2016 (7 November). https://www.businessinsider.com.au/siri-vs-Google-assistant-cortana-alexa-2016-11?r=US&IR=T#/#withthat-out-of-the-way-onto-the-tests-2.
6. Hachman M. Hands-on: Google Assistant's Allo chatbot outdoes Cortana, Siri as your digital pal. PCWorld 2016 (22 September). http://www.pcworld.com/article/3122482/android/hands-on-Googleassistants-allo-chatbot-outdoes-cortana-siri-as-your-digital-pal.html.
7. Miner AS, Milstein A, Schueller S, Hegde R, Mangurian C, Linos E. Smartphone-based conversational agents and responses to questions about mental health, interpersonal violence, and physical health. JAMA Intern Med 2016;176:619–25.
8. National Health Service. Smokefree NHS: Frequently asked questions. https://www.nhs.uk/smokefree/frequently-asked-questions (Accessed 20 September 2017).
9. Centers for Disease Control and Prevention. Tips from former smokers: Videos. https://www.cdc.gov/tobacco/campaign/tips/resources/videos/index.html?s_cid=OSH_tips_D9390 (Page last updated: August 8, 2017).


Full text (PDF): http://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0194811&type=printable

Matt Boyd, Nick Wilson. Just ask Siri? A pilot study comparing smartphone digital assistants and laptop Google searches for smoking cessation advice, PLOS ONE, 2018, DOI: 10.1371/journal.pone.0194811