skip to main content
10.1145/3677045.3685418acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnordichiConference Proceedingsconference-collections
extended-abstract

Hiring an AI: Incorporating Personnel Selection Methods in User-Centered Design to Design AI Agents for Safety-Critical Domains

Authors:
Stephan Huber
Chair of Psychological Ergonomics, Julius-Maximilians-Universität Würzburg, Germany
,
Nathalie Bettina Elisabetha Papenfuß
Chair of Psychological Ergonomics, Julius-Maximilians-Universität Würzburg, Germany
,
Lea Weppert
Chair of Psychological Ergonomics, Julius-Maximilians-Universität Würzburg, Germany
,
Valentina Wohlfart
Chair of Psychological Ergonomics, Julius-Maximilians-Universität Würzburg, Germany
,
Johannes Basch
Fakultät Wirtschaftswissenschaften, Hochschule Neu-Ulm, Germany
,
Oliver Happel
Universitätsklinikum Würzburg, Germany
,
Tobias Grundgeiger
Chair of Psychological Ergonomics, Julius-Maximilians-Universität Würzburg, Germany
Authors Info & Claims
Published: 13 October 2024 Publication History

Abstract

Enhancing human teams with AI is currently aspired across application domains. However, the prospective team role of high-performing AI-agents is rarely considered. In this paper, we propose a new method complementing existing user-centered design processes to define an AI-agent personality that suits the team. Inspired by personnel selection methods, the hiring an AI workshop prompts teams from safety-critical domains to 1) remember past incidents, 2) envision the personality and competence of their desired teammate, and 3) validate how the teammate would act in described incidents. Clustering and modeling the data made it accessible to interdisciplinary teams. Incorporating the models into the ideation process and revisiting the data for a personality drill-down exceeded insights from contextual inquiries and had a measurable impact (i.e., downstream utility) on the visions. Participants’ humanizing descriptions of a knowing and competent, yet modest teammate raise challenges for technical implementation and questions on future team-collaboration in safety-critical domains.

Supplemental Material

ZIP File
This online supplement consists of a workshop agenda and character models. All templates are translated from German and cleared of raw user data for comprehensibility. During our wall walk, the four models were presented next to an affinity diagram of positive and negative team behaviour. The four models include (1) Cassandra?s profile, (2) eight personas, (3) characteristics of personality traits, and (4) personality traits and behaviors in critical situations.

References

[1]
Rangina Ahmad, Dominik Siemon, Ulrich Gnewuch, and Susanne Robra-Bissantz. 2022. A framework of personality cues for conversational agents. In 55th Hawaii International Conference on System Sciences. Shidler College of Business, Universtiy of Hawaii, Manoa, Honolulu, Hawaii, 10 pages. https://doi.org/10.24251/HICSS.2022.524
[2]
Christiane Attig, Patricia Wollstadt, Tim Schrills, Thomas Franke, and Christiane B. Wiebel-Herboth. 2024. More than Task Performance: Developing New Criteria for Successful Human-AI Teaming Using the Cooperative Card Game Hanabi. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, Article 245. https://doi.org/10.1145/3613905.3650853
[3]
Jerry Ball, Christopher Myers, Andrea Heiberg, Nancy J. Cooke, Michael Matessa, Mary Freiman, and Stuart Rodgers. 2010. The synthetic teammate project. Computational and Mathematical Organization Theory 16, 3 (2010), 271–299. https://doi.org/10.1007/s10588-010-9065-3
[4]
Gagan Bansal, Tongshuang Wu, Joyce Zhou, Raymond Fok, Besmira Nushi, Ece Kamar, Marco Tulio Ribeiro, and Daniel Weld. 2021. Does the whole exceed its parts? The effect of AI explanations on complementary team performance. In 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 1–16. https://doi.org/10.1145/3411764.3445717
[5]
Arngeir Berge, Frode Guribye, Siri-Linn Schmidt Fotland, Gro Fonnes, Ingrid H. Johansen, and Christoph Trattner. 2023. Designing for control in nurse-AI collaboration during emergency medical calls. In 2023 ACM Designing Interactive Systems Conference (DIS ’23). Association for Computing Machinery, New York, NY, USA, 1339–1352. https://doi.org/10.1145/3563657.3596110
[6]
Virginia Braun and Victoria Clarke. 2022. Thematic Analysis. SAGE Publications Ltd, London, UK.
[7]
Tara Capel and Margot Brereton. 2023. What is human-centered about human-centered AI? A map of the research landscape. In 2023 CHI Conference on Human Factors in Computing Systems (CHI’23). Association for Computing Machinery, New York, NY, USA, 23 pages. https://doi.org/10.1145/3544548.3580959
[8]
Stuart K. Card, Thomas P. Moran, and Allen Newell. 1986. The model human processor- An engineering model of human performance. Handbook of perception and human performance. 2, 45–1 (1986), 6 pages.
[9]
Charles K. Coe. 1992. The MBTI: Potential Uses and Misuses in Personnel Administration. Public Personnel Management 21, 4 (1992), 511–522. https://doi.org/10.1177/009102609202100407
[10]
Andreea Danielescu and Gwen Christian. 2018. A bot is not a polyglot: Designing personalities for multi-lingual conversational agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems (CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–9. https://doi.org/10.1145/3170427.3174366
[11]
Tripp Driskell, James E. Driskell, C. Shawn Burke, and Eduardo Salas. 2017. Team Roles: A Review and Integration. Small Group Research 48, 4 (2017), 482–511. https://doi.org/10.1177/1046496417711529
[12]
John C. Flanagan. 1954. The critical incident technique. Psychological Bulletin 51, 4 (1954), 327–358. https://doi.org/10.1037/h0061470
[13]
H. Rex Hartson, Terence S. Andre, and Robert C. Williges. 2001. Criteria for evaluating usability evaluation methods. International journal of human-computer interaction 13, 4 (2001), 373–410.
[14]
Jennifer Heier. 2021. Design intelligence - taking further steps towards new methods and tools for designing in the age of AI. Lecture Notes in Coomputer Science, Vol. 12797. Springer, Cham, 202–215. https://doi.org/10.1007/978-3-030-77772-2_13
[15]
Anna Hohm, Oliver Happel, Jörn Hurtienne, and Tobias Grundgeiger. 2023. "When the Beeping Stops, you Completely Freak Out"-How Acute Care Teams Experience and Use Technology. Proceedings of the ACM on Human-Computer Interaction 7, CSCW1 (2023), 29 pages. https://doi.org/10.1145/3579590
[16]
Karen Holtzblatt and Hugh Beyer. 2017. Contextual Design: Design for Life. Morgan Kaufmann, Burlington, Massachusetts, USA.
[17]
Hankyung Kim, Dong Yoon Koh, Gaeun Lee, Jung-Mi Park, and Youn-kyung Lim. 2019. Designing personalities of conversational agents. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (CHI EA ’19). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3290607.3312887
[18]
Taenyun Kim, Maria D. Molina, Minjin Rheu, Emily S. Zhan, and Wei Peng. 2023. One AI does not fit all: A cluster analysis of the laypeople’s perception of AI roles. In 2023 CHI Conference on Human Factors in Computing Systems (CHI ’23). Association for Computing Machinery, New York, NY, USA, 20 pages. https://doi.org/10.1145/3544548.3581340
[19]
Effie Lai-Chong Law. 2006. Evaluating the downstream utility of user tests and examining the developer effect: A case study. International Journal of Human-Computer Interaction 21, 2 (2006), 147–172.
[20]
Kris Liu, Jackson Tolins, Jean E. Fox Tree, Michael Neff, and Marilyn A. Walker. 2016. Two techniques for assessing virtual agent personality. IEEE Transactions on Affective Computing 7, 1 (2016), 94–105. https://doi.org/10.1109/TAFFC.2015.2435780
[21]
Joseph B. Lyons and Kevin T. Wynne. 2021. Human-machine teaming: Evaluating dimensions using narratives. Human-Intelligent Systems Integration 3, 2 (2021), 129–137. https://doi.org/10.1007/s42454-020-00019-7
[22]
Tim Merritt and Kevin McGee. 2012. Protecting artificial team-mates: More seems like less. In SIGCHI Conference on Human Factors in Computing Systems (CHI ’12). Association for Computing Machinery, New York, NY, USA, 2793–2802. https://doi.org/10.1145/2207676.2208680
[23]
Robert J. Moore and Raphael Arar. 2019. Conversational UX design: A practitioner’s guide to the natural conversation framework. Association for Computing Machinery, New York, NY, USA.
[24]
Cecily Morrison, Edward Cutrell, Martin Grayson, Anja Thieme, Alex Taylor, Geert Roumen, Camilla Longden, Sebastian Tschiatschek, Rita Faia Marques, and Abigail Sellen. 2021. Social sensemaking with AI: Designing an open-ended AI experience with a blind child. In 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 14 pages. https://doi.org/10.1145/3411764.3445290
[25]
National Academies of Sciences, Engineering and Medicine. 2021. Human-AI teaming: State of the art and research needs. The National Academies Press, Washington, DC. 142 pages. https://doi.org/10.17226/26355
[26]
Fritz Ostendorf and Alois Angleitner. 2004. Neo-Persönlichkeitsinventar nach Costa und McCrae: Neo-PI-R [Neo-personality inventory by Costa and McCrae]; Manual (revised ed.). Hogrefe, Göttingen, Germany.
[27]
Max Pellert, Clemens M. Lechner, Claudia Wagner, Beatrice Rammstedt, and Markus Strohmaier. 2024. AI psychometrics: Assessing the psychological profiles of large language models through psychometric inventories. Perspectives on Psychological Science 0, 0 (2024), 19 pages. https://doi.org/10.1177/17456916231214460
[28]
Beatrice Rammstedt and Oliver P. John. 2007. Measuring personality in one minute or less: A 10-item short version of the Big Five Inventory in English and German. Journal of Research in Personality 41, 1 (2007), 203–212. https://doi.org/10.1016/j.jrp.2006.02.001
[29]
Beatrice Rammstedt, C. J. Kemper, M. C. Klein, Constanze. Beierlein, and A. Kovaleva. 2014. Big Five Inventory (BFI-10). Zusammenstellung sozialwissenschaftlicher Items und Skalen (ZIS) (2014). https://doi.org/10.6102/zis76
[30]
Ben Shneiderman. 2020. Design lessons from AI’s two grand goals: Human emulation and useful applications. IEEE Transactions on Technology and Society 1, 2 (2020), 73–82. https://doi.org/10.1109/TTS.2020.2992669
[31]
Sinan Sonlu, Uğur Güdükbay, and Funda Durupinar. 2021. A conversational agent framework with multi-modal personality expression. ACM Transactions on Graphics 40, 1 (2021), Article 7, 16 pages. https://doi.org/10.1145/3439795
[32]
Samantha Straka, Martin Jakobus Koch, Astrid Carolus, Marc Erich Latoschik, and Carolin Wienrich. 2023. How do employees imagine AI they want to work with: A drawing study. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, Article 160. https://doi.org/10.1145/3544549.3585631
[33]
Kyoko Sugisaki and Andreas Bleiker. 2020. Usability guidelines and evaluation criteria for conversational user interfaces: a heuristic and linguistic approach. In Mensch und Computer 2020 (MuC ’20). Association for Computing Machinery, New York, NY, USA, 309–319. https://doi.org/10.1145/3404983.3405505
[34]
Rui Zhang, Wen Duan, Christopher Flathmann, Nathan McNeese, Guo Freeman, and Alyssa Williams. 2023. Investigating AI teammate communication strategies and their impact in human-AI teams for effective teamwork. Proceedings of the ACM on Human-Computer Interaction 7, CSCW2 (2023), Article 281, 31 pages. https://doi.org/10.1145/3610072

Index Terms

  1. Hiring an AI: Incorporating Personnel Selection Methods in User-Centered Design to Design AI Agents for Safety-Critical Domains

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      NordiCHI '24 Adjunct: Adjunct Proceedings of the 2024 Nordic Conference on Human-Computer Interaction
      October 2024
      385 pages
      ISBN:9798400709654
      DOI:10.1145/3677045
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 13 October 2024

      Check for updates

      Author Tags

      1. AI Agent
      2. Human-AI Teaming
      3. Personality
      4. Personnel Selection
      5. Safety Critical

      Qualifiers

      • Extended-abstract
      • Research
      • Refereed limited

      Funding Sources

      • German Federal Ministry of Education and Research

      Conference

      NordiCHI Adjunct 2024

      Acceptance Rates

      Overall Acceptance Rate 379 of 1,572 submissions, 24%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 16
        Total Downloads
      • Downloads (Last 12 months)16
      • Downloads (Last 6 weeks)16
      Reflects downloads up to 19 Nov 2024

      Other Metrics

      Citations

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media

      Get Access

      Get Access

      Login options

      References

      References

      [1]
      Rangina Ahmad, Dominik Siemon, Ulrich Gnewuch, and Susanne Robra-Bissantz. 2022. A framework of personality cues for conversational agents. In 55th Hawaii International Conference on System Sciences. Shidler College of Business, Universtiy of Hawaii, Manoa, Honolulu, Hawaii, 10 pages. https://doi.org/10.24251/HICSS.2022.524
      [2]
      Christiane Attig, Patricia Wollstadt, Tim Schrills, Thomas Franke, and Christiane B. Wiebel-Herboth. 2024. More than Task Performance: Developing New Criteria for Successful Human-AI Teaming Using the Cooperative Card Game Hanabi. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, Article 245. https://doi.org/10.1145/3613905.3650853
      [3]
      Jerry Ball, Christopher Myers, Andrea Heiberg, Nancy J. Cooke, Michael Matessa, Mary Freiman, and Stuart Rodgers. 2010. The synthetic teammate project. Computational and Mathematical Organization Theory 16, 3 (2010), 271–299. https://doi.org/10.1007/s10588-010-9065-3
      [4]
      Gagan Bansal, Tongshuang Wu, Joyce Zhou, Raymond Fok, Besmira Nushi, Ece Kamar, Marco Tulio Ribeiro, and Daniel Weld. 2021. Does the whole exceed its parts? The effect of AI explanations on complementary team performance. In 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 1–16. https://doi.org/10.1145/3411764.3445717
      [5]
      Arngeir Berge, Frode Guribye, Siri-Linn Schmidt Fotland, Gro Fonnes, Ingrid H. Johansen, and Christoph Trattner. 2023. Designing for control in nurse-AI collaboration during emergency medical calls. In 2023 ACM Designing Interactive Systems Conference (DIS ’23). Association for Computing Machinery, New York, NY, USA, 1339–1352. https://doi.org/10.1145/3563657.3596110
      [6]
      Virginia Braun and Victoria Clarke. 2022. Thematic Analysis. SAGE Publications Ltd, London, UK.
      [7]
      Tara Capel and Margot Brereton. 2023. What is human-centered about human-centered AI? A map of the research landscape. In 2023 CHI Conference on Human Factors in Computing Systems (CHI’23). Association for Computing Machinery, New York, NY, USA, 23 pages. https://doi.org/10.1145/3544548.3580959
      [8]
      Stuart K. Card, Thomas P. Moran, and Allen Newell. 1986. The model human processor- An engineering model of human performance. Handbook of perception and human performance. 2, 45–1 (1986), 6 pages.
      [9]
      Charles K. Coe. 1992. The MBTI: Potential Uses and Misuses in Personnel Administration. Public Personnel Management 21, 4 (1992), 511–522. https://doi.org/10.1177/009102609202100407
      [10]
      Andreea Danielescu and Gwen Christian. 2018. A bot is not a polyglot: Designing personalities for multi-lingual conversational agents. In Extended Abstracts of the 2018 CHI Conference on Human Factors in Computing Systems (CHI EA ’18). Association for Computing Machinery, New York, NY, USA, 1–9. https://doi.org/10.1145/3170427.3174366
      [11]
      Tripp Driskell, James E. Driskell, C. Shawn Burke, and Eduardo Salas. 2017. Team Roles: A Review and Integration. Small Group Research 48, 4 (2017), 482–511. https://doi.org/10.1177/1046496417711529
      [12]
      John C. Flanagan. 1954. The critical incident technique. Psychological Bulletin 51, 4 (1954), 327–358. https://doi.org/10.1037/h0061470
      [13]
      H. Rex Hartson, Terence S. Andre, and Robert C. Williges. 2001. Criteria for evaluating usability evaluation methods. International journal of human-computer interaction 13, 4 (2001), 373–410.
      [14]
      Jennifer Heier. 2021. Design intelligence - taking further steps towards new methods and tools for designing in the age of AI. Lecture Notes in Coomputer Science, Vol. 12797. Springer, Cham, 202–215. https://doi.org/10.1007/978-3-030-77772-2_13
      [15]
      Anna Hohm, Oliver Happel, Jörn Hurtienne, and Tobias Grundgeiger. 2023. "When the Beeping Stops, you Completely Freak Out"-How Acute Care Teams Experience and Use Technology. Proceedings of the ACM on Human-Computer Interaction 7, CSCW1 (2023), 29 pages. https://doi.org/10.1145/3579590
      [16]
      Karen Holtzblatt and Hugh Beyer. 2017. Contextual Design: Design for Life. Morgan Kaufmann, Burlington, Massachusetts, USA.
      [17]
      Hankyung Kim, Dong Yoon Koh, Gaeun Lee, Jung-Mi Park, and Youn-kyung Lim. 2019. Designing personalities of conversational agents. In Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems (CHI EA ’19). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3290607.3312887
      [18]
      Taenyun Kim, Maria D. Molina, Minjin Rheu, Emily S. Zhan, and Wei Peng. 2023. One AI does not fit all: A cluster analysis of the laypeople’s perception of AI roles. In 2023 CHI Conference on Human Factors in Computing Systems (CHI ’23). Association for Computing Machinery, New York, NY, USA, 20 pages. https://doi.org/10.1145/3544548.3581340
      [19]
      Effie Lai-Chong Law. 2006. Evaluating the downstream utility of user tests and examining the developer effect: A case study. International Journal of Human-Computer Interaction 21, 2 (2006), 147–172.
      [20]
      Kris Liu, Jackson Tolins, Jean E. Fox Tree, Michael Neff, and Marilyn A. Walker. 2016. Two techniques for assessing virtual agent personality. IEEE Transactions on Affective Computing 7, 1 (2016), 94–105. https://doi.org/10.1109/TAFFC.2015.2435780
      [21]
      Joseph B. Lyons and Kevin T. Wynne. 2021. Human-machine teaming: Evaluating dimensions using narratives. Human-Intelligent Systems Integration 3, 2 (2021), 129–137. https://doi.org/10.1007/s42454-020-00019-7
      [22]
      Tim Merritt and Kevin McGee. 2012. Protecting artificial team-mates: More seems like less. In SIGCHI Conference on Human Factors in Computing Systems (CHI ’12). Association for Computing Machinery, New York, NY, USA, 2793–2802. https://doi.org/10.1145/2207676.2208680
      [23]
      Robert J. Moore and Raphael Arar. 2019. Conversational UX design: A practitioner’s guide to the natural conversation framework. Association for Computing Machinery, New York, NY, USA.
      [24]
      Cecily Morrison, Edward Cutrell, Martin Grayson, Anja Thieme, Alex Taylor, Geert Roumen, Camilla Longden, Sebastian Tschiatschek, Rita Faia Marques, and Abigail Sellen. 2021. Social sensemaking with AI: Designing an open-ended AI experience with a blind child. In 2021 CHI Conference on Human Factors in Computing Systems (CHI ’21). Association for Computing Machinery, New York, NY, USA, 14 pages. https://doi.org/10.1145/3411764.3445290
      [25]
      National Academies of Sciences, Engineering and Medicine. 2021. Human-AI teaming: State of the art and research needs. The National Academies Press, Washington, DC. 142 pages. https://doi.org/10.17226/26355
      [26]
      Fritz Ostendorf and Alois Angleitner. 2004. Neo-Persönlichkeitsinventar nach Costa und McCrae: Neo-PI-R [Neo-personality inventory by Costa and McCrae]; Manual (revised ed.). Hogrefe, Göttingen, Germany.
      [27]
      Max Pellert, Clemens M. Lechner, Claudia Wagner, Beatrice Rammstedt, and Markus Strohmaier. 2024. AI psychometrics: Assessing the psychological profiles of large language models through psychometric inventories. Perspectives on Psychological Science 0, 0 (2024), 19 pages. https://doi.org/10.1177/17456916231214460
      [28]
      Beatrice Rammstedt and Oliver P. John. 2007. Measuring personality in one minute or less: A 10-item short version of the Big Five Inventory in English and German. Journal of Research in Personality 41, 1 (2007), 203–212. https://doi.org/10.1016/j.jrp.2006.02.001
      [29]
      Beatrice Rammstedt, C. J. Kemper, M. C. Klein, Constanze. Beierlein, and A. Kovaleva. 2014. Big Five Inventory (BFI-10). Zusammenstellung sozialwissenschaftlicher Items und Skalen (ZIS) (2014). https://doi.org/10.6102/zis76
      [30]
      Ben Shneiderman. 2020. Design lessons from AI’s two grand goals: Human emulation and useful applications. IEEE Transactions on Technology and Society 1, 2 (2020), 73–82. https://doi.org/10.1109/TTS.2020.2992669
      [31]
      Sinan Sonlu, Uğur Güdükbay, and Funda Durupinar. 2021. A conversational agent framework with multi-modal personality expression. ACM Transactions on Graphics 40, 1 (2021), Article 7, 16 pages. https://doi.org/10.1145/3439795
      [32]
      Samantha Straka, Martin Jakobus Koch, Astrid Carolus, Marc Erich Latoschik, and Carolin Wienrich. 2023. How do employees imagine AI they want to work with: A drawing study. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, Article 160. https://doi.org/10.1145/3544549.3585631
      [33]
      Kyoko Sugisaki and Andreas Bleiker. 2020. Usability guidelines and evaluation criteria for conversational user interfaces: a heuristic and linguistic approach. In Mensch und Computer 2020 (MuC ’20). Association for Computing Machinery, New York, NY, USA, 309–319. https://doi.org/10.1145/3404983.3405505
      [34]
      Rui Zhang, Wen Duan, Christopher Flathmann, Nathan McNeese, Guo Freeman, and Alyssa Williams. 2023. Investigating AI teammate communication strategies and their impact in human-AI teams for effective teamwork. Proceedings of the ACM on Human-Computer Interaction 7, CSCW2 (2023), Article 281, 31 pages. https://doi.org/10.1145/3610072