sensors Article How Realistic Is Threat Image Projection for X-ray Baggage Screening? Robin Riz à Porta * , Yanik Sterchi and Adrian Schwaninger Institute Humans in Complex Systems, School of Applied Psychology, University of Applied Sciences and Arts Northwestern Switzerland, 4600 Olten, Switzerland; yanik.sterchi@fhnw.ch (Y.S.); adrian.schwaninger@fhnw.ch (A.S.) * Correspondence: robin.rizaporta@fhnw.ch Abstract: At airports, security officers (screeners) inspect X-ray images of passenger baggage in order to prevent threat items (bombs, guns, knives, etc.) from being brought onto an aircraft. Because threat items rarely occur, many airports use a threat-image-projection (TIP) system, which projects pre-recorded X-ray images of threat items onto some of the X-ray baggage images in order to improve the threat detection of screeners. TIP is regulatorily mandated in many countries and is also used to identify officers with insufficient threat-detection performance. However, TIP images sometimes look unrealistic because of artifacts and unrealistic scenarios, which could reduce the efficacy of TIP. Screeners rated a representative sample of TIP images regarding artifacts identified in a pre-study. We also evaluated whether specific image characteristics affect the occurrence rate of artifacts. 24% of the TIP images were rated to display artifacts and 26% to depict unrealistic scenarios, with 34% showing at least one of the two. With two-thirds of the TIP images having been perceived as realistic, we argue that TIP still serves its purpose, but artifacts and unrealistic scenarios should be reduced. Recommendations on how to improve the efficacy of TIP by considering image characteristics are provided.   Keywords: aviation security; X-ray imaging; human–machine interaction; X-ray baggage screening; Citation: Riz à Porta, R.; Sterchi, Y.; threat image projection; visual search Schwaninger, A. How Realistic Is Threat Image Projection for X-ray Baggage Screening? Sensors 2022, 22, 2220. https://doi.org/ 10.3390/s22062220 1. Introduction As an integral part of aviation security, passenger baggage and other consignments Academic Editor: Petros Daras are screened using X-ray machines at airport-security checkpoints in order to prevent Received: 9 December 2021 threat items (bombs, guns, knives, etc.) from being brought onto an aircraft [1]. The Accepted: 3 March 2022 X-ray images are inspected by airport-security officers (screeners), which involves a visual Published: 13 March 2022 search and decision making [2]. This task is challenging for various reasons [3–5], one Publisher’s Note: MDPI stays neutral of which being that the low prevalence of threat items in X-ray images results in lower with regard to jurisdictional claims in detection due to a shift in response tendency [6–8]. Threat image projection (TIP) is published maps and institutional affil- used at many airports worldwide to increase target prevalence by projecting pre-recorded iations. images of threat items, also called fictional-threat images (FTIs), onto X-ray images of the baggage and other consignments being screened [9–11]. Moreover, with TIP, screeners receive frequent detection-performance feedback, which is otherwise missing because real threats rarely occur in practice. Performance feedback is an integral factor of job Copyright: © 2022 by the authors. motivation [12,13]. The positive effect of feedback on motivation has also been shown to Licensee MDPI, Basel, Switzerland. further translate into better cognitive performance [14,15], suggesting that TIP could also This article is an open access article increase detection performance by frequently providing feedback. TIP is also used as an distributed under the terms and operational-performance measure [9–11]. Screener responses are recorded and summarized conditions of the Creative Commons by calculating the proportion of detected TIP images (the hit rate). This hit rate is used Attribution (CC BY) license (https:// to identify screeners with insufficient performance and trigger corrective actions. For creativecommons.org/licenses/by/ example, at many airports, based on EU regulation [16], screeners who miss a minimum 4.0/). Sensors 2022, 22, 2220. https://doi.org/10.3390/s22062220 https://www.mdpi.com/journal/sensors Sensors 2022, 22, 2220 2 of 15 TIP hit rate must undergo remedial training and only resume screening after passing an X-ray-image-interpretation test. TIP is employed under the premise that TIP images look realistic. Therefore, they should not look different from the images of baggage containing real prohibited articles. However, based on interviews and observations in an ethnography study, Bassetti [17] reported that this is not always the case and that screeners recognize some TIP images because they look artificial. Depending on how often this occurs, it could constitute a serious problem as many airports worldwide use TIP scores as operational-performance measures. Moreover, TIP artifacts could also result in a security issue if screeners focus on detecting TIP artifacts instead of visually searching for prohibited articles in X-ray images of passenger baggage. We, therefore, conducted two studies to determine the prevalence of TIP artifacts. In a pre-study, we interviewed screeners to ascertain which artifacts occur in TIP images. In the main study, screeners rated a representative sample of TIP images regarding artifacts identified in the pre-study. In the main study, we also evaluated whether specific image characteristics affect the occurrence rate of artifacts. 2. Pre-Study 2.1. Method 2.1.1. Participants Nine screeners (four females and five males) participated in the pre-study. Their age was between 32 and 61 years (mean: M = 46.56, standard deviation: SD = 10.03), and they had 3 to 22 years of work experience (M = 9.00, SD = 5.68). Our studies (the pre- and main study) complied with the American Psychological Association Code of Ethics. They were approved by the institutional review board of the University of Applied Sciences and Arts Northwestern Switzerland. All pre- and main-study participants were qualified, trained, and certified according to the standards of the appropriate national authority (civil aviation administration) in compliance with the relevant EU regulation [16]. All participants provided informed consent in writing and were compensated according to their hourly rate of remuneration. 2.1.2. Procedure Participants were individually invited to participate in semi-structured interviews with open-ended and close-ended questions. We first briefly explained what TIP artifacts are and asked the screeners to freely describe which artifacts they encountered during work. Next, we went through a list of potential artifacts. This list was created through discussions with screening experts before the interviews by considering how TIP artifacts could emerge when considering FTI TIP algorithms. Each artifact was explained to the participants and we asked whether they had noticed it during work. We started with three artifacts that might emerge when the FTI TIP system selects an FTI and then positions it in an unrealistic location or orientation in the X-ray image of the baggage (Figure 1b): • Placement artifacts: The FTI is positioned such that it appears to penetrate an item in the baggage (e.g., the heel of a shoe). • Alignment artifacts: The FTI is oriented such that it is poorly aligned with the content of the baggage. For example, the FTI is oriented at a 45◦ angle, while all other items are neatly packed and horizontally oriented. In some cases, the FTI can be oriented such that it appears to float (e.g., when an FTI bomb is oriented at a 45◦ angle away from a book or laptop lying flat). • Distortion artifacts: This type of artifact refers to unrealistic distortion. It should be noted that X-ray images generally display a distorted image of the recorded items. The distortion depends on the location of an item in relation to the X-ray-beam source. The distortion of the FTI can appear unrealistic if it does not appropriately reflect the location in relation to the X-ray source. Sensors 2022, 22, x FOR PEER REVIEW 3 of 15 Participants were first asked whether the FTI was sometimes in an unrealistic posi- tion. They were then asked to specify why the position was unrealistic and whether it was because of placement or alignment artifacts. Later, they were asked about distortion arti- facts. Afterward, participants were questioned about artifacts that might emerge after the FTI had been selected and positioned. The FTI TIP system merges the FTI with the X-ray image of the baggage. For this step, the TIP system must ensure that the merged FTI has a realistic color and luminosity, which requires considering the material and density in- formation of the projected item and the overlapping items of the X-ray image. Addition- ally, the TIP system must ensure that the FTI and the X-ray baggage image are identically processed (e.g., the identical edge enhancement is applied). Participants were asked about six artifacts that could result from the improper merging of the FTI with the baggage im- age (see Figure 1c):  Color artifacts: The color of the FTI is unrealistic. For example, the color of an FTI gun looks different from the color of actual X-rays of guns.  Size artifacts: The size of the FTI is unrealistic. The FTI is too small or too large.  Resolution artifacts: The image resolution of the FTI differs from the image resolution of the other items in the TIP image.  Edges artifacts: The edges of the FTI differ from the edges of the other items in the Sensors 2022, 22, 2220 TIP image. 3 of 15  Halo artifacts: There is a lightened area surrounding the FTI like a halo. Figure 1. (a) Pre-recorded fictional-threat image. (b) Artifacts originating from an improper position- Figure 1. (a) Pre-recorded fictional-threat image. (b) Artifacts originating from an improper posi- tiionnginogf tohf ethFeT FI.T(Ic. )(cA) rAtirftaifcatsctos roirgiigniantaintigngfr forommi mimpproroppeerrm meerrggininggo off tthhee FFTTII wwiitthh tthhee XX--rraayy bbaaggggaaggee imimaaggee. .NNootete: :ththe eimimagagese swwitihth pplalcaecmemenent tanandd alailgignnmmenent taratritfiafactcst saarere rereaal limimaaggees sfrforomm ththee TTIPIP sysystsetemm ininvveestsitgigaatetded inin oouur rstsutuddyy. .InIn ththeeses eimimaaggese,s ,ththe epprorojejectcetded bboommbbs smmayay bbe eddififfificucultl ttoto idideenntitfiyfy toto ththee uunntrtarainineedd eeyyee. .TThheererefoforere, ,wwee hhigighhlilgighht tththeemm wwitihth reredd frfarammeess inin ththee fifigguurere. .TThhee imimaaggees swwitihth aartritfiafactcst s inin FFigiguurere 11cc wweerere ccrreeaatetedd uussiningg GGIMIMPP ffoorr ilillulussttrraattioionn ppuurrppoosseess. . FPinaartlilcyi,p paanrttsicwipearentfisr wstearsek aeldsow ahsketehde rwthethFTerI twhaesres owmereeti amneys iinndaicnautonrrse oalfi sTtIiPc pimosaigtieosn . oTuhtseiydew tehree itmhaengea istkseldf, tsoucshp eacsi fcyomwphuyttehr enpoiosseist iorn aw samsaulln lraega lfirsotimc athned awddhiettihoenrali tpwroa-s cbesescianugs.e of placement or alignment artifacts. Later, they were asked about distortion artifacts. Afterward, participants were questioned about artifacts that might emerge after 2t.h1.e3.F RTeIshualtds been selected and positioned. The FTI TIP system merges the FTI with the X-raWy himilea gfreeoeflyt hdeesbcargibgiangge .arFtoifracthtsi,s ssctreepe,ntehres TdiIdP nsyost tmemenmtiounst aennys uarretiftahcatt otuhtesimdee rtgheed liFsTt Iofh paostaenrteiaall iasrtitcifaccotlso (raratnifdacltusm ini nFoigsiutyre, w1)h. Ficihver eoquut iorfe snicnoen psiadretirciinpganthtse rempaotretreida lthaantd TdIPen ssoitmyeitnimfoerms iamtipornoopferthlye pporsoijteicotnesd tihtee mFTaIn, dcatuhseinogv eprllaacpepminegnti taenmds aolifgtnhme eXn-tr aayrtiimfaactgse . (oAnded pitaiortnicailplya,ntth we ThiIlPe sfyresetelym emlabuosrtaetninsgu raentdh afot uthr epFaTrtIicainpdantthse wXh-reany abskagedg asgpeecimifiacgaellya)r.e identically processed (e.g., the identical edge enhancement is applied). Participants were asked about six artifacts that could result from the improper merging of the FTI with the baggage image (see Figure 1c): • Color artifacts: The color of the FTI is unrealistic. For example, the color of an FTI gun looks different from the color of actual X-rays of guns. • Size artifacts: The size of the FTI is unrealistic. The FTI is too small or too large. • Resolution artifacts: The image resolution of the FTI differs from the image resolution of the other items in the TIP image. • Edges artifacts: The edges of the FTI differ from the edges of the other items in the TIP image. • Halo artifacts: There is a lightened area surrounding the FTI like a halo. Finally, participants were also asked whether there were any indicators of TIP images outside the image itself, such as computer noises or a small lag from the additional processing. 2.1.3. Results While freely describing artifacts, screeners did not mention any artifact outside the list of potential artifacts (artifacts in Figure 1). Five out of nine participants reported that TIP sometimes improperly positions the FTI, causing placement and alignment artifacts (one participant while freely elaborating and four participants when asked specifically). Few participants reported other artifacts. Color artifacts were mentioned by one participant while freely elaborating and by one participant when specifically asked. Resolution artifacts were reported by one participant when specifically asked. The FTI size being sometimes too small was mentioned by two participants when feely elaborating, and the FTI size Sensors 2022, 22, x FOR PEER REVIEW 4 of 15 Few participants reported other artifacts. Color artifacts were mentioned by one partici- pant while freely elaborating and by one participant when specifically asked. Resolution Sensors 2022, 22, 2220 artifacts were reported by one participant when specifically asked. The FTI size bei4nogf 15 sometimes too small was mentioned by two participants when feely elaborating, and the FTI size being sometimes too large was only mentioned by one participant. Issues with edbgeisn wgesroem meetinmtieosnetodo blya rognee wpasrtiocniplyanmt ewnhtieonn sepdebciyficoanlelyp aasrkteicdi.p Tanhte. hIaslsou easrtiwfaictht wedasg es newveerr erempoenrtteidon. eInd abdydointieonp atrot iacriptiafancttws, haennotshpeerc iffiocrmall yofa suknerdea. lTishteich TaIlPo aimrtiafgaecst wass nrev- er porretepdo:r tfeodu.r Isncraedendeitriso nmteontaiortnifeadc ttsh,aatn eovtehne rwfohremn noof uanrtriefaaclitsst iwc eTrIeP pirmesaegnets, wTIaPs irmepagoertse d: frefqouernstclrye elonoekrsedm uennrtieoanliesdticth baetceavuesne wthheeyn snhoowaretdif aacnts uwnreerealpisrteics esncte,nTaIrPioi m(saegee Fsifgrueqreu e2n).t ly Thleo omkoesdt uconmremaloisnt iccabseec raeupsoertthedey bsyh poawretidcipananutns rweaalsi sat icthsrceeant airteiom( speleacFeidg uwrehe2r)e. Tnhoe temr-ost rorciostm wmoounldc ahsiedere ipt.o Froterd exbaymppalreti, cai pgaunnt sisw palsacaetdh roena ttoitpe mof pa lsamceadllw phuersree,n wohteerrero irt iwstowuoldu ld be heiadseilyit .dFetoercteexda.m Inptleer,eastginugnlyi,s spcrlaeceenderosn altsoop roepf oarstemda tlhl aptu urnser,eawlihsetirce TitIPw iomualgdebs e(deuaesi ly to darettiefcatcetds .oIrn utenrreesatilnisgtilcy ,ssccerneaernieorss) aolcscourre mpoorrtee dofttheant wunhreena ltihseti cFTTII Pisi pmraogjeecste(ddu oenttoo alortoifsaec ts iteomrsu innr eaa tlrisatyic (sec.ge.n, atrraioyss) wocicthu ram shooree,o jfatceknewt, hweanlltehte, oFrT Ilaips tporpo)j eccotmedpoanretdo ltooo wsehietenm its iisn a protrjaecyte(ed.g o.,nttroa ya sbwagit ohra osthhoere ,pjaiecckee to,fw baalglegta,goer (lbapagto, psu) ictocmaspea, roerd rutockwshacekn)i. tWisitphr orjeegcaterdd oton to thea qbuaegsotiroont hoef rwphieectheeorf tbhaegreg awgeer(eb aang,ys iunidtciacaseto, rosr oruf cTkIsPa icmk)a.gWesi tohurtesgidared thtoe tihmeaqguee iststieolnf, of suwchh eatsh ceormthpeuretewr enroeiasnesy oinrd ai csamtoarlsl olfagT IfProimma tghees oaudtdsiitdioenthale pimroacgeessitisnegl,f ,aslul cphaartsicciopmanptust er repnooritseeds othraat stmhias lwl laasg nfrootm thteh ceaasde.d i tional processing, all participants reported that this was not the case. FigFuirgeu r2e. E2.xaEmxapmlep TleIPT IimP aimgeasg oesf uofnruenarleisatliics tsiccesncaernioasr io(as,b(a) ,ban) da nad TaIPT IiPmiamgea gdeepdiecptiincgti nag raearleisatliics tic scesncaerniaor (ico).( cT)h. eT hperopjeroctjeecdt egdungu (an) (aan) dan tdhet hperopjreocjteecdt ebdombobms b(bs ,(cb) ,acr)ea rheighhiglihglhigtehdte wdiwthi trhedre fdrafmraems.e s. 2.1.4. Discussion 2.1.4. Discussion Most screeners reported placement and alignment artifacts. Color, size, resolution, disMtoorstito snc,raenendeerds greepaortritfeadct splwaceeremoennlty arnedp oarltiegdnmbyenatf eawrt.ifDaciftfse. rCenotlopro, sssiizbel,e reexspollauntiaotnio, ns disetxoirsttioans,t aonwd heydgsoe maretiafarctitfsa wctserwe eornelyo nrelyporertpeodr tbeyd ab fyewa .f eDwiffsecrreenetn peross.sTibhlee yexmpilgahntatbieonrsa re, exiasnt dasn toot walhlyp asortmiceip aarntitfsacmtsi gwhetrhea ovnelye nrceopuonrtteedre bdyt ah efemw. Mscroesetnleikrse.l yT,hsecyre menigerhst ables oradrief,f er anidn nthote iarlla pssaerstsicmipeannttos fmwighhatt hisarveea leinstciocu. nFtoerreedx atmhepmle. ,Msoomste lipkaerlyti,c sipcraenetnsemrse anltsioo ndeidffetrh at in FthTeIsir caosusledssbmeevnet royf lawrhgaet, wis hriecahlicstainc. sFeoerm exstarmanpglee,. sHomowe epvaerrt,ictihpeasnetsit emmesntwioenreedl atrhgaet in FTrIes acloituyld(e .bge., vaemrya lcahrignee, gwuhni)c,hs ocatnh eseimema gsetrwanagser.e Haloiswtiecvdeers, pthiteesese ietmeminsg woedrde .laWrghee nina n reaolbitjyec (tea.gp.p, eaa rms aocdhdinree ggaurdni)n, gsoit sthceo liomr,argeeso wluatsio rne,asliizseti,co rdeedspgietse, sitememigihntga opdpdea. rWarhteifinc aianl to obsjeocmt aepspcreeaerns eords.dO rtehgearrsdjiundgg ietsi tctoolobre, roedsdolluotoioknin, gsiyzeet, orera eldisgtiecs.,S ict rmeeingehrts aaplpsoearre paortritfeicdiathl at to TsoIPmiem sacgreeesncearns. lOoothkeursn jruedalgiset iict tboe bcaeu osdedt hloeoykdinegp iycteta rneaulnisrteiac.l iSsctirceesnceenrsa railos,oe rveepnoriftetdh ey thaatr eTIfPre eimoafgaerst ifcaacnt slo(soeke uFnigreuarleis2ticfo breicllauusstera tthioeny sd).eSpuiccth aun nurneareliasltiisctiscc escneanraiorsiom, eigvhent, ilfi ke theayr taifraec tfsr,eere odfu acretitfhaectds e(gsereee Ftioguwrhe i2c hfoTrI Pillausstarapteiorfnosr)m. Sauncche umneraesaulirsetirce sflceecntsartihoes dmeitgehctti,o n likoef arretaiflaiscttisc, rthedreuactes t(hi.ee .d, eregdreuec etot hwehvicahli dTiItPy aosf aT pIPerafsorampaenrfcoer meaanscuerem reeafsleucrtes) tahned deshteocu- ld tiotnh eorfe froeraelisatlisco thbereaadtsd (rie.ess.,e rde.dMucoer ethoev evra, lwidhiteyt hoefr TuInPr eaasl ias tpicesrfcoernmarainocseo mcceuarsumroe)r eaonfdte n showuhlden thaenreFfToIreis aplsro jebcet eaddornetsoselodo. sMeoitreemovseirn, wa threatyhe(er. gu.n, rteraylisstwici tshceanashrioes, ojaccckuert ,mwoarell et, oftoern lwaphteonp )acno FmTpI airse pdrtoojewctheden otnhteo FloToIsies ipteromjesc itne da otrnatyo (ae.bga.,g toraryost hweirthp iae csehsooe,f jbaackgegta, ge wamlletr,i tosrf ulartphteorpi)n cvoemstipgaarteiodn t.oR welahteend tthoet hFiTs Ii sisu periosjwechtedth oernttoh eap bearcge potri onthoefra prtiiefcaecsts oafn d bagugnaregael imsteicristcse fnuarrtihoesrd ienpvenstdigsaotnioinm. aRgeelactheadr atoct ethr ist iicssuthea its hwavhetbheeern thsheo pwenrcteopitnioflnu eonf ce arttihfarcetast adnedte uctniorenaliinstXic- rsacye-nimaraiogse dinepspenecdtsio onn. iSmchawgea cnhinagraecrtetriaslt.ic[1s 8th] aidt ehnatvifie ebdeethnr seheoiwmna ge to cinhfalruaecntecrei stthicrse,awt dheictehcthioeny icno iXn-erdayim-imagaeg-be aisnesdpefacctitonrs. S(IcBhFws)aannindgaerre eitl lauls. t[r1a8te] didiennFtifgiuedre 3. thrTehea itmisa,gae tchhraeraatcittermistcicasn, bwehmichor tehoery lceossindeidffi icmualtgteo-bdaesteedc tfadcetpoersn d(IiBnFgs)o nanitds aoreie inlltuasti-on tra(teefdfe icnt oFfigvuireew 3d. Tifhfiactu ilsty, )a, threesaut pitermp ocsainti obne omfoorteh oer ilteesms ds i(fefifcfeuclt toof dsuepteecrtp doespiteionnd)i,nagn d the visual complexity of the bag (effect of bag complexity), which consists of clutter, the bag’s background unsteadiness, opacity, and the relative size of opaque areas in the bag [19]. These IBFs could have an impact on the perceived occurrence of artifacts and unrealistic scenarios. Sensors 2022, 22, x FOR PEER REVIEW 5 of 15 on its orientation (effect of view difficulty), the superposition of other items (effect of su- perposition), and the visual complexity of the bag (effect of bag complexity), which con- sists of clutter, the bag’s background unsteadiness, opacity, and the relative size of opaque Sensors 2022, 22, 2220 areas in the bag [19]. These IBFs could have an impact on the perce5iovfe1d5 occurrence of artifacts and unrealistic scenarios. Figure 3. Illustrations of three image-based factors (IBFs). Adapted from Schwaninger et al. [18]. Figure 3. Illustrations of three image-based factors (IBFs). Adapted from Schwaninger et al. [18]. 3. Main Study The prima3r. yMaaimin oSftuthdeym ain study was to determine the prevalence of the artifacts and unrealistic scenaTrhioes pidriemnatirfiye daimin tohfe tphere m-stauind ys.tuFudryt hwera,sw toe wdeatnetremditnoea tnhael ypzreevwahlenthcer of the artifacts artifacts and unarneda liustnicresacleisntaicr iossceoncacruirosle sids eonftteifniewdh ienn the FpTrIe-istpurdoyje. cFteudrtohnetro, awpei ewceaonfted to analyze baggage versuswohnettohleoro asretifteamcts ai nda turnayr,eanlidstiwc hsecethnearritohse opcecrucer ilveesds ofctceunr rwenhceeno tfhaer FtiTfaIc itss projected onto and unrealisticas pceiencaer ioofs biasgdgeapgeen vdeernsut os nonIBtoF lso. ose items in a tray, and whether the perceived occurrence of artifacts and unrealistic scenarios is dependent on IBFs. 3.1. Method 3.1.1. Participa3n.t1s. Method A total of35.11.1p. rPoafretsicsiiponanaltsc abin-baggage screeners from a European international airport (29 femalesAa ntodta2l 2ofm 51a lperso) fpesasritoicniapla ctaebdinin-btahgegamgae isncrseteundeyr.s fNroomne a oEfutrhoepmeanh aindternational air- participated in the pre-study. Their work experience ranged from 2 to 31 years (M = 8.67, port (29 females and 22 males) participated in the main study. None of them had partici- SD = 4.91). The participants’ ages ranged from 25 to 63 years (M = 44.67, SD = 11.43). pated in the pre-study. Their work experience ranged from 2 to 31 years (M = 8.67, SD = 3.1.2. Materials4.91). The participants’ ages ranged from 25 to 63 years (M = 44.67, SD = 11.43). We randomly sampled 600 TIP images (X-ray images of passenger bags with an FTI 3.1.2. Materials projected onto them) from the automated storage of 14 conventional single-view X-ray machines from the aWirep orarnt dwohmerley tshaemspcrleeden 6e0r0s TwIoPr kimeda.gTesh e(Xc-ornayte inmt aogf eths eofc apbaisnsebnaggegra bgaegs with an FTI varied over thepyreoajer.cTtehde roenfotore t,hweemr)a nfrdoomm ltyhes ealeucttoemdahtaeldf (s3t0o0raimgea goefs 1) 4o fctohnevXe-nrtaiyonimala sgiensgle-view X-ray in winter, andmthaechoitnheesr fhraolmf (t3h0e0 aiimrpaogrets w) ihnerseu mthme secrr.eTenheerTs IwPohrikterda.t eThise tcyopnitceanllty ohf itghhe ;cabin baggage previous studievsahriaevde orevpeor rttheed yveaalur.e Tshoefraebfooruet, 9w0%e r[a9n,1d1o]manlyd saeplepcrtoexdim haatlef l(y30800 %im[a10g]e.sW) oef the X-ray im- found an averaaggeesT iInP whiitnrtaetre, aonfd8 8t%hea ottthheer ahiraplfo (r3t0f0ro imawghesic) hinw seusmameprl.e dThoeu Tr IiPm ahgite rsa. te is typically A high TIP hithriagthe; cparnevreiosuuslt sitnudloiews shtaavties triecpalorptoewd evralwuehse nofc aobmopuut t9i0n%g c[o9,r1r1e]la atniodn as.ppToroximately 80% address this is[s1u0e],. mWies sfeodunTdIP anim aavgeerasgwe eTrIePo hviet rrsaatme opfl e8d8.%A att othtael aoifrp7o5r%t forfomth ewihmicahg ewse sampled our were sampled firmomaghesit.s Aa nhdig2h5 %TI(Pc ohmitp raarteed ctaon1 r0e%su) lftr oinm lomwis ssetas,titsottiaclailn pgo4w50erd ewtehcetned coanmdputing correla- 150 missed TIPtiiomnasg. eTsot hadatdwreesrse tuhsise dissinueth, emmisaseind sTtuIPd yim. ages were oversampled. A total of 75% of the images were sampled from hits and 25% (compared to 10%) from misses, totaling 450 3.1.3. Procedurdeetected and 150 missed TIP images that were used in the main study. Participants attended two sessions that were two weeks apart with five to six screeners per session. Th3e.1fi.r3s.t Psreosscieodnubree gan with an hour-long introduction with visual illustrations of the different artiPfacrttsiciipnaonrtds earttteondimedp atwrtot hsesusinodnes rtlhyaint gwecroen tcwepot sw. eTekhsis apacatritv wityithw faivse to six screen- followed by aneerxsp plaenr asteisosnioonf.h Tohwe tfoirusts esetshseiorant ibneggtaono lwainthd asinx hporuacrt-ilcoentgr iianlstr.oPdaurtcitciiopna nwtsith visual illus- then rated TIPtirmataiogness ofof rthtwe doihffoeurerns.t Tarhteifyacctosn itni nourdederr taoti inmgptharet itmhea guensdfeorrlyainogt hcoenrctewpots. This activity hours during thweasse fcoolnlodwseedss bioyn a. nD uexripnlganbaottihons eosfs ihoonws, ptoa rutsicei pthaen trsartaintegd toaoslm aanndy siimx pagraecstice trials. Par- as they could taincidpawnetsr ethneont riantceedn tTivIPiz iemdatgoesr ufoshr .twToh ehyouwres.r eThinesyt rcuocntteidnutoedt arkateinbgr etahkes images for an- when they feltoftahteigr utewdo. hXo-ruarys idmuarignegs wtheer esedcoisnpdla syeesdsioonn. 2D4uinricnhg Sbaomths usnegssSio2n4sE, 6p5a0rBtiWcipants rated as monitors undemr annoyr mimalagliegsh atisn gthceoyn cdoiutilodn asnadt awderiest annoct einocfenaptipvrizoexdim toa treulysh6.0 Tchmey. wTheere instructed to images coveredtaakpep brorexaimksa twelhyetnw toh-etyh irfedlst ofafttihgeuescdr.e Xen-rcaoyr riemsapgoensd winegrteo daisvpislauyaelda nognle 2o4f inch Samsung about 28 × 30 dSe2g4rEe6e5s0.BTWhe mpaorntiictoiprsa nutnsdweer rneorramndaol mliglyhtsipngli tcionntodiftoiounrsg arot uap dsisatsaintcwe aosf naoptproximately 60 practically feasible for each screener to rate all 600 images. Each group rated a different set of 150 images, with the order of the images randomly sampled for each screener. After having rated all images of their group, the participants proceeded to rate the images of other groups if the sessions were not yet over. Ultimately, each of the 600 TIP images was rated by at least 12 participants. Sensors 2022, 22, 2220 6 of 15 3.1.4. Measures To collect the ratings from the participants, we developed an app using R-shiny [20] that consecutively displayed the TIP images. Participants could press a button to fade-in a red frame around the FTI if they were uncertain as to which threat item was projected. We used a seven-point scale for all ratings. The rating questions and anchors are shown in Table 1. For the artifacts that were mentioned by only a few screeners in the pre-study (distortion, color, resolution, size, and edges artifacts), the screeners were instructed to rate these artifacts if they thought they were present in an X-ray image. The absence of a rating was coded as 1. The ratings for all other image characteristics were mandatory. The screeners also had to provide other ratings that were not relevant to our study and reported in a conference proceeding [21]. Table 1. Image characteristics, rating questions, and anchors. Rating questions and anchors were pre- sented to participants in German and have been translated into English for the readers’ convenience. Characteristics Ratings Anchors Artifact Artificial in general The X-ray image looks unrealistic because Totally disagree (1)the FTI looks artificial. Totally agree (7) Placement The location of the FTI is unlikely. “” Alignment The alignment of the FTI seems artificial. “” Other reasons why the FTI seems artificial . . . Artifact Color . . . due to unrealistic color “” Size . . . due to unrealistic size “” Resolution . . . due to unrealistic resolution “” Distortion . . . due to unrealistic distortion “” Edges . . . due to unrealistic edges “” Unrealistic Scenario The TIP image looks unrealistic because thescenario is unrealistic. “” IBF FTI view difficulty Difficulty to recognize the threat item in Very easy (1)the depicted orientation Very difficult (7) Superposition Superposition of the FTI by other items Very low (1)Very high (7) Bag complexity clutter Clutter in the baggage Very low (1)Very high (7) Bag complexity opacity Proportion of the image that is opaque Very small (1)Very large (7) The FTI view difficulty is defined as the difficulty to recognize the threat item in the depicted orientation independently of the X-ray baggage image [18]. However, the study participants had to rate the FTI view difficulty based on the TIP image with the FTI often superimposed by other items, which might have distorted the rating. Inspired by Schwaninger et al. [22], we therefore calculated an additional measure of FTI view difficulty: the rate at which the FTI was missed across all TIP images displayed at the airport for a year (Equation (1)). The FTI view difficulty was calculated based on the airport’s TIP reports and from at least 29 TIP events per FTI (M = 375.53, SD = 194.27). N FTIvd = Misses (1) NProjections Equation (1). FTI view difficulty (FTIvd) equals the number of TIP events the FTI was missed (NMisses) divided by the number of TIP events the FTI was projected (NProjections). To determine whether the FTI was projected onto a piece of baggage or onto loose items in a tray, screening experts who did not participate in the pre- or main study inspected and categorized the TIP images. Sensors 2022, 22, 2220 7 of 15 3.1.5. Analyses For each X-ray image, the mean ratings across screeners were calculated and then rounded to the nearest integer. In the next section regarding results, the relative frequencies of these averaged ratings are reported. The ratings were corrected for the oversampling of missed TIP images (see Section 3.1.2). Finally, to measure how strongly the screeners agreed with each other, we calculated the inter-rater reliability using intraclass correlation coefficients (ICCs, [23]). 3.2. Results The ICCs indicated good to excellent [24] inter-rater reliabilities for the following image characteristics: artificial in general (0.72), placement artifacts (0.85), alignment artifacts (0.75), unrealistic scenario (0.84), FTI view difficulty (0.89), superposition (0.93), clutter (0.94), and opacity (0.89). Most images were not rated to show any artifacts regarding the FTI’s appearance (size, resolution, color, and edges) or any distortion artifacts. In the rare cases when a screener rated an image to show one of these artifacts, they were largely alone in their rating. Unsurprisingly, the ICCs indicated low [24] inter-rater reliability for these artifacts: color (0.26), size (0.23), resolution (0.27), distortion (0.04), and edges (0.12). Figure 4 shows how the TIP images were rated regarding artifacts and other image characteristics. In the figure, each bar is divided into seven colored sub-bars displaying the share of TIP images that received the respective rating. A total of 17% of the images received a rating indicating that placement artifacts were present (ratings of five or higher, Figure 4a), whereas 83% were rated neutral (rating four) or more toward the absence of a placement artifact (ratings one to three); 15% of the images received a rating indicating that alignment artifacts were present (Figure 4a). No image was rated to contain the artifacts that were only mentioned by a few screeners in the pre-study (color, size, resolution, distortion, and contour artifacts) with an average rating of five or above (Figure 4b). These artifacts were therefore excluded from further analyses. The screeners reported that 26% of the TIP images depicted an unrealistic scenario (ratings of five or higher, Figure 4c) and that 14% looked artificial in general. Figure 4d shows how the TIP images were rated regarding IBFs. Of all the TIP images, 42% were X-ray images with FTIs projected onto a piece of baggage (bag, suitcase, or backpack), and 58% were X-ray images with FTIs projected onto loose items in a tray. The two types of images were rated differently, as can be seen in Figure 5. In Table 2, the prevalence of different combinations of artifacts and unrealistic scenarios is reported. While 65% of all the TIP images with FTIs projected onto loose items in a tray showed at least either an artifact or an unrealistic scenario, this was only the case for 8% of all the TIP images with FTIs projected onto a piece of baggage. For the TIP images with FTIs projected onto a piece of baggage, Table 3 depicts how IBFs correlate with placement and alignment artifacts and unrealistic scenarios. As can be seen, placement and alignment artifacts as well as unrealistic scenarios occur less frequently for higher values of IBFs. Sensors 2022, 22, x FOR PEER REVIEW 8 of 15 For the TIP images with FTIs projected onto a piece of baggage, Table 3 depicts how IBFs correlate with placement and alignment artifacts and unrealistic scenarios. As can be seen, placement and alignment artifacts as well as unrealistic scenarios occur less fre- Sensors 2022, 22, 2220 8 of 15 quently for higher values of IBFs. Figure 4. Results of the main study displayed as stacked bar charts. Each bar is divided into seven Figure 4. Results of the main study displayed as stacked bar charts. Each bar is divided into seven sub-bars displaying the share of TIP images that received the respective rating (averaged across sub-bars displayingp tahrteic isphanatrs)e. Roefg aTrdIPin gimtheapgreesse ntcheaotf arreticfaecitvseadnd tuhnere arleisstpicesccetnivareio sra(at–icn),gp a(rativciepraangtsesdel eactcerdoss participants). Regardraitninggs t1h–3e topirnedsiceantecteh aot fth eayrtdiifsaagcrtese datnhdat aurtnifraectaslwisetriecp srecseennta, r5–io7sto (iand–icc)a,t epthaartttihceiypaagnretesd se- lected ratings 1–3 to tihnadt aicrtaiftaec ttshwaetr ethpereys ednits, angdre4etodi nthdiacta taertthiaftatchtesy werreeu pndrecsiednedt,. R5e–g7a rtdoi ningdIBiFcsa(tde) ,trhaatitn gths ey agreed that artifacts1 w–7ererper epsreentsleonwt,t oahnigdh 4d etgor eiensdoficFaTtIev itehwadt iftfihceuylty w, suepreer puonsidtioenc,idaneddb.a Rg ecogmaprldexinityg inIBteFrsm s(d), ratings 1–7 represenot flcoluwtt etroa nhdigohpa dcietyg. rPeerecse notaf gFeTnIu mvibeewrs bdeliofwfic5u%ltayre, nsoutpdeisrpplaoyseidt.ion, and bag complexity in terms of clutter and opacity. Percentage numbers below 5% are not displayed. Sensors 2022, 22, x FOR PEER REVIEW 9 of 15 Sensors 2022, 22, 2220 9 of 15 FigureF i5g.u Rree5s.uRltess uoltfs tohfeth me maianin ssttudyyf ofrotrh tehTeI PTiImPa gimesawgitehs thweiFthTI tihnea FpiTecIe ionf aba pggieacge aonfd bthaegTgIaPge and the TIP imiamgaegse swwitihth tthhe FTII onnl olooseseit eitmesmins iant raa yt,radyis,p dlaiysepdlasyepeadr asteeplya. rEaatcehlyb.a Er aiscdhi vbiadre disi dntiovsideveedn into seven sub-basrusb -dbiasrps ldaiyspinlagy itnhget hsehsahraer eoof fTTIIP iimaaggesesth tahtaret creeivceedivtehde rtehspee crteisvpe ercattiinvge (aravetirnagge d(aavcerorsasged across particippaarntictsip).a nRtes)g. aRredgianrdgi ntgheth pe rperseseennccee ooff aartritfiafcatcstasn danudn rueanlirsetiaclsiscetinca rsicoesn(aa–rci)o,sp a(rat–iccip),a pntasrutisceidpants used ratingsr a1ti–n3g sto1 –i3ndtoicinadtiec attheatth atht tehye yddisisaaggrreeeeddt hthataat ratirftaicftascwtse rwe perrees epnrte, 5s–e7ntto, 5in–d7i ctaote inthdaitctahteey tahgarete tdhey agreed that artthiafat carttsi fwacetsrwe eprerepsreesnetn, ta, anndd 44 ttooi ninddiciacteatteh atththaet ythweeyre wunedreec iudnedd.eRceigdaerddi.n RgeIBgFasr(ddi)n, rga tIiBngFsso (fd), ratings 1–7 represent low to high degree of FTI view difficulty, superposition, and bag complexity in terms of of 1–7 represent low to high degree of FTI view difficulty, superposition, and bag complexity in clutter and opacity. Percentage numbers below 5% are not displayed. terms of clutter and opacity. Percentage numbers below 5% are not displayed. Sensors 2022, 22, 2220 10 of 15 Table 2. Prevalence of artifacts, unrealistic scenarios, and combinations calculated both overall and separately for TIP images with FTIs projected onto a piece of baggage (bag, suitcase, or backpack) versus TIP images with FTIs projected onto loose items in a tray. Overall FTI in Piece of Baggage FTI on Loose Items in a Tray Placement Artifact 17% 3% 37% Alignment Artifact 15% 6% 25% Any Artifact 1 24% 7% 45% Unrealistic Scenario 26% 3% 56% Any Artifact or Unrealistic Scenario 34% 8% 65% 1 At least one artifact rated to be present. Table 3. Means (M), standard deviations (SD), Pearson correlations, and their 95% confidence intervals (in brackets) of ratings of placement and alignment artifacts, unrealistic scenario, and image-based factors for TIP images with the FTI in a piece of baggage. Variable M SD 1 2 3 4 5 6 1. Placement Artifact 2.13 0.88 2. Alignment Artifact 2.84 0.98 0.64 ** [0.57, 0.69] 3. Unrealistic Scenario 2.66 0.89 0.81 ** 0.69 ** [0.77, 0.84] [0.64, 0.74] 4. FTI View Difficulty (Rated) 2.98 1.28 −0.37 ** −0.48 ** −0.53 ** [−0.46, −0.28] [−0.56, −0.40] [−0.60, −0.45] 5. FTI View Difficulty (TIP-Reports) 0.10 0.11 −0.17 ** −0.29 ** −0.29 ** 0.48 ** [−0.27, −0.07] [−0.39, −0.20] [−0.39, −0.20] [0.39, 0.55] 6. Superposition (log) 0.78 0.47 −0.29 ** −0.32 ** −0.37 ** 0.69 ** 0.08 [−0.38, −0.19] [−0.41, −0.22] [−0.45, −0.27] [0.63, 0.74] [−0.03, 0.18] 7. Bag Complexity (log) 0.81 0.45 −0.23 ** −0.18 ** −0.29 ** 0.50 ** −0.01 0.73 ** [−0.33, −0.13] [−0.28, −0.08] [−0.38, −0.19] [0.42, 0.57] [−0.11, 0.10] [0.67, 0.77] ** p < 0.01. 3.3. Discussion No TIP image received an average rating indicating the presence of artifacts related to the FTI’s appearance (color, size, resolution, and edges) or distortion. If a participant rated an image to depict one of these artifacts, they mostly stood alone, confirming the finding of the pre-study that these artifacts are very rare or at least not perceived by most screeners. However, placement- and alignment-artifact ratings indicate that 17% of the TIP images contained placement artifacts, and almost as many images contained alignment artifacts (15%). A total of 24% were rated to contain at least one of the two artifacts and 26% of the TIP images were rated to depict an unrealistic scenario. Overall, 34% of the images were rated to depict an artifact or an unrealistic scenario. Fewer images were rated as artificial looking in general (14%) than as containing artifacts (24%), which at first might seem contradictory but could be due to a different focus. When screeners evaluated whether a TIP image looked artificial in general (which was asked first), the focus was likely broad, and no artifacts might have been apparent. When the screeners subsequently evaluated specific artifacts, the focus was narrowed, and the artifacts may have become evident. Our results suggest that artifacts and unrealistic scenarios occur less often in images for which the FTI is projected onto a bag or other pieces of baggage. Only 7% of these Sensors 2022, 22, 2220 11 of 15 images were rated to contain artifacts, only 3% to depict unrealistic scenarios, and only 8% were rated to contain artifacts or depict unrealistic scenarios. It is quite understandable that FTIs projected onto loose items were much more-often assessed as an unrealistic scenario, as it seems unlikely that terrorists would not try to hide a threat more effectively. Our findings also suggest that there are more degrees of freedom in merging an FTI with a piece of baggage without the placement or alignment looking unrealistic. TIP images with the FTI merged with a piece of baggage were rated to be more realistic when the FTI was superimposed by other items, when the baggage had high complexity, or when the view difficulty of the FTI was high. In an X-ray image, it is often impossible to see which objects are above or below each other. Images with higher complexity and superposition can therefore provide more degrees of freedom for realistic placement and alignment. Multiple possible explanations exist as to why FTIs with high view difficulty produce fewer artifacts. View difficulty is associated with the orientation of the FTI [18,25]. It also differs between different types of threats (bombs, guns, knives, etc.) [26–28]. Both of these aspects might, in turn, affect the prevalence of artifacts. The FTI view difficulty was evaluated based on two different approaches: based on ratings or on the miss rate across all the TIP images with the same FTI. While the FTI view difficulty is defined as the difficulty of the FTI independently of the baggage image, the rated FTI view difficulty was correlated with superposition. Thus, raters perceived FTI view difficulty to be higher when superposition was higher, possibly because this made the FTI more difficult to detect. Therefore, it is not optimal to estimate view difficulty based on the rating of individual TIP images. Instead, the mean FTI-view-difficulty rating across several baggage images with different degrees of superposition should be taken, as has been done in previous studies [19,27–29]. Alternatively, the miss rate across many TIP images can serve as an estimate if sufficient TIP data is available [19,27,28]. 4. General Discussion Screeners visually inspect X-ray images of passenger baggage for prohibited arti- cles, many of which rarely appear in reality. As rare targets are more challenging to detect, e.g., [6–8,30,31], threat image projection (TIP) could offer a solution by inserting pre- recorded images of threat items (fictional-threat images; FTIs) into randomly selected X-ray images of passenger baggage. As explained in the introduction, TIP has been associated with additional benefits, including the potential to increase motivation and performance by providing regular feedback, and measuring performance on the job. Bassetti [17] reported that screeners recognize some TIP images to be artificial, which could impair the efficacy of TIP. This study aimed to determine the prevalence of TIP-image artifacts. In the pre-study, we interviewed screeners about artifacts they encountered at work. The interviews also revealed that some TIP images are unrealistic because they display unrealistic scenarios. This refers to threat placements that terrorists would not use because they would be easy to find. In the main study, screeners rated a sample of 600 TIP images regarding TIP artifacts and unrealistic scenarios. Further, we evaluated whether certain image characteristics affect the occurrence of artifacts and unrealistic scenarios. 4.1. Prevalence of Artifacts and Unrealistic Scenarios in TIP Images A total 34% of TIP images were considered unrealistic because of artifacts or unrealistic scenarios, 24% were considered to contain an artifact, and 26% to display an unrealistic scenario. The main concerns regarding artifacts were unrealistic placement (17% of the TIP images) and unrealistic alignment of the FTI (15% of the TIP images). Other artifacts were rarely reported in the pre-study and were rarely rated to occur in the main study (unrealistic color, size, resolution, distortion, and edges of the FTI). Does the presence of artifacts and unrealistic scenarios impair the assumed benefits of TIP? Its most important application is to increase the frequency of rare threat items in order to enhance their detection [6–8]. This application relies on screeners detecting threat items rather than artifacts. Our results suggest that this requirement is still fulfilled for the majority of TIP images as 76% of the Sensors 2022, 22, 2220 12 of 15 images were rated to be free of artifacts. Another application of TIP is to use the TIP hit rate for continuous performance evaluation [9–11]. If screeners do not achieve a minimum hit rate, they must undergo additional computer-based training and testing before they can resume screening duties [16]. As images with artifacts or unrealistic scenarios should generally be easier to detect than actual threats, the TIP hit rate is likely to overestimate the hit rate for real threats in real scenarios. The validity of TIP for screener comparison is supported by the finding that the better the screeners perform in TIP, the better they perform in a certification test that evaluates their threat-detection ability [32]. TIP hit rates should therefore be suitable as a performance measure for ergonomic or human-factors studies, which depend on the comparison of performances between screeners rather than on absolute performance [9–11]. In summary, we derive from our results that TIP is still useful despite the presence of TIP artifacts or unrealistic scenarios in about one third of the TIP images. However, the number of unrealistic images should certainly be reduced in order to increase the efficacy of TIP. With fewer unrealistic images, screeners would also perceive TIP hit rates as more-valid feedback on their job performance, which would likely make TIP more motivating. This assumption is consistent with work and psychology models that emphasize the importance of feedback for motivation and performance [12,13]. 4.2. How to Reduce Artifacts and Unrealistic Scenarios Our results suggest that artifacts and unrealistic scenarios occur less often in TIP images where the FTI is projected onto pieces of baggage (bags, suitcases, and backpacks). Only 7% of these TIP images were rated to contain artifacts and only 3% to depict unrealistic scenarios. Furthermore, 92% neither contained artifacts nor depicted unrealistic scenarios. In this investigation, TIP projected approximately one half of the FTIs onto pieces of baggage and the other half onto loose items such as shoes, jackets, wallets, keys, and laptops. Hence, TIP can be effectively enhanced by projecting threat images more frequently onto pieces of baggage. Our results further suggest that TIP images are more realistic when the IBFs view difficulty, superposition, and complexity are medium to high. The TIP system would require an algorithm to distinguish X-ray images of pieces of baggage from X-ray images of trays with loose items in them, and to estimate the IBFs of the TIP images to achieve fewer unrealistic TIP images; the latter, for example, with the algorithm developed by Bolfing et al. [27]. However, TIP should still project some FTIs onto images with loose items in a tray and lower IBFs in order to incentivize screeners to direct their attention towards these images. Otherwise, real threats might be missed due to a lack of focus. Another solution is offered by a different approach to TIP, in which baggage images and FTIs are combined in advance to create combined threat images (CTIs, [9]). These CTIs can be quality controlled to ensure that they do not exhibit any artifacts and show realistic scenarios. However, because CTIs are based on pre-recorded X-ray images of baggage, they can only be used in environments where screeners cannot directly see that the screened baggage does not correspond to the one shown on the TIP image. 4.3. Limitations and Future Research This study analyzed a common FTI TIP system for a single-view X-ray machine that is used at many airports. However, other FTI TIP systems are also used, which may differ regarding artifacts. Furthermore, we used TIP images from one airport. Baggage images from other airports might differ. For example, other airports may have more or fewer pas- sengers with complex baggage. Future research should investigate artifacts and unrealistic scenarios in X-ray images from other TIP systems and airports. Newer X-ray machines show multiple views of the screened passenger baggage from different angles [33,34] or 3D-rotatable computed-tomography (CT) images [35]. This enhancement could provide further solutions for reducing artifacts in TIP. For example, based on the different views, a three-dimensional model of the baggage and its contents can be reconstructed [36–38] and used by the TIP system to find suitable positions and orientations for merging FTIs [39,40]. Sensors 2022, 22, 2220 13 of 15 Our study is further limited by having screeners rate whether TIP images displayed arti- facts. Therefore, we could only investigate artifacts that screeners could explicitly recognize. However, TIP images might also contain artifacts that screeners cannot consciously identify. TIP images could display artifacts that are not consciously perceived but subconsciously affect detection. Additionally, screeners only rarely see actual threat items (especially im- provised explosive devices) and might not know precisely what they look like in an X-ray image. Therefore, they might have to infer from regular items (e.g., laptops, food, jackets, and keys) to determine whether rare threat items look artificial. This method is likely to work well for most artifacts (e.g., when the FTI is unrealistically aligned or when the FTI has different edges). However, it may not work for color artifacts because the color of the FTI could be off but may look realistic when solely compared to the color of regular items. 5. Conclusions Many airports use TIP to improve and measure the detection performance of screeners. The underlying assumption is that TIP is realistic. However, our study reveals that screeners consider every third TIP image unrealistic. While, these images are unlikely to render TIP ineffective, TIP systems should be improved through the more-frequent placement of FTIs inside actual pieces of baggage rather than onto loose items in a tray and more-often projecting FTIs onto baggage images with higher superposition and bag complexity. Author Contributions: Conceptualization, R.R.à.P., Y.S. and A.S.; methodology, R.R.à.P., Y.S. and A.S.; software, R.R.à.P.; validation, Y.S. and A.S.; formal analysis, R.R.à.P. and Y.S.; investigation, R.R.à.P. and Y.S.; resources, R.R.à.P. and Y.S.; data curation, R.R.à.P. and Y.S.; writing—original draft preparation, R.R.à.P., Y.S. and A.S.; writing—review and editing, Y.S. and A.S.; visualization, R.R.à.P.; supervision, Y.S. and A.S.; project administration, R.R.à.P., Y.S. and A.S.; funding acquisition, Y.S. and A.S. All authors have read and agreed to the published version of the manuscript. Funding: This research was funded by the Swiss Federal Office of Civil Aviation and by the University of Applied Sciences and Arts Northwestern Switzerland. Institutional Review Board Statement: The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board of the School of Ap- plied Psychology of the University of Applied Sciences Northwestern Switzerland (protocol code 7; 19 September 2017). Informed Consent Statement: Informed consent was obtained from all subjects involved in the study. Acknowledgments: We want to thank Marius Latscha for his support in participant acquisition and in conducting the rating study. Conflicts of Interest: The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results. References 1. Harris, D.H. How to Really Improve Airport Security. Ergon. Des. 2002, 10, 17–22. [CrossRef] 2. Koller, S.M.; Drury, C.G.; Schwaninger, A. Change of search time and non-search time in X-ray baggage screening due to training. Ergonomics 2009, 52, 644–656. [CrossRef] [PubMed] 3. Biggs, A.T.; Mitroff, S.R. Improving the efficacy of security screening tasks: A review of visual search challenges and ways to mitigate their adverse effects. Appl. Cogn. Psychol. 2015, 29, 142–148. [CrossRef] 4. Schwaninger, A. Threat Image Projection: Enhancing performance? Aviat. Secur. Int. 2006, 13, 36–41. 5. Donnelly, N.; Muhl-Richardson, A.; Godwin, H.J.; Cave, K.R. Using eye movements to understand how security screeners search for threats in x-ray baggage. Vision 2019, 3, 24. [CrossRef] [PubMed] 6. Buser, D.; Sterchi, Y.; Schwaninger, A. Why stop after 20 minutes? Breaks and target prevalence in a 60-minute X-ray baggage screening task. Int. J. Ind. Ergon. 2020, 76, 102897. [CrossRef] 7. Godwin, H.J.; Menneer, T.; Cave, K.R.; Donnelly, N. Dual-target search for high and low prevalence X-ray threat targets. Vis. Cogn. 2010, 18, 1439–1463. [CrossRef] 8. Wolfe, J.M.; Horowitz, T.S.; Van Wert, M.J.; Kenner, N.M.; Place, S.S.; Kibbi, N. Low Target Prevalence Is a Stubborn Source of Errors in Visual Search Tasks. J. Exp. Psychol. Gen. 2007, 136, 623–638. [CrossRef] [PubMed] Sensors 2022, 22, 2220 14 of 15 9. Hofer, F.; Schwaninger, A. Using threat image projection data for assessing individual screener performance. WIT Trans. Built Environ. 2005, 82, 417–426. [CrossRef] 10. Skorupski, J.; Uchroński, P. A Human Being as a Part of the Security Control System at the Airport. Procedia Eng. 2016, 134, 291–300. [CrossRef] 11. Meuter, R.F.I.; Lacherez, P.F. When and Why Threats Go Undetected: Impacts of Event Rate and Shift Length on Threat Detection Accuracy during Airport Baggage Screening. Hum. Factors 2016, 58, 218–228. [CrossRef] [PubMed] 12. Hackman, R.; Oldham, G.R. Motivation through the design of work: Test of a theory. Organ. Behav. Hum. Perform. 1976, 16, 250–279. [CrossRef] 13. Humphrey, S.E.; Nahrgang, J.D.; Morgeson, F.P. Integrating Motivational, Social, and Contextual Work Design Features: A Meta-Analytic Summary and Theoretical Extension of the Work Design Literature. J. Appl. Psychol. 2007, 92, 1332–1356. [CrossRef] [PubMed] 14. Roach, G.D.; Lamond, N.; Dawson, D. Feedback has a positive effect on cognitive function during total sleep deprivation if there is sufficient time for it to be effectively processed. Appl. Ergon. 2016, 52, 285–290. [CrossRef] 15. Eckner, J.T.; Chandran, S.K.; Richardson, J.K. Investigating the role of feedback and motivation in clinical reaction time assessment. PM R 2011, 3, 1092–1097. [CrossRef] [PubMed] 16. European Commission. Commission Implementing Regulation (EU) 2015/1998 of 5 November 2015 Laying down Detailed Measures for the Implementation of the Common Basic Standards on Aviation Security; L 299; Publication Office of the European Union: Luxembourg, 2015; pp. 1–142. 17. Bassetti, C. Airport security contradictions: Interorganizational entanglements and changing work practices. Ethnography 2017, 19, 288–311. [CrossRef] 18. Schwaninger, A.; Hardmeier, D.; Hofer, F. Aviation Security Screeners Visual Abilities & Visual Knowledge Measurement. IEEE Aerosp. Electron. Syst. 2005, 20, 29–35. [CrossRef] 19. Schwaninger, A.; Michel, S.; Bolfing, A. A Statistical Approach for Image Difficulty Estimation in X-ray Screening Using Image Measurements. In Proceedings of the 4th Symposium on Applied Perception in Graphics and Visualization, Tübingen, Germany, 25–27 July 2007. 20. Chang, W.; Cheng, J.; Allaire, J.; Xie, Y.; McPherson, J. Shiny: Web Application Framework for R. R Package Version 1.2.0. Available online: https://cran.r-project.org/package=shiny (accessed on 10 February 2019). 21. Riz à Porta, R.; Sterchi, Y.; Schwaninger, A. Examining Threat Image Projection Artifacts and Related Issues : A Rating Study. In Proceedings of the 2018 International Carnahan Conference on Security Technology (ICCST), Montreal, QC, Canada, 22–25 October 2018; pp. 1–4. 22. Schwaninger, A.; Michel, S.; Bolfing, A. Towards a model for estimating image difficulty in X-ray screening. In Proceedings of the 39th Annual 2005 International Carnahan Conference on Security Technology, Las Palmas, Spain, 11–14 October 2005; pp. 185–188. 23. Mcgraw, K.O.; Wong, S.P. Forming Inferences about Some Intraclass Correlation Coefficients Forming Inferences about Some Intraclass Correlation Coefficients. Psychol. Methods 1996, 1, 30–46. [CrossRef] 24. Cicchetti, D.V. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol. Assess 1994, 6, 284–290. [CrossRef] 25. Hardmeier, D.; Hofer, F.; Schwaninger, A. The X-ray object recognition test (X-ray ORT)—A reliable and valid instrument for measuring visual abilities needed in X-ray screening. In Proceedings of the 39th Annual 2005 International Carnahan Conference on Security Technology, Las Palmas, Spain, 11–14 October 2005. [CrossRef] 26. Koller, S.M.; Hardmeier, D.; Michel, S.; Schwaninger, A. Investigating training, transfer and viewpoint effects resulting from recurrent CBT of X-Ray image interpretation. J. Transp. Secur. 2008, 1, 81–106. [CrossRef] 27. Bolfing, A.; Halbherr, T.; Schwaninger, A. How Image Based Factors and Human Factors Contribute to Threat Detection Performance in X-Ray. In Proceedings of the HCI and Usability for Education and Work, 4th Symposium of the Workgroup Human-Computer Interaction and Usability Engineering of the Austrian Computer Society, Graz, Austria, 20–21 November 2008; Volume 5298, pp. 419–438. 28. Schwaninger, A.; Bolfing, A.; Halbherr, T.; Helman, S.; Belyavin, A.; Hay, L. The impact of image based factors and training on threat detection performance in X-ray screening. In Proceedings of the 3rd International Conference on Research in Air Transportation, Fairfax, VA, USA, 1–4 June 2008; pp. 317–324. 29. Bolfing, A.; Michel, S.; Schwaninger, A. Assessing image difficulty in x-ray screening using image processing algorithms. In Proceedings of the 2nd International Conference on Research IN Air Transportation, Belgrade, Serbia, 24–28 June 2006; pp. 253–258. 30. Wolfe, J.M.; Brunelli, D.N.; Rubinstein, J.; Horowitz, T.S. Prevalence effects in newly trained airport checkpoint screeners: Trained observers miss rare targets, too. J. Vis. 2013, 13, 33. [CrossRef] [PubMed] 31. Wolfe, J.M.; Horowitz, T.S.; Kenner, N. Rare targets are often missed in visual search. Nature 2005, 435, 439–440. [CrossRef] 32. Hardmeier, D.; Schwaninger, A. Visual Cognition Abilities in X-ray Screening. In Proceedings of the 3rd International Conference on Research in Air Transportation, ICRAT, Fairfax, VA, USA, 1–4 June 2008; pp. 1–4. 33. von Bastian, C.C.; Schwaninger, A.; Michel, S. Do multi-view X-ray systems improve X-ray image in- terpretation in airport security screening? Z. Arbeitswiss. 2008, 62, 165–173. Sensors 2022, 22, 2220 15 of 15 34. Mendes, M.; Schwaninger, A.; Michel, S. Can laptops be left inside passenger bags if motion imaging is used in X-ray security screening? Front. Hum. Neurosci. 2013, 7, 654. [CrossRef] 35. Hättenschwiler, N.; Mendes, M.; Schwaninger, A. Detecting Bombs in X-ray Images of Hold Baggage: 2D Versus 3D Imaging. Hum. Factors 2019, 61, 305–321. [CrossRef] [PubMed] 36. Riffo, V.; Godoy, I.; Mery, D. Handgun Detection in Single-Spectrum Multiple X-ray Views Based on 3D Object Recognition. J. Nondestruct. Eval. 2019, 38, 66. [CrossRef] 37. Flitton, G.; Mouton, A.; Breckon, T.P. Object classification in 3D baggage security computed tomography imagery using visual codebooks. Pattern Recognit. 2015, 48, 2489–2499. [CrossRef] 38. Flitton, G.; Breckon, T.P.; Megherbi, N. A comparison of 3D interest point descriptors with application to airport baggage object detection in complex CT imagery. Pattern Recognit. 2013, 46, 2420–2436. [CrossRef] 39. Wang, Q.; Megherbi, N.; Breckon, T.P. A reference architecture for plausible Threat Image Projection (TIP) within 3D X-ray computed tomography volumes. J. X-ray Sci. Technol. 2020, 28, 507–526. [CrossRef] [PubMed] 40. Megherbi, N.; Breckon, T.P.; Flitton, G.T.; Mouton, A. Fully automatic 3D Threat Image Projection: Application to densely cluttered 3D Computed Tomography baggage images. In Proceedings of the 2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA), Istanbul, Turkey, 15–18 October 2012; pp. 153–159. [CrossRef]