Wessel Kraaij
Professor of Applied data analytics
- Name
- Prof.dr.ir. W. Kraaij
- Telephone
- +31 71 527 5778
- w.kraaij@liacs.leidenuniv.nl
- ORCID iD
- 0000-0001-7797-619X
Wessel Kraaij is a member of the interdisciplinary research programme Society, Artificial Intelligence and Life Sciences (SAILS). He is an expert in dealing with unstructured information, be it text, video or sensor data. He is developing new methods and models to organize data, search and recommend data relevant for a user/context or discover patterns that could be a starting hypothesis for new knowledge. Example applications include: self management of stress at work using wearables, video search by example, finding side effects in patient forum posting or assistive communication tools for people with aphasia. Wessel Kraaij is also affiliated to TNO as a principal scientist ‘data analytics’.
More information about Wessel Kraaij
PhD Candidates
Postdocs and External PhD Candidates
News
Former PhD Candidates
Applied Data Analytics concerns the methodologies and techniques to extract value from large volumes of heterogeneous unstructured data. Examples of unstructured data are: text, video or sensor data generated by wearables or deviced connected through the Internet of Things.
Wessel Kraaij has been working with unstructured data for two decades. Initially he worked in the domain of (text) Information Retrieval on models for topic detection and tracking, cross language retrieval and summarization. Later he moved to multimedia information retrieval and has been scientific co-coordinator of the influential global NIST TRECVID benchmark since 2003.
More recently, he moved to the digital health domain and started a research line on contextual reasoning by means of the COMMIT/ SWELL project.
His current interests are related to the application of data science methods on personal and population level health and lifestyle data.
Professor of Applied data analytics
- Science
- Leiden Inst of Advanced Computer Science
- Askari A., Verberne S., Abolghasemi M.A., Kraaij W. & Pasi G. (2024), Retrieval for extremely long queries and documents with RPRS: a highly efficient and effective transformer-based re-ranker, ACM Transactions on Information Systems 42(5): 115.
- Askari A., Abolghasemi M.A., Pasi G., Kraaij W. & Verberne S. (2023), Injecting the BM25 score as text improves BERT-based re-rankers. Kamps J., Goeuriot L., Crestani F., Maistro M., Joho H., Davis B., Gurrin C., Kruschwitz U. & Caputo A. (Eds.), Advances in information retrieval: 45th European Conference on Information Retrieval, ECIR 2023. 45th European Conference on Information Retrieval, ECIR 2023 2 April 2023 - 6 April 2023 no. 13980. Cham: Springer. 66–83.
- Dirkson A., Verberne S., Oortmerssen G. van, Gelderblom H. & Kraaij W. (2023), How do others cope? : Extracting coping strategies for adverse drug events from social media, Journal of Biomedical Informatics 139: 104228.
- Dirkson A.R., Verberne S. & Kraaij W. (2022), Breaking BERT: Understanding its vulnerabilities for biomedical named entity recognition through adversarial attack. arXiv. [working paper].
- Hoevenaars D., Yocarini I.E., Paraschiakos S., Holla J.F.M., Groot S. de, Kraaij W. & Janssen T.W.J. (2022), Accuracy of heart rate measurement by the Fitbit Charge 2 during wheelchair activities in people with spinal cord injury: instrument validation study, JMIR Rehabilitation and Assistive Technologies 9(1): e27637.
- Hollander D. den, Dirkson A., Verberne S., Kraaij W., Oortmerssen G. van, Gelderblom H., Oosten A., Reyners A.K.L, Steeghs N., Graaf W.T.A. van der, Desar I. & Husson O. (2022), Symptoms reported by gastrointestinal stromal tumour (GIST) patients on imatinib treatment: combining questionnaire and forum data, Supportive Care in Cancer : .
- Dirkson A.R., Verberne S., Kraaij W., Oortmerssen G. van & Gelderblom H. (2022), Automated gathering of real-world data from online patient forums can complement pharmacovigilance for rare cancers, Scientific Reports 12(1): 10317.
- Dirkson A., Hollander D. den, Verberne S., Desar I., Husson O., Graaf W. T A van der, Oosten A., Reyners A. K L, Steeghs N., Loon W. van: Oortmerssen G. van, Gelderblom H. & Kraaij W. (2022), Sample bias in web-based patient-generated health data of Dutch patients with gastrointestinal stromal tumor: survey study, JMIR Formative Research 6(12): e36755.
- Dirkson A.R., Verberne S. & Kraaij W. (2021), FuzzyBIO: a proposal for fuzzy representation of discontinuous entities. Holderness E., Jimeno Yepes A., Lavelli A., Minard A.L., Pustejovsky J. & Rinaldi F. (Eds.), Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis. The 12th International Workshop on Health Text Mining and Information Analysis LOUHI 2021 19 April 2021 - 19 April 2021: Association for Computational Linguistics. 77–82.
- Egmond M.B. van, Spini G., Galien O. van der, IJpma A., Veugen T., Kraaij W., Sangers A., Rooijakkers T., Langenkamp P., Kamphorst B., L'Isle N. van de & Kooij-Janic M. (2021), Privacy-preserving dataset combination and Lasso regression for healthcare predictions, BMC Medical Informatics and Decision Making 21: 266.
- Kraaij W., Verberne S., Koldijk S., Korte E. de, Dantzig S. van, Sappelli M., Shoaib M., Bosems S., Achterkamp R., Bonomi A., Schavemaker J., Hulsebosch B., Wabeke T., Vollenbroek-Hutten M., Neerincx M. & Sinderen M. van (2020), Personalized support for well-being at work: an overview of the SWELL project, User Modeling and User-Adapted Interaction 30: 413-446.
- Berg A.C. van den, Giest S.N., Groeneveld S.M. & Kraaij W. (2020), Inclusivity in online platforms: Recruitment strategies for improving participation of diverse sociodemographic groups, Public Administration Review 80(6): 989-1000.
- Dirkson A.R., Verberne S. & Kraaij W. (2020), Conversation-Aware Filtering of Online Patient Forum Messages. Gonzalez-Hernandez G., Klein A.Z., Flores I., Weissenbacher D., Magge A., O'Connor K., Sarker A., Minard A.L., Tutubalina E., Miftahutdinov Z. & Alimova I. (Eds.), Proceedings of the Fifth Social Media Mining for Health Applications Workshop & Shared Task. Fifth Social Media Mining for Health Applications Workshop and Shared Task at COLING 2020 12 December 2020 - 12 December 2020: Association for Computational Linguistics. 11-18.
- Dirkson A.R., Verberne S. & Kraaij W. (2019), Narrative detection in online patient communities. Jorge A.M., Campos R., Jatowt A. & Bhatia S. (Eds.), Proceedings of Text2Story — Second Workshop on Narrative Extraction From Texts co-located with 41th European Conference on Information Retrieval (ECIR 2019). Text2Story Workshop at European Conference on Information Retrieval 2019 14 April 2019 - 14 April 2019: CEUR-WS. 21-28.
- Dirkson A.R., Verberne S. & Kraaij W. (2019), Lexical Normalization of User-Generated Medical Forum Data. Weissenbacher D. & Gonzalez-Hernandez G. (Eds.), Proceedings of the Fourth Social Media Mining for Health Applications (SMM4H) Workshop & Shared Task. Fourth Social Media Mining for Health Applications (#SMM4H) Workshop & Shared Task 2 August 2019 - 2 August 2019: Association for Computational Linguistics. 11-20.
- Dirkson A.R., Verberne S., Sarker A. & Kraaij W. (2019), Data-Driven Lexical Normalization for Medical Social Media, Multimodal Technologies and Interaction 3(3): 60.
- Brouwer A.M., Water L. van de, Hogervorst M., Kraaij W., Schaagen J.M. & Hogenelst K. (2018), Monitoring mental state during real life office work. Ham J., Spagnolli A., Blankerta B., Gamberini L. & Jacucci G. (Eds.), Symbiotic Interaction. Symbiotic 2017, The 6th International Workshop on Symbiotic Interaction 18 December 2017 - 19 December 2017 no. 10727: Springer International Publishing.
- Oortmerssen G. van, Raaijmakers S., Sappelli M., Boertjes E., Verberne S., Walasek N. & Kraaij W. (2018), Analyzing cancer forum discussions with text mining. Riano D., Peleg M., Lenz R., Reichert M., Denecke K., Deng Y., Declerck T. & Harmelen F. van (Eds.), Proceedings of the International Joint Workshop KR4HC - ProHealth 2017 (in conjunction with AIME 2017). International Joint Workshop KR4HC - ProHealth 2017 (in conjunction with AIME 2017) 24 June 2017 - 24 June 2017 127-130.
- Dirkson A.R., Verberne S., Oortmerssen G. van & Kraaij W. (2018), Lexical Normalization of User-generated Medical Forum Data, Proceedings of the 17th Dutch-Belgian Information Retrieval workshop. Dutch-Belgian Information Retrieval Workshop 23 November 2018 - 23 November 2018 1-4.
- Keesman M., Janssen V., Kemps H., Hollander M., Scholte op Reimer W., Gemert-Pijnen L. van, Hoes A., Kraaij W., Chavannes N., Atsma D., Kraaijenhagen R. & Evers A. (2018), BENEFIT for all: An ecosystem to facilitate sustained healthy living and reduce the burden of cardiovascular disease, European Journal of Preventive Cardiology 26(6): 606-608.
- Korte M.E. de, Wiezer N., Janssen J.H., Vink P. & Kraaij W. (2018), Evaluating an mHealth App for Health and Well-Being at Work: Mixed-Method Qualitative Study, JMIR mHealth and uHealth 6(3): e72.
- Korte E. de, Wiezer N., Bakhuys Roozeboom M., Vink P. & Kraaij W. (2018), Behavior Change Techniques in mHealth Apps for the Mental and Physical Health of Employees: Systematic Assessment, JMIR mHealth and uHealth 6(10): e167.
- Veeningen M., Chatterjea S., Horváth A.Z., Spindler G., Boersma E., Spek P. van der, Galiën O. van der, Gutteling J., Kraaij W. & Veugen T. (2018), Enabling Analytics on Sensitive Medical Data with Secure Multi-Party Computation. Ugon A., Karlsson D., Klein G.O. & Moen A. (Eds.), Building Continents of Knowledge in Oceans of Data: The Future of Co-Created eHealth. Medical Informatics Europe 2018 (29th MIE 2018) 24 April 2018 - 26 April 2018 no. Studies in Health Technology and Informatics, volume 247: EFMI / IOS Press. 76-80.
- Sappelli M., Verberne S. & Kraaij W. (2017), Evaluation of context-aware recommendation systems for information re-finding, Journal of the Association for Information Science and Technology 68(4): 895–910.
- Awad G., Kraaij W., Over P. & Satoh S. (2017), Instance search retrospective with focus on TRECVID, International Journal of Multimedia Information Retrieval 6(1): 1-29.
- Boer M.H.T. de, Lu Y.J., Zhang H., Schutte K., Ngo C.W. & Kraaij W. (2017), Semantic Reasoning in Zero Example Video Event Retrieval, ACM Transactions on Multimedia Computing, Communications and Applications 13(4): 60.
- Boer M. de, Pingen G., Knook D., Schutte K. & Kraaij W. (2017), Improving video event retrieval by user feedback, Multimedia Tools and Applications 76(21): 22361–22381.
- Raaijmakers S., Sappelli M. & Kraaij W. (2017), Investigating the interpretability of hidden layers in deep text mining, Semantics 2017 Proceedings of the 13th International Conference on Semantic Systems. SEMANTiCS 2017 11 September 2017 - 14 September 2017. New York, NY, U.S.A.: ACM. 177-180.
- Boer M. de, Schutte K. & Wessel Kraaij (2016), Knowledge based query expansion in complex multimedia event detection, Multimedia Tools and Applications 75: 9025-9043.
- Koldijk S., Neerincx M.A. & Wessel Kraaij (2016), Detecting work stress in offices by combining unobtrusive sensors, IEEE Transactions on Affective Computing PP(99): .
- Boer M.H.T. de, Schutte K., Zhang H., Lu Y-L, Ngo C-W & Wessel Kraaij (2016), Blind late fusion in multimedia event retrieval, International Journal of Multimedia Information Retrieval 5: 203-217.
- Verberne S., Sappelli M., Hiemstra D. & Wessel Kraaij (2016), Evaluation and analysis of term scoring methods for term extraction, Information Retrieval 19(5): 510-545.
- Sappelli M., Verberne S. & Kraaij W. (2016), Adapting the interactive activation model for context recognition and identification, ACM Transactions on Interactive Intelligent Systems 6(3): 22-30.
- Sappelli M., Pasi G., Verberne S., Boer M. de & Wessel Kraaij (2016), Assessing e-mail intent and tasks in e-mail messages, Information Sciences 358: 1-17.
- Koldijk S., Wessel Kraaij & Neerincx A.M. (2016), Deriving Requirements for Pervasive Well-Being Technology From Work Stress and Intervention Theory: Framework and Case Study, JMIR mHealth and uHealth 4(3): .
- Awad G., Fiscus J., Michel M., Joy D., Wessel Kraaij, Smeaton A.F., Quénot G., Eskevich M., Aly R., Jones J.F., Ordelman R., Huet B. & Larson M. (2016), TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking, Proceedings of TRECVID 2016. TRECVID 2016.
- Koldijk S., Koot G., Neerincx M. & Wessel Kraaij (2014), Privacy and User Trust in Context-Aware Systems. In: Dimitrova V., Kuflik T., Chin D., Ricci F., Dolog P. & Houben G-J. (Eds.), User Modeling, Adaptation, and Personalization. USER MODELING, ADAPTATION, AND PERSONALIZATION, UMAP 2014 no. LNCS8538: Springer. 134-145.
- Rest J. van, Grootjen F.A., Grootjen M., Wijn R., Aarts O., Roelofs M.L., Burghouts G.J., Bouma H., Alic L. & Wessel Kraaij (2014), Requirements for multimedia metadata schemes in surveillance applications for security, Multimedia Tools and Applications 70(1): 573-598.
- Awad G., Over P. & Wessel Kraaij (2014), Content-Based Video Copy Detection Benchmarking at TRECVID, ACM Transactions on Information Systems 32(3): 14.
- Schutte K., Bomhof F., Burghouts G., Diggelen J. van, Hiemstra P., Hof J. van 't, Wessel Kraaij, Pasman H., Smith A., Versloot C. & Wit J. de (2013), GOOSE: Semantic search on Internet connected sensors, Proceedings of SPIE - International Society for Optical Engineering 8758: .
- Verberne S., Heijden M. van der, Hinne M., Sappelli M., Koldijk S., Hoenkamp E. & Kraaij W. (2013), Reliability and Validity of Query Intent Assessments, Journal of the American Society for Information Science and Technology 64(11): 2224-2237.
- Ngo C.W., Xu C., Kraaij W. & El Saddik A. (2013), Web-Scale Near-Duplicate Search: Techniques and Applications, IEEE MultiMedia 20(3): 10-12.
- Larson M., Jong F. de, Wessel K. & Renals S. (2012), Special Issue on Searching Speech, ACM Transactions on Information Systems 30(3): 15.
- Meij E., Trieschnigg D., Rijke M. de & Kraaij W. (2010), Conceptual language models for domain-specific retrieval, Information Processing and Management 46(4): 448-469.
- Trieschnigg D., Pezik P., Lee V., Jong F. de, Kraaij W. & Rebholz-Schuhmann D. (2009), MeSH Up: effective MeSH text classification for improved document retrieval, Bioinformatics 25(11): 1412-1418.
- Trieschnigg D., Pezik P., Lee V., Jong F. de, Kraaij W. & Rebholz-Schuhmann D. (2009), Response to comment on 'MeSH-up: effective MeSH text classification for improved document retrieval', Bioinformatics 25(20): 2772-2772.
- Post W., Elling E., Cremers A. & Kraaij W. (2007), Experimental comparison of multimodal meeting browsers. In: Smith M.J. & Salvendy G. (Eds.), Human Interface and the Management of Information. Interacting in Information Environments. Human Interface 2007. Lecture Notes in Computer Science no. 4558. Berlin Heidelberg: Springer. 118-127.
- Arlandis J., Over P. & Kraaij W. (2005), Boundary error analysis and categorization in the TRECVID news story segmentation task. Leow W.K., Lew M.S., Chua T.S., Ma W.Y., Chaisorn L. & Bakker E.M. (Eds.), Image and Video Retrieval. CIVR 2005. International Conference on Image and Video Retrieval 20 July 2005 - 22 July 2005. Lecture Notes in Computer Science no. LNCS 3568. Berlin, Heidelberg: Springer. 103-112.
- Kraaij W., Nie J.Y. & Simard M. (2003), Embedding Web-based statistical translation models in cross-language information retrieval, Computational Linguistics 29(3): 381-419.
- Reidsma D., Hiemstra D., Jong F. de & Kraaij W. (2003), Cross-language retrieval at the University of Twente and TNO. In: Peters C., Braschler M., Gonzalo J. & Kluck M. (Eds.), Advances in Cross-Language Information Retrieval. CLEF 2002. Lecture Notes in Computer Science no. 2785. Berlin Heidelberg: Springer. 197-206.
- Wessel Kraaij (2002), TNO at CLEF-2001: Comparing translation resources. In: Peters C., Braschler M., Gonzalo J. & Kluck M. (Eds.), Evaluation of Cross-Language Information Retrieval Systems. CLEF 2001. Lecture Notes in Computer Science no. 2406. Berlin Heidelberg: Springer. 78-93.
- Kraaij W. & Pohlmann R. (2001), Different approaches to cross-language information retrieval. Daelemans W., Sima'an K., Veenstra J. & Zavrel J. (Eds.), Computational Linguistics in the Netherlands 2000. Eleventh Conference on Computational Linguistics in the Netherlands 3 November 2000 - 3 November 2000. Language and Computers: Studies in Practical Linguistics no. 37: Brill / Rodopi. 97-110.
- Stal W.G. ter, Beijert J.H., Bruin G. de, Gent J. van, Jong F.M.G. van, Wessel Kraaij., Netter K. & Smart G. (1998), Twenty-One: cross-language disclosure and retrieval of multimedia documents on sustainable development, Computer Networks and ISDN Systems 30(13): 1237-1248.
- principal scientist
- Board member