IRT-AI

LREC-COLING 2024 Tutorial : Hallucination in Large Language Models

Date: May 25, 2024 | Torino, Italy

In the fast-paced domain of Large Language Models (LLMs), the issue of hallucination is a prominent challenge. Despite continuous endeavors to address this concern, it remains a highly active area of research within the LLM landscape. Grasping the intricacies of this problem can be daunting, especially for those new to the field. This tutorial aims to bridge this knowledge gap by introducing the emerging realm of hallucination in LLMs. It will comprehensively explore the key aspects of hallucination, including benchmarking, detection, and mitigation techniques. Furthermore, we will delve into the specific constraints and shortcomings of current approaches, providing valuable insights to guide future research efforts for participants.

For more information, visit Our Website

AAMAS 2025 Tutorial : Multiagent CoPilot in Industrial AI Applications

Date: May 19-20, 2025 | Detroit, Michigan, USA

In the era of smart automation and digital transformation, achieving efficiency, precision, and adaptability is essential for industries to remain competitive. Sectors, including manufacturing, supply chain and logistics, healthcare, finance, and retail, face significant challenges in deploying Artificial Intelligence (AI) solutions tailored to their unique needs, particularly in critical, resource-constrained applications. According to Gartner’s 2024 Hype Cycle for Artificial Intelligence, composite AI, which integrates techniques like machine learning, knowledge graphs, and rule-based systems, is becoming foundational for industries, enhancing predictions, decisions, and scalability across complex environments. The complexity of real-world systems requires Industrial AI solutions to be customizable to business needs, compact for efficient deployment on resource-constrained devices, and agile to adapt to changing requirements. By being neurosymbolic, such solutions integrate data, knowledge, and human expertise to create robust, explainable, and trustworthy AI that supports planning and reasoning. In this tutorial, we will introduce Multiagent CoPilot for Industrial AI applications focusing on the primary use case of manufacturing (offering requirements, data, knowledge, human expertise). The use cases we will describe are inspired by collaborations with, or similar efforts at Bosch, Hewlett Packard Enterprise, Siemens, and others. AAMAS audience will learn about human-in-the-loop CoPilots as we explore how multiagent coordination, collaboration, and decision-making can enhance the functionality of industrial AI models. With our primary use case, we will demonstrate how to address the unique challenges faced by the manufacturing industry, from improving operational efficiency to enhancing adaptability in critical tasks. However, the knowledge and insights gained from this tutorial are applicable and generalizable to various industries, like transportation and healthcare, offering valuable perspectives for researchers and professionals across domains seeking to adopt these technologies in real-world applications.

For more information, visit Our Website

AAAI 2025 Tutorial : Hallucinations in Large Multimodal Models

Date: February 26, 2025 | Philadelphia, Pennsylvania, USA

Large Language Models (LLMs) have made significant strides in generating human-like text, but their tendency to hallucinate—producing factually incorrect or fabricated information—remains a pressing issue. This tutorial provides a comprehensive exploration of hallucinations in LLMs, introducing participants to the key concepts and challenges in this domain. We will cover the types of hallucination, including Factual Mirage and Silver Lining, and present the latest approaches for benchmarking, detection, and mitigation. The motivation for understanding hallucination is particularly critical from a multimodal standpoint, as Vision-Language Models (VLMs) can exacerbate the problem by blending hallucinated text with misleading images or video. The tutorial will offer practical techniques to reduce hallucinations using both black-box and gray-box methods. Designed for researchers and professionals in generative AI, this tutorial bridges the gap between emerging research and practical solutions, providing attendees with valuable insights and tools to enhance the factual accuracy of LLM outputs. Participants will gain a deeper understanding of the complexities surrounding LLM hallucination and be equipped with strategies to drive future innovations in the field.

For more information, visit Our Website

AAAI'25-Lab: Developing Explainable Multimodal AI Models With Hands-on Lab on the Life-cycle of Rare Event Prediction in Manufacturing

Date: February 25-26, 2025 | Philadelphia, Pennsylvania, USA

In the age of Industry 4.0 and smart automation, unplanned downtime costs industries over $50 billion annually. Even with preventive maintenance, industries like automotive lose more than $2 million per hour due to downtime caused by unexpected or “rare” events. The extreme rarity of these events makes their detection and prediction a significant challenge for AI practitioners. Factors such as the lack of high-quality data, methodological gaps in the literature, and limited practical experience with multimodal data exacerbate the difficulty of rare event detection and prediction. This lab will provide hands-on experience to learn how to address these challenges by exploring the entire lifecycle of rare event analysis, from data generation and preprocessing to model development and evaluation. Developing a process ontology for user-level/application-level/domain-specific explanations will also be demonstrated. Participants will be introduced to the limited publicly available datasets and, will gain hands-on experience with a newly developed multi-modal dataset designed explicitly for rare event prediction. Through several hands-on sessions, participants will learn how to generate such a high-quality dataset and the practical use of this dataset to develop rare event prediction models. Those interested in developing AI models involving diverse multimodal data for other applications will also benefit from participation. The learning from this lab will also be relevant to other domains and applications, such as healthcare, finance, and energy, where predictive maintenance can help prevent costly failures in complex systems. Participants will gain valuable insights and skills transferrable across industries where rare events impact operational efficiency and require advanced predictive techniques.

For more information, visit Our Website

IEEE BigData 2024: Neuro-Symbolic AI for Deep Analysis of Social Media Big Data

Date: December 15-18, 2024 | Washington, DC, USA

This tutorial introduces a neuro-symbolic AI framework to analyze big data from social media platforms. Integrating human-curated knowledge through symbolic AI with the pattern recognition capabilities of neural networks enhances the adaptability and efficiency of traditional neural network approaches. Knowledge-guided zero-shot learning techniques enable swift adaption to new linguistic contexts and emerging events [6]. Participants will explore how to design, develop, and utilize these models in specific domains, such as public health surveillance that require dynamic adaptation to new terminologies. The tutorial aims to equip attendees with practical skills and a deep understanding of how to apply neuro-symbolic AI to manage and analyze large-scale social media datasets effectively.

For more information, visit Our Website

IEEE BigData 2024: Knowledge-driven Processes for Big Data Management and Applications

Date: December 15-18, 2024 | Washington, DC, USA

The unparalleled volume of data generated has heightened the need for approaches that can manage and translate them into actionable insights. While the contemporary data-driven and generative systems are popular for handling large volume of changing and diverse data, they are not silver bullets due to the inherent lack of knowledge grounding. The emerging use of knowledge-driven processes have surfaced as compelling approaches for leveraging external knowledge and structured representation to complement the shortcomings within data-driven systems. Such processes which while exploiting data, also use extensive knowledge in the form of Knowledge Graphs (KGs). In this tutorial, we will introduce and provide interactive hands-on and lab-oriented sessions on the knowledge-driven processes for big data management and applications using realworld datasets ranging from structured, semi-structured, and unstructured formats. Specifically we will use the EMPWR platform for creating and maintaining large KGs and demonstrate recent innovations in three concrete real world use-cases: (i) development of a pharmaceutical KG with over 6M triples, 1.5M nodes, and 3000 relation types; (ii) development of a suite of large scale KGs with 10M+ triples, 2M+ entities, and 19 relations from real-world driving scenes and their use in machine perception tasks; and (iii) AI pipelines recommender system with KG consisting of 78M triples, 8M nodes, and 25M relations.

For more information, visit Our Website

Neurosymbolic Customized and Compact CoPilots

Large Language Models (LLMs) are credible with open-domain interactions such as question answering, summarization, and explanation generation. LLM reasoning is based on parametrized knowledge, and as a consequence, the models often produce absurdities and inconsistencies in outputs (eg, hallucinations and confirmation biases)[2]. In essence, they are fundamentally hard to control to prevent off-the-rails behaviors, are hard to fine-tune, customize for tailored needs, prompt effectively (due to the “tug-of-war” between external and parametric memory), and extremely resource-hungry due to the enormous size of their extensive parametric configurations. Thus, significant challenges arise when these models are required to perform in critical applications in domains such as healthcare and finance, that need better guarantees and in turn, need to support grounding, alignment, and instructibility. AI models for such critical applications should be customizable or tailored as appropriate for supporting user assistance in various tasks, compact to perform in real-world resource-constraint settings, and capable of controlled, robust, reliable, interpretable, and grounded reasoning (grounded in rules, guidelines, and protocols). This special session explores the development of compact, custom neurosymbolic AI models and their use through human-in-the-loop co-pilots for use in critical applications.

Lecture-style Tutorial: Causal AI for web and health care

Improving the performance and explanations of ML algorithms is a priority for adoption by humans in the real world. In critical domains such as healthcare, such technology has significant potential to reduce the burden on humans and considerably reduce manual assessments by providing quality assistance at scale. In today’s data driven world, artificial intelligence (AI) systems are still experiencing issues with bias, explainability, and human-like reasoning and interpretability. Causal AI is the technique that can reason and make human like choices making it possible to go beyond narrow Machine learning based techniques and can be integrated into human decision making. It also offers intrinsic explainability, new domain adaptability, bias free predictions and works with datasets of all sizes. In this tutorial of type lecture style we detail how a richer representation of causality in AI systems using a knowledge graph (KG) based approach is needed for intervention and counterfactual reasoning (Figure 1), how do we get to model based and domain explainability, how causal representations helps in web and health care.

Neuro-symbolic AI for mental healthcare

Artificial Intelligence (AI) systems for mental healthcare (MHCare) have been ever-growing after realizing the importance of early interventions for patients with chronic mental health (MH) condi- tions. Social media (SocMedia) emerged as the go-to platform for supporting patients seeking MHCare. The creation of peer-support groups without social stigma has resulted in patients transitioning from clinical settings to SocMedia supported interactions for quick help. Researchers started exploring SocMedia content in search of cues that showcase correlation or causation between different MH conditions to design better interventional strategies. User-level Classification-based AI systems were designed to leverage diverse SocMedia data from various MH conditions, to predict MH condi- tions. Subsequently, researchers created classification schemes to measure the severity of each MH condition. Such ad-hoc schemes, engineered features, and models not only require a large amount of data but fail to allow clinically acceptable and explainable reasoning over the outcomes. To improve Neural-AI for MHCare, infusion of clinical symbolic knowledge that clinicans use in decision mak- ing is required. An impactful use case of Neural-AI systems in MH is conversational systems. These systems require coordination between classification and generation to facilitate humanistic con- versation in conversational agents (CA). Current CAs with deep language models lack factual correctness, medical relevance, and safety in their generations, which intertwine with unexplainable statistical classification techniques.

Knowledge-infused Learning for Autonomous Driving

Autonomous Driving (AD) is considered as a testbed for tackling many hard AI problems. Despite the recent advancements in the field, AD is still far from achieving full autonomy due to core technical problems inherent in AD. The emerging field of neuro-symbolic AI and the methods for knowledge-infused learning are showing exciting ways of leveraging external knowledge within machine/deep learning solutions, with the potential benefits for interpretability, explainability, robustness, and transferability. In this tutorial, we will examine the use of knowledge-infused learning for three core state-of-the-art technical achievements within the AD domain. With a collaborative team from both academia and industry, we will demonstrate recent innovations using real-world datasets.

For more information, visit the KL4AD website.

Knowledge-infused Reinforcement Learning

Virtual health agents (VHAs) have received considerable attention, but the early focus has been on collecting data, helping patients follow generic health guidelines, and providing reminders for clinical appointments. While presenting the collected data and frequency of visits to the clinician is useful, further context and personalization are needed for a VHA to interpret and understand what the data means in clinical terms. This has made their use in managing health limited. Such understanding enables patient empowerment and self-appraisal -- i.e., aiding the patient in interpreting the data to understand the changes in the patient’s health conditions, and self-management -- i.e., to help a patient better manage their health through better adherence to the clinician guidelines and clinician recommended care plan. Crisis conditions such as the current pandemic have further stressed our healthcare system and have made the need for such advanced support more attractive and in demand. Consider the rapid growth in mental health because the patients who already had mental health conditions worsen, and many develop such conditions due to the challenges arising from lockdown, isolation, and economic hardships. The severe lack of timely availability of clinical expertise to meet the rapidly growing demand provides the motivation for advancing this research in developing more advanced VHAs and evaluating it in the context of mental health management.

For more information, visit the KiRL website.

Explainable AI using Knowledge Graphs

Date: January 02 - 04, 2021, Bangalore, India

During the last decade, traditional data-driven deep learning (DL) has shown remarkable success in essential natural language processing tasks, such as relation extraction. Yet, challenges remain in developing artificial intelligence (AI) methods in real-world cases that require explainability through human interpretable and traceable outcomes. The scarcity of labeled data for downstream supervised tasks and entangled embeddings produced as an outcome of self-supervised pre-training objectives also hinders interpretability and explainability. Additionally, data labeling in multiple unstructured domains, particularly healthcare and education, is computationally expensive as it requires a pool of human expertise. Consider Education Technology, where AI systems fall along a “capability spectrum” depending on how extensively they exploit various resources, such as academic content, granularity in student engagement, academic domain experts, and knowledge bases to identify concepts that would help achieve knowledge mastery for student goals. Likewise, the task of assessing human health using online conversations raises challenges for current statistical DL methods through evolving cultural and context-specific discussions. Hence, developing strategies that merge AI with stratified knowledge to identify concepts that would delineate healthcare conversations patterns and help healthcare professionals decide. Such technological innovations are imperative as they provide consistency and explainability in outcomes. This tutorial discusses the notion of explainability and interpretability through the use of knowledge graphs in (1) Healthcare on the Web, (2) Education Technology. This tutorial will provide details of knowledge-infused learning algorithms and its contribution to explainability for the above two applications that can be applied to any other domain using knowledge graphs.

For more information, visit https://aiisc.ai/xaikg/

ACM HT 2020 Tutorial: Knowledge-infused Deep Learning

Recent advances in statistical and data-driven deep learning demonstrate significant success in natural language understanding without using prior knowledge, especially in structured and generic domains, where data is abundant. On the other hand, in text processing problems that are dynamic and impact the society at large, existing data-dependent, state-of-the-art deep learning methods remain vulnerable to veracity considerations and especially, high volume that masks small, emergent signals. Statistical natural language processing methods have shown poor performance in capturing: (1) Human well being online especially in evolving events (e.g. mental health communications on Reddit, Twitter), (2) Culture and context specific discussion on the web (e.g. humor detection, extremism on social media), (3) Social Network Analysis (help-seeker and care-provider) during pandemic or disaster scenarios, and (4) Explainable methods of learning that drive technological innovations and inventions for community betterment. In such social hypertext, leveraging the semantic-web concept of knowledge graphs is a promising approach to the enhancement of deep learning and natural language processing.

According to Piagetian human learning theory, the activation of existing schema guides the apprehension of experience to support the generation of context sensitive responses. Activating prior knowledge connects current and past experience for identifying relations, supporting explanation, reducing ambiguity, structuring new knowledge, and application to novel materials. Further, human learning does not necessarily rely on large amounts of (annotated) cases to proceed. Because prior knowledge is so powerful in human learning, its incorporation at various levels of abstraction in deep learning could benefit outcomes. Example the desiderata include compensating for data limitations, improving inductive bias, generating explainable outcomes and enabling trust. These are particularly useful for data-limited but otherwise complex, evolving problems in domains such as mental healthcare, online social threats and epidemic/pandemic.

Despite the general agreement that structured prior knowledge and tacit knowledge (the inferred outcome of a model) resulting from deep learning should be combined, there has been little progress. Recent debates on Neuro-Symbolic AI , the inclusion of innate priors in deep learning, and AI fireside chat have identified knowledge-infused learning to improve explainability, interpretability, and trust in AI systems.

In this tutorial, we take use cases from the aforementioned two social good applications (Mental Health, Radicalization) and multimodal aspects of social media (e.g. scene understanding from images, video and text (hypermedia/hypertext) often found in documentation of critical events to explore the modern aspect of hypertext using semantic web in the form of Knowledge Graphs (KG). Specifically, the tutorial will provide a detailed walkthrough on Knowledge Graphs and their utility in developing knowledge-infusion techniques for interpretable and explainable learning for text, video, images, and graphical data on the web with the following agenda: Motivate the novel paradigm of knowledge-infused learning using computational learning and cognitive theories. Describe the different forms of knowledge, methods of automatic modeling of KG, and infusion methods in deep/machine learning. Discuss application-specific evaluation methods specifically for explainability and reasoning using benchmark datasets and knowledge-resources that show promise in advancing the capabilities of deep learning. Future directions of KGs and robust learning for the Web and Society.

Knowledge In - Wisdom Out - Explainable Data for AI in Cyber Social Threats and Public Health

Date: June 07 - 10, 2021

In today's data-driven world, organizations derive insights from massive amounts of data through large scale statistical machine learning models. However, statistical techniques can be easy to fool with adversarial instances (a neural network can predict a non-extremist as an extremist by mere presence of the word Jihad), which raises question in Data Quality. In high stakes decision making problems, such as cyber social threats, it is highly sensitive to classify a non-extremist as an extremist and vice-versa. Data quality is good if the data possesses adequate domain coverage and the labels contain adequate semantics. For example, is the semantics of an extremist vs. non-extremist vis-a-vis the word Jihad captured in the label (adequate semantics in labels)? Also, are there enough non-extremists with the word Jihad in the training data from the perspective of religion, hate, or ideology? Thus semantic annotation of the data, beyond mere labels attached to data instances, can significantly improve the robustness of model outcomes and ensure that the model has learned from trustworthy, knowledge-guided data standards. It is important to note that the knowledge-guided standards help de-bias the data if specified correctly (contextualized de-biasing extremist behavior data from bias towards the word Jihad). Therefore, in addition to trust in the robustness of outcomes, knowledge guided data creation also enables fair and ethical practices during real-world deployment of machine learning in high stakes decision making. We denote such data as Explainable Data. In this tutorial of type course and case-studies, we detail how to construct Explainable Data using various expert resources and knowledge graphs. All the materials (resources and implementations) presented during the tutorial will be made available on: KIWO-ICWSM, a week before the tutorial. We plan a 90 minute tutorial (Intermediate Level) with 2 breaks (5 mins each).