Loading…
Monday, October 28
 

8:00am EDT

Breakfast Pastries and Coffee Served
Monday October 28, 2024 8:00am - 9:00am EDT
Breakfast pastries and coffee will be served in both locations on Monday, October 28.
Monday October 28, 2024 8:00am - 9:00am EDT
Jared L. Cohon University Center, Rangos Ballroom 1 AND SEIber Cafe

8:00am EDT

Registration Check-In Table Opens
The registration table will be staffed from 8:00 a.m. until sessions end each day. Please stop by to check in and pick up your name badge. If you have any questions, please feel free to stop by and ask our staff. 

9:00am EDT

Data Fabric: Technologies, Modeling and Applications
Monday October 28, 2024 9:00am - 11:00am EDT
Data Fabric is an amalgamation of various database system technologies, offering extensive research opportunities for deploying end-to-end data management platform-based solutions. These platforms have seen advancements in middleware, advanced and powerful ETL pipelines, and generative AI-supported data pipelines with unifed storage and compute to establish compliance and governance and reduce latency. Deployed systems using technologies such as data mesh, data lakes, data warehouses, and cloud databases serve as data sources, and the data fabric solution manages data, query, and analytics pipelines by leveraging distributed computing capabilities and dynamically routing queries for optimal performance without centralizing data storage. Understanding the interconnections (technology and applications) among source systems, data fabric, domain, and application is crucial for establishing correct and complete data fabric solutions for user applications. This paper presents a holistic view of data fabric technologies and addresses the importance of understanding the interconnections among source data systems, data fabric, domain, and application, focusing on metadata and application development. For metadata, we envisage an ER model solution to provide an overall conceptual data landscape for the underlying data systems for a data fabric.
Monday October 28, 2024 9:00am - 11:00am EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

9:00am EDT

2nd Workshop on Modeling in the Age of Large Language Models (LLM4Modeling)
Monday October 28, 2024 9:00am - 1:00pm EDT
LLM4Modeling papers to be presented:
  • An LLM Assistant for Characterizing Conceptual Modeling Research Contributions
    Steve Liddle, Heinrich C. Mayr, Oscar Pastor, Veda Storey and Bernhard Thalheim
  • AI-Assisted Analytics – An Automated Approach to Data Visualization
    Alberto Alves, João Moura Pires, Maribel Yasmina Santos, Ana Léon and Andreia Almeida
  • Combining Natural Language Generation and Graph Algorithms to Explain Causal Maps through Meaningful Paragraphs
    Tyler Gandee and Philippe Giabbanelli

Monday October 28, 2024 9:00am - 1:00pm EDT
Jared L. Cohon University Center, McKenna Room

9:00am EDT

3rd International Workshop on Digital JUStice, Digital Law and Conceptual MODeling (JUSMOD24)
Monday October 28, 2024 9:00am - 1:00pm EDT
JUSMOD24 papers to be presented:
  • RESIGNIFING COMPLIANCE. BETWEEN ONTOLOGIES AND EPISTEMOLOGIES OF LAW
    Matteo Buffa
  • Modelling Legal Enforcement with UFO-L: a Case from Swedish Healthcare
    Jöran Lindeberg, Paul Johannesson, Martin Henkel, Erik Perjons and Katarina Fast Lappalainen
  • The eu-FAIRnews: A Preliminary Exploration of Bridging Disinformation and Digital Justice through FAIR Data Practices in Online News Sources
    Antonella Calo, Marco Zappatore, Antonella Longo, Davide Damiano Colella, Marco Longo and Priamo Tarantino
  • Safety Assurances in Autonomous Vessels    
    Sreekant Sreedharan, Muthu Ramachandran, Erik Røsæg and Børge Rokseth

Monday October 28, 2024 9:00am - 1:00pm EDT
Jared L. Cohon University Center, Wright Room 5032 Forbes Ave, Pittsburgh, PA 15213

9:00am EDT

The 5th International Workshop on Conceptual Modeling for Life Sciences (CMLS)
Monday October 28, 2024 9:00am - 1:00pm EDT
CMLS Papers to be Presented:
  • On the Expressiveness of Petri Nets for Modeling Biological Processes the Case for mRNA Translation and Protein Synthesis
    Luis Henrique Costa Neto, Sérgio Lifschitz, Fernanda Baiao, Marcos Catanho, Antonio Basílio de Miranda and Edward Hermann Haeusler
  • Enhancing Vaxign-DL for Vaccine Candidate Prediction with added ESM-Generated Features
    Yichao Chen, Yongqun He and Yuhan Zhang
  • Conceptual Modeling for Polygenic Risk Score Research: Improving Domain Understanding and Clinical Utility
    Diana Martínez-Minguet and Óscar Pastor
  • Integrative Ontology of Bipolar Disorder (OBD): Advancing Bipolar Disorder Research through an Interoperable Ontological Framework
    Yujia Tian, Yongqun He, Rachel Richesson and Melvin Mclnnis




















Monday October 28, 2024 9:00am - 1:00pm EDT
Jared L. Cohon University Center, Rangos Ballroom 3

9:00am EDT

The 17th International i* Workshop (iStar’24)
Monday October 28, 2024 9:00am - 3:30pm EDT
iStar24 papers to be presented:

(coming soon!)
Monday October 28, 2024 9:00am - 3:30pm EDT
SEI Training Room 1202 4500 Fifth Avenue, Pittsburgh, PA 15213

9:00am EDT

The First International Workshop on AI Services and Applications (AISA’2024) + The First International Workshop on AI-Driven Modeling and Management of Data (AIMM 2024)
Monday October 28, 2024 9:00am - 5:30pm EDT
AIMM 2024 is integrated in AISA’2024. Papers to be presented:
  • Big Data and Artificial Intelligence: innovation-oriented research to improve decision making    
    Distinguished Speaker: Juan Trujillo
  • Empirical case study of AI Service and Application for people with disabilities
    Jaehwan Lee and Jintaek Jung
  • A Methodological Framework for Designing Human-Centered Artificial Intelligence Services    
    Thang Le Dinh, Tran Duc Le and Jolita Ralyté
  • Beyond One-Fits-All: A Case Study Approach to AI System Design Methods
    Sabine Janzen and Hannah Stein
  • GRASPER: Leveraging Knowledge Graphs for Predictive Supply Chain Analytics
    Sabine Janzen, Hannah Stein and Sebastian Baer
  • An MLOps Framework to Data-Driven Modelling of Digital Twins with an Application to Virtual Test Rigs
    Denis Kruschinski, Dylan Tchawou Ngassam, Umut Durak and Sven Hartmann
  • Empirical Study on the Use of Artificial General Intelligence Healthcare in the Elderly
    Seungho Seo and Jintaek Jung
  • Effects of Perceived Ease of Use and Perceived Usefulness of Technology Acceptance Model on Intention to Continue Using Generative AI: Focusing on the Mediating Effect of Satisfaction and Moderating Effect of Innovation Resistance
    Sa-Rang Jeong, Si-Hoo Kim and Seung-Hee Lee
  • Conceptual Modeling for Public AI Systems
    Seonghwan Ju, Seoltae Ko and Andrew Lim
  • Self-Explanatory Retrieval-Augmented Generation for SDG Evidence Identification
    Dario Garigliotti

Monday October 28, 2024 9:00am - 5:30pm EDT
Jared L. Cohon University Center, Rangos Ballroom 2 5032 Forbes Ave, Pittsburgh, PA 15213

11:00am EDT

Morning Break
Monday October 28, 2024 11:00am - 11:30am EDT
Morning break and coffee refresher will be served in both locations on Monday, October 28.
Monday October 28, 2024 11:00am - 11:30am EDT
Jared L. Cohon University Center, Rangos Ballroom 1 AND SEIber Cafe

1:00pm EDT

Lunch
Monday October 28, 2024 1:00pm - 2:00pm EDT
Lunch will be served in both locations on Monday, October 28.
Monday October 28, 2024 1:00pm - 2:00pm EDT
Jared L. Cohon University Center, Rangos Ballroom 1 AND SEIber Cafe

2:00pm EDT

Spatial Conceptual Modeling: MM-AR Metamodeling Platform and Augmented Reality Workflow Modeling Language
Monday October 28, 2024 2:00pm - 3:30pm EDT
This tutorial introduces the concept of Spatial Conceptual Modeling and MM-AR, a new web-based, 2D/3D metamodeling platform. It covers the prototypical implementation of MM-AR and the Augmented Reality Workflow Modeling Language (ARWFML) together with use cases and a hands-on session.
Monday October 28, 2024 2:00pm - 3:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

2:00pm EDT

7th International Workshop on Empirical Methods in Conceptual Modeling (EmpER’24) + 5th International Workshop on Quality and Measurement of Model-Driven Software Development (QUAMES 2024)
Monday October 28, 2024 2:00pm - 5:30pm EDT
EmpER'24 and QUAMES 2024 are merged to EmpER'24 + QUAMES 2024. Papers to be presented:
  • How Does UML Look and Sound? Using Multimodal AI to Interpret UML Diagrams through Empirical Evidence
    Aleksandar Gavric, Dominik Bork and Henderik A. Proper
  • Can Large Language Models Learn Conceptual Modeling by Looking at Slide Decks and Pass Graduate Examinations? An Empirical Study
    Philippe Giabbanelli and Noé Flandre
  • Evaluating a Framework of Conceptual Modelling Research
    Jose Ignacio Panach Navarrete, Oscar Pastor, Stephen W. Liddle, Veda C. Storey, Heinrich C. Mayr and Bernhard Thalheim
  • Extending Goal Models with Execution Orders: An Investigation of the Impact on Comprehensibility
    Jeshwitha Jesus Raja, Akhila Vissom Raju, Jennifer Brings and Marian Daun
  • Towards leveraging gamified code-testing for effective model validation
    Felix Cammaerts and Monique Snoeck

Monday October 28, 2024 2:00pm - 5:30pm EDT
Jared L. Cohon University Center, Rangos Ballroom 3

3:30pm EDT

Afternoon Break
Monday October 28, 2024 3:30pm - 4:00pm EDT
An afternoon break will be served in both locations on Monday, October 28.
Monday October 28, 2024 3:30pm - 4:00pm EDT
Jared L. Cohon University Center, Rangos Ballroom 1 AND SEIber Cafe

4:00pm EDT

A Deep Dive Into Benchmarking Ontology Reasoners: Techniques, Tools, and Insights
Monday October 28, 2024 4:00pm - 5:30pm EDT
Ontology-based reasoners are crucial for knowledge representation and reasoning across various domains, including healthcare and finance, as they facilitate informed decision-making. This tutorial will guide participants through key performance metrics, experimental design strategies, and data considerations required for effective benchmarking, equipping them with the knowledge to evaluate and enhance the capabilities of the reasoners.
Monday October 28, 2024 4:00pm - 5:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

5:30pm EDT

Adjourn
Monday October 28, 2024 5:30pm - 5:45pm EDT
Adjourn for the evening
Monday October 28, 2024 5:30pm - 5:45pm EDT
All locations
 
Tuesday, October 29
 

8:00am EDT

Breakfast Pastries and Coffee Served
Tuesday October 29, 2024 8:00am - 9:00am EDT
Tuesday October 29, 2024 8:00am - 9:00am EDT
SEIber Café

8:00am EDT

Registration Check-In Table Opens
Tuesday October 29, 2024 8:00am - 6:30pm EDT
The registration table will be staffed from 8:00 a.m. until sessions end each day. Please stop by to check in and pick up your name badge. If you have any questions, please feel free to stop by and ask our staff. 
Tuesday October 29, 2024 8:00am - 6:30pm EDT
SEI First Floor Lobby 4500 Fifth Avenue

9:00am EDT

Opening
Tuesday October 29, 2024 9:00am - 9:30am EDT
Tuesday October 29, 2024 9:00am - 9:30am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

9:30am EDT

Keynote: Towards explanatory conceptual models: a systematic approach based on ontological analysis
Tuesday October 29, 2024 9:30am - 11:00am EDT
A widespread requirement for conceptual models is that they need to have some kind of formal semantics in order to be used, and especially in order to be shared. In this talk I will defend that just having a formal semantics is not enough: in order to be shared and integrated, conceptual models need to be explained in terms of their ontological commitments to the world. I will distinguish therefore between ordinary conceptual models, which typically describe a domain in terms of its relevant entities and relationships, and have therefore a merely descriptive role, and explanatory conceptual models, which aim at explaining why those relationships hold. I will propose then a systematic approach to expand an ordinary model at two different levels of explanatory detail, based on the ontological notion of truth-making.
Speakers
avatar for Nicola Guarino

Nicola Guarino

Retired Research Associate, Institute of Cognitive Sciences and Technologies of the Italian National Research Council (ISTC-CNR)
Nicola Guarino is a retired research associate at the Institute of Cognitive Sciences and Technologies of the Italian National Research Council (ISTC-CNR), and former director of the ISTC-CNR Laboratory for Applied Ontology (LOA) based in Trento.He has been playing a leading role... Read More →
Tuesday October 29, 2024 9:30am - 11:00am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

11:00am EDT

Morning Break
Tuesday October 29, 2024 11:00am - 11:30am EDT
Tuesday October 29, 2024 11:00am - 11:30am EDT
SEIber Café

11:30am EDT

Multi-Faceted Evaluation of Modeling Languages for Augmented Reality Applications - The Case of ARWFML
Tuesday October 29, 2024 11:30am - 12:00pm EDT
The evaluation of modeling languages for augmented reality applications poses particular challenges due to the three-dimensional environment they target. The previously introduced Augmented Reality Workflow Modeling Language (ARWFML) enables the model-based creation of augmented reality scenarios without programming knowledge. Building upon the first design cycle of the language's specification, this paper presents two further design iterations for refining the language based on multi-faceted evaluations. These include a comparative evaluation of implementation options and workflow capabilities, the introduction of a 3D notation, and the development of a new 3D modeling environment. On this basis, a comprehensibility study of the language was conducted. Thereby, we show how modeling languages for augmented reality can be evolved towards a maturity level suitable for empirical evaluations.
Tuesday October 29, 2024 11:30am - 12:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

12:00pm EDT

Application of the Tree-of-Thoughts Framework to LLM-Based Domain Modeling
Tuesday October 29, 2024 12:00pm - 12:30pm EDT
Domain modeling is typically an iterative process where the modeling expert interacts with the domain experts to complete and refine the model. Recently, we have seen several attempts trying to assist, or even replace, the modeler with a Large Language Model (LLM). Several prompting strategies have been attempted but with limited success so far.

In this paper, we advocate for the adoption of a Tree-of-Thoughts (ToT) strategy to overcome the limitations of current approaches based on simpler prompting strategies. With ToT, we can decompose the modeling process into several substeps using for each step a specialized set of generators and evaluators prompts to optimize the quality of the LLM output. As part of our adaptation, we provide a Domain-Specific Language to facilitate the formalization of the ToT process for domain modeling. Our approach is implemented as part of an open source tool availabe on GitHub.
Tuesday October 29, 2024 12:00pm - 12:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

12:30pm EDT

ECQL: Towards Succinct and Extensible Modeling of Multi-model Query Results
Tuesday October 29, 2024 12:30pm - 1:00pm EDT
On top of database technologies on diverse data models, the concept of polystore has been studied for unified management of heterogeneous data. However, the design of multi-model query languages, which are the major interface of polystores, is an important yet challenging task. There are still drawbacks in the existing multi-model query languages, including limited expressiveness of query representation and fixed data models.

To address these problems, this paper presents the Equivalently Combinations of Query Languages (ECQL) as a holistic representation of multi-model queries. This paper firstly makes fundamental abstractions of the query results of different data models, and then designs a new multi-model query language that combines the query languages of different data models based on these abstractions. We explain the expressiveness of ECQL that it supports both single-model query functionalities and cross-model queries. The evaluation results between ECQL and existing multi-model query languages on polystore benchmarks show the succinctness of ECQL. We also demonstrate the extensibility of ECQL that it can be integrated with new type of query languages.
Tuesday October 29, 2024 12:30pm - 1:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

1:00pm EDT

Lunch
Tuesday October 29, 2024 1:00pm - 2:00pm EDT
Tuesday October 29, 2024 1:00pm - 2:00pm EDT
SEIber Café

2:30pm EDT

An Analysis of the Semantic Foundation of KerML and SysML
Tuesday October 29, 2024 2:30pm - 3:00pm EDT
In the last decades, Model-Based Systems Engineering (MBSE) has received significant attention, leading to standards such as SysML. SysML is due to a recent and radical update, breaking the dependence of its specification from UML, and leading to the development of two languages: KerML, which provides a top layer of general constructs, and SysML v2, which specializes KerML for systems engineering. In this paper, we analyze the formal and real-world semantics of the proposed KerML and SysML v2 specifications, and draw implications for their improvement and further development. Our attention is focused towards key constructs of the languages. We also examine the approach taken in the specifications to deal with dynamic aspects, which is inspired by the 'four-dimensionalist' view.
Tuesday October 29, 2024 2:30pm - 3:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

2:30pm EDT

Panel: Conceptual Modeling and Large Language Models
Tuesday October 29, 2024 2:30pm - 4:00pm EDT
Growth of LLMs affects software and information systems engineering practice in a way that many researchers consider disruptive. Is it just "yet another hype"? Or is it really a disruptive technology that will affect how modeling is used in those disciplines? Under this "LLM wave", there are many relevant ongoing discussions on how LLMs can influence systems engineering in general. But in the ER context, we want to emphasize in this discussion the conceptual modeling (CM) dimension, taking CM as the main application domain for LLMs, and the value that LLMs can bring to it.

This panel will discuss how LLMs can influence CM practice. In particular, we want to know CM experts' opinions on concrete CM questions such as (1) how using LLMs can affect the human cognition process, not acting anymore as the mediator between the conceptual model-based documentation and the engineering system, or (2) how a CM-based prompt engineering process can provide a new opportunity to make "conceptual model programming" feasible, or (3) how LLMs can shift the role of conceptual modeling practice in software engineering and the way CMs are involved in engineering processes, or (4) wondering if the CM community can already delegate abstraction and design tasks to LLM-based products.
Tuesday October 29, 2024 2:30pm - 4:00pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

3:00pm EDT

Portions of Matter and their Existential Events: An ontology-based conceptual model
Tuesday October 29, 2024 3:00pm - 3:30pm EDT
In this work, we propose a conceptual model for existential events that affect portions of matter. Portions of matter are entities that constitute most of the physical entities, for instance, water, oil, gold, and blood. Existential events create and destroy entities. The proposed conceptual model is intended to support the design of information systems that capture the dynamic changes of portions of matter in nature and industrial processes. First, we propose a taxonomy for Collections of Objects (the entities that constitute portions of matter) and define the phase transitions that create and terminate the circumstances for
a portion of matter to exist. Second, we define three existential events that affect portions of matter:
(i) Forming Stuff, in which an instance of portion of matter is created as a collection of objects become connected;
(ii) Terminating Stuff, in which an instance of portion of matter is terminated as a collection of granules become scattered;
(iii) Changing Stuff Type, where a quantity of one kind is created and a quantity of another kind is terminated.

Lastly, we present an application example that uses the condensation of a portion of water and its freezing into a portion of
ice to illustrate how to use the conceptual model in concrete cases.
Tuesday October 29, 2024 3:00pm - 3:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

3:30pm EDT

Model-Driven Design and Generation of Training Simulators for Reinforcement Learning
Tuesday October 29, 2024 3:30pm - 4:00pm EDT
Reinforcement learning (RL) is an important class of machine learning techniques, in which intelligent agents optimize their behavior by observing and evaluating the outcomes of their repeated interactions with their environment. A key to successfully engineering such agents is providing them the opportunity to engage in a large number of such interactions safely and at a low cost. This is often achieved through developing simulators of such interactions, in which the agents can be trained while also different training strategies and parameters can be explored. However, specifying and implementing such simulators can be a complex endeavor requiring a systematic process for capturing and analyzing both the goals and actions of the agents and the characteristics of the target environment. We propose a framework for model-driven goal-oriented development of RL simulation environments. The framework utilizes a set of extensions to a standard goal modeling notation that allows concise modeling of a large number of ways by which an intelligent agent can interact with its environment. Though subsequent formalization, the model can be used by a specially constructed simulation engine to simulate agent behavior, such that off-the-shelf RL algorithms can use it as a training environment. We present the extension of the goal modeling language, sketch its semantics, and show how models built with it can become executable.
Tuesday October 29, 2024 3:30pm - 4:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

4:00pm EDT

Afternoon Break
Tuesday October 29, 2024 4:00pm - 4:30pm EDT
Tuesday October 29, 2024 4:00pm - 4:30pm EDT
SEIber Café

5:00pm EDT

Generating Secure Workflow Designs from Requirements Goal Models Using Patterns
Tuesday October 29, 2024 5:00pm - 5:30pm EDT
Identifying and analyzing security requirements is an essential part of the information systems engineering lifecycle. Several techniques have been introduced for comprehensively modeling such requirements. Once identified, security requirements must be translated into designs that allow domain actors to securely accomplish business tasks under given risk assumptions and contexts. Correctly translating requirements to such designs, however, can be challenging when considering both the complexity and specialized nature of security mechanisms, such as cryptography, and the role of varying practical and contextual aspects of the problem at hand in correctly applying such mechanisms. We propose a model-driven pattern-based approach for supporting the implementation of security requirements. Security requirements models, augmented with descriptions of contextual and threat assumptions, are combined with reusable domain-agnostic workflow patterns which model established ways for securely performing common business tasks. The combined models are compiled into a formal specification, whereby automated reasoning is applied for generating domain-appropriate workflows that satisfy the security requirements. Using the technique, analysts can efficiently explore the impact of different threat assumptions and pragmatic constraints to candidate security designs, while ensuring that the latter are consistent with tried-and-tested community knowledge.
Tuesday October 29, 2024 5:00pm - 5:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

5:30pm EDT

Modeling and Reasoning about Explanation Requirements using Goal Models
Tuesday October 29, 2024 5:30pm - 6:00pm EDT
Tuesday October 29, 2024 5:30pm - 6:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

6:00pm EDT

Enhancing Domain Modeling with Pre-trained Large Language Models: An Automated Assistant for Domain Modelers
Tuesday October 29, 2024 6:00pm - 6:30pm EDT
Domain modeling involves creating abstract representations of information within a specific domain using techniques such as conceptual modeling and ontology engineering. Traditionally, manual creation and maintenance of domain models are labor intensive and require modeling expertise. This paper explores the automation of domain modeling using pre-trained large language models (LLMs), presenting an experimental LLM-based conceptual modeling assistant that collaborates with a human expert. The assistant provides modeling suggestions based on a given textual description of the domain of interest, aiding in the design of classes, attributes, and associations. We present a generic framework for domain modeling assistants that consists of class, attribute, and association generators, and show how they can be implemented using an LLM. We demonstrate a concrete configuration of this framework and its implementation in a prototype application. We evaluated the effectiveness of the framework configuration across various domains. Our findings indicate that the assistant significantly enhances the efficiency of modeling while maintaining reasonable quality of the outputs.
Tuesday October 29, 2024 6:00pm - 6:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

6:30pm EDT

Welcome Reception, SEI Building
Tuesday October 29, 2024 6:30pm - 8:00pm EDT
Join us for the ER 2024 welcome reception to start the week off right! Renew acquaintences and make new friends while also getting some delicious refreshments.
Tuesday October 29, 2024 6:30pm - 8:00pm EDT
SEIber Café
 
Wednesday, October 30
 

8:00am EDT

Breakfast Pastries and Coffee Served
Wednesday October 30, 2024 8:00am - 9:00am EDT
Wednesday October 30, 2024 8:00am - 9:00am EDT
SEIber Café

8:00am EDT

Registration Check-In Table Opens
Wednesday October 30, 2024 8:00am - 5:30pm EDT
The registration table will be staffed from 8:00 a.m. until sessions end each day. Please stop by to check in and pick up your name badge. If you have any questions, please feel free to stop by and ask our staff. 
Wednesday October 30, 2024 8:00am - 5:30pm EDT
SEI First Floor Lobby 4500 Fifth Avenue

9:00am EDT

Keynote: Wood Wide Models
Wednesday October 30, 2024 9:00am - 10:30am EDT
Foundation models are monolithic models that are trained on a broad set of data, and which are then in principle fine-tuned to various specific tasks. But they are ill-suited to many heterogeneous settings, for instance numeric tabular data, or numeric time-series data, where training a single monolithic model over a large collection of such datasets is not meaningful. For instance, why should numeric times series of stock prices have anything to do with time series comprising the vital signs of an ICU patient? For such settings, we propose the class of wood wide models.

The wood wide web is often used to describe an underground network of fungal threads that connect many trees and plants together, which stands in contrast to a large concrete foundation on top of which we might build specialized buildings. Analogously, in contrast to a single foundation model upon which one might build specialized models, we have many smaller wood wide models that all borrow subtler ingredients from each other. But to be able to share nutrients from the wood wide web, trees need a special root based architecture that can connect to these fungal threads. Accordingly, to operationalize wood wide models, we develop a novel neuro-symbolic architecture, that we term "neuro-causal", that uses a synthesis of deep neural models and causal graphical models to automatically infer higher level symbolic information from lower level "raw features", while also allowing for rich relationships among the symbolic variables. Neuro-causal models retain the flexibility of modern deep neural network architectures while simultaneously capturing statistical semantics such as identifiability and causality, which are important to discuss ideal, target representations and their tradeoffs. But most interestingly, these can further form a web of wood wide models when they borrow in part from a shared conceptual ontology, as well as causal mechanisms. We provide conditions under which this entire architecture can be recovered uniquely. We also discuss efficient algorithms and provide experiments illustrating the algorithms in practice.
Speakers
avatar for Pradeep Ravikumar

Pradeep Ravikumar

Professor, School of Computer Science, Carnegie Mellon University
Dr. Pradeep Ravikumar is a Professor in the Machine Learning Department, School of Computer Science at Carnegie Mellon University.He is a Sloan Fellow, a Siebel Scholar, a recipient of the NSF CAREER Award, and a co-editor-chief of the Journal of Machine Learning Research.Ravikumar... Read More →
Wednesday October 30, 2024 9:00am - 10:30am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

10:30am EDT

Morning Break
Wednesday October 30, 2024 10:30am - 11:00am EDT
Wednesday October 30, 2024 10:30am - 11:00am EDT
SEIber Café

11:00am EDT

How Are LLMs Used for Conceptual Modeling? An Exploratory Study on Interaction Behavior and User Perception
Wednesday October 30, 2024 11:00am - 11:30am EDT
Large Language Models (LLMs) have opened new opportunities in modeling in general, and conceptual modeling in particular. With their advanced reasoning capabilities, accessible through natural language interfaces, LLMs enable humans to deepen their understanding of different application domains and enhance their modeling skills. However, the open-ended nature of these interfaces results in diverse interaction behaviors, which may also affect the perceived usefulness of LLM-assisted conceptual modeling. Existing works focus on various quality metrics of LLM outcomes, yet limited attention is given to how users interact with LLMs for such modeling tasks. To address this gap, we present the design and findings of an empirical study conducted with information systems students. After labeling the interactions according to their intentions (e.g., Create Model, Discuss, or Present), and representing them as an event log, we applied process mining techniques to discover process models. These models vividly capture the interaction behaviors and reveal recurrent patterns. We explored the differences in interacting with two LLMs (GPT 4.0 and Code Llama) for two modeling tasks (use case and domain modeling) across three application domains. Additionally, we analyzed user perceptions regarding the usefulness and ease of use of LLM-assisted conceptual modeling.
Wednesday October 30, 2024 11:00am - 11:30am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

11:00am EDT

Extending the SensorThings API Data Model – Improving Interoperability and Use Case Flexibility in IoT
Wednesday October 30, 2024 11:00am - 11:30am EDT
This study presents an enhancement to the OGC SensorThings API Data Model tailored for Internet of Things (IoT) environments, demonstrated with a Smart Farming application enhancement. The designed data model addresses critical challenges faced in real-world settings like industrial environments, adhering to the FAIR principles of Findability, Accessibility, and, in particular, Interoperability and Reusability.

Beyond the practical use case of crop monitoring, we offer a conceptual framework for future projects across various domains. The resulting architecture demonstrates how modular components improve adaptability and extendibility through standardization and interoperability. This modular approach decouples modules such as device management from data storage, ensuring consistent data handling and supporting the integration and maintenance of diverse and evolving applications.

The iterative development and evaluation process underlines the solution's effectiveness in managing IoT environments in practice. The findings highlight the potential for these extensible modules to be applied in other contexts, promoting a standardized yet flexible approach to IoT data management, supporting effective database design, and deriving best practices and design guidance for future projects. Additionally, our models approach IoT heterogeneity and interoperability, demonstrating clear advantages in modularity and standardized data handling, essential for managing complex, real-world IoT deployments in Smart Farming.
Wednesday October 30, 2024 11:00am - 11:30am EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

11:30am EDT

Small, Medium, and Large Language Models for Text-to-SQL
Wednesday October 30, 2024 11:30am - 12:00pm EDT
This paper investigates how the model size affects the ability of a Generative AI Language Model, or briefly a GLM, to support the text-to-SQL task for databases with large schemas typical of real-world applications. The paper first introduces a text-to-SQL framework that combines Prompt engineering and a Retrieval-Augmented generation (RAG) technique, leaving as flexibilization points the GLM and the database. Then, it describes a benchmark based on an open-source database with a large schema, with a complexity similar to real-world databases. The paper proceeds with experiments to assess the performance of the text-to-SQL framework instantiated with the benchmark database and GLMs of different sizes. The paper concludes with recommendations to help select which GLM size is appropriate for a text-to-SQL scenario, characterized by the difficulty of the expected NL questions and the data privacy requirements, among other characteristics.
Wednesday October 30, 2024 11:30am - 12:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

11:30am EDT

A conceptual approach to using relevant patterns in genomic data analysis
Wednesday October 30, 2024 11:30am - 12:00pm EDT
Several models have been proposed to represent human genomic information. An interesting approach for supporting genomic applications for health consists of a two-layer representation, where high-level concepts, describing distinct aspects of the human genome at an abstract level, are mapped to data, which represent actual physical measurements of reality. The two-layer approach allows users to formulate high-level queries on the concepts and map them onto real datasets. The approach is extensible: new conceptual views, each corresponding to a given genomic feature, can be mapped to the lower data layers without impacting on previous mappings.
We here present how concept layers and data layers can be composed into patterns corresponding to classic genomic studies: diseases with case-control comparisons, multi-omic representations for the same patients, and comparisons within families for rare genetic diseases. We show that these patterns effectively support genomic data users (i.e., clinicians, geneticists, and bioinformaticians) in genomic analysis practices.
Wednesday October 30, 2024 11:30am - 12:00pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

12:00pm EDT

Establishing Traceability between Natural Language Requirements and Software Artifacts by Combining RAG and LLMs
Wednesday October 30, 2024 12:00pm - 12:30pm EDT
Software Engineering research aims to enable the effective transformation of stakeholders' requirements involved in the software development life cycle into code such that purpose of the code is to fulfil the requirements. Crucial software maintenance and evolution tasks such as change impact and software reusability require an explicit linkage between the requirements and their code manifestation. However, manually creating such links for large codebases is a prohibitive task. Furthermore, the requirements are given in natural language (NL) and therefore contain natural language semantics that capture the purpose of the corresponding codebase. Existing traceability approaches rely on utilizing textual similarities between requirements and code to establish links, however, such approaches tend to achieve low precision due to their inability to bridge the semantic gap between high-level natural language requirements and syntactic code. With the advent of Large Language Models (LLMs), their sophisticated capabilities to comprehend both natural language and code syntax offer a transformative potential for bridging this semantic gap. LLMs' ability to understand the underlying intent of NL requirements and their corresponding code snippets can significantly enhance the precision of traceability links. By representing code as a knowledge graph, this framework can also leverage graph structural information to further refine and strengthen traceability. Therefore, in this paper, we present an LLM-supported retrieval augmented generation based framework for traceability that supports vector, graph, and hybrid (combined) indexing. We provide a comparative evaluation with an existing approach and between different indexing techniques. Finally, we discuss the results of our evaluation, showcasing how our method improves the efficiency and accuracy of establishing traceability links in software development.
Wednesday October 30, 2024 12:00pm - 12:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

12:00pm EDT

SISO: A Conceptual Model-based Method for Variant Interpretation Systematization
Wednesday October 30, 2024 12:00pm - 12:30pm EDT
Variant interpretation is the process by which clinical experts determine if a DNA variant has a significant impact on a patient's health. Current practices in variant interpretation suffer from a lack of traceability and reproducibility due to the chaotic nature of genomic data and the imprecision of existing variant interpretation guidelines. These issues pose substantial barriers to the routine clinical application of variant interpretation. This paper introduces SISO, a conceptual model-based method designed to translate the inherent imprecision of variant interpretation into a concise and well-defined set of steps. The practical utility of the SISO method is demonstrated through a use case involving a variant identified in a patient with suspected familial breast-ovarian cancer syndrome. The SISO method lays the foundations for variant interpretation systematization by guiding decision-making and ensuring reproducibility. As a result, variant interpretation will be a more reliable and consistent process in clinical practice.
Wednesday October 30, 2024 12:00pm - 12:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

12:30pm EDT

Lunch
Wednesday October 30, 2024 12:30pm - 2:00pm EDT
Wednesday October 30, 2024 12:30pm - 2:00pm EDT
SEIber Café

2:00pm EDT

Keynote Panel Theme: Conceptual Modeling, AI & Beyond
Wednesday October 30, 2024 2:00pm - 3:30pm EDT
Keynotes
avatar for Nicola Guarino

Nicola Guarino

Retired Research Associate, Institute of Cognitive Sciences and Technologies of the Italian National Research Council (ISTC-CNR)
Nicola Guarino is a retired research associate at the Institute of Cognitive Sciences and Technologies of the Italian National Research Council (ISTC-CNR), and former director of the ISTC-CNR Laboratory for Applied Ontology (LOA) based in Trento.He has been playing a leading role... Read More →
avatar for Paul Nielsen

Paul Nielsen

Director and CEO, Carnegie Mellon University Software Engineering Institute
Dr. Paul D. Nielsen is the Director and Chief Executive Officer of the Carnegie Mellon University Software Engineering Institute (CMU SEI), a U.S. Department of Defense federally funded research and development center (FFRDC). The SEI is a global leader in advancing software engineering... Read More →
avatar for Pradeep Ravikumar

Pradeep Ravikumar

Professor, School of Computer Science, Carnegie Mellon University
Dr. Pradeep Ravikumar is a Professor in the Machine Learning Department, School of Computer Science at Carnegie Mellon University.He is a Sloan Fellow, a Siebel Scholar, a recipient of the NSF CAREER Award, and a co-editor-chief of the Journal of Machine Learning Research.Ravikumar... Read More →
Wednesday October 30, 2024 2:00pm - 3:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

3:30pm EDT

Afternoon Break
Wednesday October 30, 2024 3:30pm - 4:00pm EDT
Wednesday October 30, 2024 3:30pm - 4:00pm EDT
SEIber Café

4:00pm EDT

Conceptual Framework for Designing Hippocratic APIs
Wednesday October 30, 2024 4:00pm - 4:30pm EDT
The rapid proliferation of Application Programming Interface (API) enhances data exchange but introduces significant privacy and security risks. This issue is particularly acute in the Internet of Things (IoT), where APIs interact with numerous devices, often lacking adequate mechanisms for managing privacy and security on the API and client’s side, leading to increased vulnerabilities. Hippocratic databases provide mechanisms, e.g., role-based access, to control access to the databases. However, to effectively manage data access to the Hippocratic database, proper design of API is needed. This paper proposes a conceptual framework for Hippocratic API (HAPI) inspired by the Hippocratic Oath and GDPR. Our framework revises traditional API designs using AI to integrate ethical considerations, aiming to protect data subjects’ rights and enhance security. By embedding data protection and ethical standards into API operations, HAPIs rectify inadequacies in consent mechanisms and mitigate privacy risks. We leverage an ontology-based approach to identify and categorize non-functional requirements (NFRs) and employ design techniques found through extensive research of literature, books, and tutorials. This methodology, informed by GDPR, ISO/IEC 27001, and Hippocratic databases, employs the Unified Foundational Ontology (UFO) framework to model goals, intentions, and qualities, ensuring ethical principles are intrinsic to the system’s design. We hope HAPIs will inspire further research and encourage both industry and academic communities to refine and expand upon our design, fostering a future where technological development is inherently aligned with the ethical management of user data.
Wednesday October 30, 2024 4:00pm - 4:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

4:00pm EDT

ER Poster Sessions
Wednesday October 30, 2024 4:00pm - 5:30pm EDT
Authors of accepted posters will have five minutes to present their poster and approximately five minutes for questions. The accepted posters are as follows (schedule to be determined):

An Architecture for Integrating Large Language Models into Metamodeling Platforms: The Example of MM-AR
Gunakar Challa, Aya Gartini, Fabian Muff and Hans-Georg Fill

Smart Model-Driven Engineering to Improve the Music Valuation
Giovanni Giachetti, Daniel Catalá-Pérez, Blanca De-Miguel-Molina, Conrado Carrascosa, María de Miguel, Jesús Carreño and Oscar Pastor

Supporting Safety Assessment in Human-Robot Collaboration using Process Models
Philipp Kranz, Shaza Elbishbishy, Jeshwitha Jesus Raja and Marian Daun

An Ontological Approach to Breast Cancer Screening: Risk Assessment and Personalized Testing Recommendations
Bruno Szilagyi, Edelweis Rohrer, Yasmine Anchén and Regina Motz

Utilize the PROVE Tool to Evaluate Ontological Modeling
Asiyah Lin and Avi Shaked

OWL2Gen: Towards a Configurable Ontology Generator for Benchmarking
Gunjan Singh, Ashwat Kumar, Sumit Bhatia and Raghava Mutharaju






Wednesday October 30, 2024 4:00pm - 5:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

4:30pm EDT

SAQI: An Ontology based Knowledge Graph Platform for Social Air Quality Index
Wednesday October 30, 2024 4:30pm - 5:00pm EDT
Air Quality Index (AQI) is a number aggregated from several air quality sensors deployed in an area. AQI is useful in communicating the air quality to the general public and in making governance decisions to tackle air pollution. However, our ethnographic surveys revealed the existence of a knowledge barrier in interpreting the AQI and data illiteracy in understanding AQI-related charts and trends commonly facilitated by government organizations. This knowledge gap is wider for the marginalized sections of society, who, it turns out, are more exposed to pollution. We use an ontological approach to homogenize the air quality data with social and spatial aspects. The Social Air Quality Index (SAQI) ontology integrates the data from local and central air quality monitoring sensors, meteorological data, and field surveys. This data is converted into a Knowledge Graph, which is used to build an application for civic engagement with the public on pollution to improve community participation of the local institutions and individuals. We evaluated this application through a user survey and received positive feedback. The ontologies, code, and datasets are available under the Apache 2.0 License at https://github.com/kracr/aq-structured-platform.
Wednesday October 30, 2024 4:30pm - 5:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

5:00pm EDT

GenACT: An Ontology based Temporal Web Data Generator
Wednesday October 30, 2024 5:00pm - 5:30pm EDT
Time is a fundamental concept in data processing. The growth of social media (SM) data and the rise of the Internet of Things (IoT) underscores the necessity for studying temporal data on the Web. However, accessing realistic temporal data poses significant challenges across data collection, knowledge representation, and real-time provisioning, with no comprehensive solution available yet. To tackle these challenges, we introduce GenACT, a novel data generator rooted in the dynamics of Academic Conference Tweets (ACT), which serves as an ideal domain for eliciting application scenarios spanning temporality, dynamicity, and timeliness. The foundation of GenACT is a domain-specific ontology crafted to conceptualize tweets around an Academic Conference Event (ACE) realistically. The ACE ontology is available in all four OWL 2 profiles. Additionally, RDF instantiation allows for real-time simulation of ongoing academic discussions on Twitter. GenACT stands out for its ability to configure different data segments using SPARQL-based partitioning strategies. This versatility makes it adaptable to various analytical tasks, enabling researchers to focus on specific aspects of the data for their studies. GenACT is designed to seamlessly provide temporal and static data in a streaming format, tailored specifically for applications in studying knowledge graph evolution, temporal reasoning, and stream reasoning. The ontology, code, and documentation are available under the Apache 2.0 License at \url{https://anonymous.4open.science/r/genACT/}.
Wednesday October 30, 2024 5:00pm - 5:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

5:30pm EDT

Bus Boarding and Departure for Conference Dinner Cruise
Wednesday October 30, 2024 5:30pm - 5:45pm EDT
Board buses at the SEI to go to the Conference Dinner on the Gateway Clipper River Cruise. Boarding takes place from 5:30-5:45 p.m. at the SEI Lobby gathering point. Watch for signage.
Wednesday October 30, 2024 5:30pm - 5:45pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

6:15pm EDT

Conference Dinner Cruise on the Gateway Clipper River Boat
Wednesday October 30, 2024 6:15pm - 10:45pm EDT
Join us for a fun cruise up and down Pittsburgh's three rivers while networking with your fellow registrants. The cruise will start at 6:15 p.m. when the buses arrive from the SEI and end at approximately 9:30 p.m. Then we'll re-board the buses to travel back to the SEI at the end of the evening.
Wednesday October 30, 2024 6:15pm - 10:45pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213
 
Thursday, October 31
 

8:00am EDT

Breakfast Pastries and Coffee Served
Thursday October 31, 2024 8:00am - 9:00am EDT
Thursday October 31, 2024 8:00am - 9:00am EDT
SEIber Café

8:00am EDT

Registration Check-In Table Opens
Thursday October 31, 2024 8:00am - 3:00pm EDT
The registration table will be staffed from 8:00 a.m. until sessions end each day. Please stop by to check in and pick up your name badge. If you have any questions, please feel free to stop by and ask our staff. 
Thursday October 31, 2024 8:00am - 3:00pm EDT
SEI First Floor Lobby 4500 Fifth Avenue

9:00am EDT

Keynote: New Technology, Same Basics
Thursday October 31, 2024 9:00am - 10:30am EDT
Who hasn't dabbled with popular Artificial Intelligence (AI) Large Language Models such as Gemini or ChatGPT, either as part of a project or just to see what it might produce? The popularity and idea that AI can transform human effort is everywhere in our scientific and industrial communities. AI clearly is a growing business driver.

And AI has value for conceptional modelling. For instance, natural language processing can help in identifying the relational data of entities and generative AI and inverse relationship approaches can help create deeply insightful entity relationship (ER) diagrams and models for the systems. To get the usable, reliable AI systems people want, designers and developers need to learn how to repeatably build sound AI systems. With those systems, we will see improvements in areas such as modeling (e.g., digital twins for smart cyber-physical systems), security engineering to defend large-scale systems, and the use of modern software practices (e.g., in developing threat models for DevSecOps).

Building sound AI systems repeatably, we need to pursue these system qualities: robust, secure, scalable, and human centered. Those quality attributes form the basis for forming an AI Engineering discipline. There is much work to do
to create and implement the AI Engineering discipline in areas such as making a commitment to moving R&D in practice to get ahead of the galloping use of AI tools, recognition that we cannot abandon good practices such as practicing cyber hygiene and the use of modern software engineering practices, and the creation of a new concept of the digital-engineering-aware workforce able to interact effectively with AI models.
Speakers
avatar for Paul Nielsen

Paul Nielsen

Director and CEO, Carnegie Mellon University Software Engineering Institute
Dr. Paul D. Nielsen is the Director and Chief Executive Officer of the Carnegie Mellon University Software Engineering Institute (CMU SEI), a U.S. Department of Defense federally funded research and development center (FFRDC). The SEI is a global leader in advancing software engineering... Read More →
Thursday October 31, 2024 9:00am - 10:30am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

10:30am EDT

Morning Break
Thursday October 31, 2024 10:30am - 11:00am EDT
Thursday October 31, 2024 10:30am - 11:00am EDT
SEIber Café

11:00am EDT

Enhancing BERT Performance: Multi-Teacher Adversarial Distillation with Clean and Robust Guidance
Thursday October 31, 2024 11:00am - 11:30am EDT
The BERT model has slow inference speed due to the massive parameters, and it is susceptible to adversarial attacks due to lack of robustness. Knowledge distillation tackles the first issue by compressing the model, while adversarial training resolves the second issue by incorporating disturbances into training. However, existing knowledge distillation methods often struggle to balance between model accuracy and robustness. To address this, we propose a multi-teacher adversarial distillation approach for the BERT model. Our approach employs a clean teacher, providing soft labels to guide the student model in learning the clean samples, and a robust teacher, which employs adversarial training and semi-supervised learning to guide the student model in learning adversarial samples. Additionally, we introduce adaptive loss weights, which enable the student model to focus on the more challenging knowledge during training process. Through experiments with various adversarial attack methods, we validate the effectiveness of our proposed approach. On the typical text classification dataset IMDB, we improve the robust accuracy of the student DistilBERT model from 19.78 to 35.17, while maintaining a high clean accuracy of 93.17.
Thursday October 31, 2024 11:00am - 11:30am EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

11:00am EDT

Towards a Conceptual Model for FAIR Metadata Schemas
Thursday October 31, 2024 11:00am - 11:30am EDT
This paper explores the design and creation of metadata schemas based on the FAIR Data Principles. We provide a clear interpretation of these principles, focusing on how they apply to metadata schemas. Leveraging the OntoUML language, we developed a conceptual model that explains the key components of a FAIR-compliant metadata schema. Through detailed discussion and provision of examples for each model component, this work aims to help metadata designers and curators better understand how to incorporate the FAIR Data principles into their schemas.
Thursday October 31, 2024 11:00am - 11:30am EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

11:30am EDT

On the Task-specific Effects of Fragmentation in Modular Process Models
Thursday October 31, 2024 11:30am - 12:00pm EDT
Modularization has been extensively investigated for its role in enhancing the comprehension of process models by dividing them into smaller, self-contained and manageable modules. However, existing studies have reported inconclusive results, prompting further exploration into when modularization supports or impedes process comprehension. Among the key factors suggested to influence the effect of modularization is the type of the task at hand. Indeed, the fragmentation of information across several modules caused by modularization can challenge readers' comprehension of process models, especially if the task is not confined to a single module. While the effect of fragmentation has been explored for flow-based tasks, requiring understanding procedural aspects of process models, our work extends to circumstantial tasks, which instead demand comprehending model rules and constraints that rely on the broader context associated with specific process parts. Using eye-tracking, we investigate how the fragmentation of information caused by modularization impacts cognitive integration (i.e., the collection and synthesis of dispersed information) and its subsequent effect on cognitive load and model comprehension. Our findings demonstrate that fragmentation significantly affects cognitive integration in flow-based tasks only. Moreover, our results show that cognitive integration is linked to both cognitive load and readers' comprehension of process models. The outcome of this work provides a detailed model, explaining the task-specific effects of fragmentation. Practically, our findings highlight the need for modeling tools that dynamically adjust their user interface} to reduce cognitive integration demands, thereby enhancing model comprehension.
Thursday October 31, 2024 11:30am - 12:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

11:30am EDT

A Model-driven Approach to Enhance Data Visualization through Domain Knowledge Integration
Thursday October 31, 2024 11:30am - 12:00pm EDT
Big Data is challenging analytical contexts, namely when aligning data and analytical requirements. While the capacity to collect and store new data is expanding rapidly, the pace at which it can be analyzed is developing more slowly. Defining these analytical requirements and selecting the most appropriate visualizations often depends on an in-depth understanding of what users need from the data. To address this problem, this paper proposes an assisted model-driven analytics approach to support visualization, taking domain knowledge and data as input. It allows the user to be guided in the mapping between domain concepts and available data, as well as in the translation of domain questions into analytical tasks that can be supported by useful visualizations for decision support. The approach is supported by a Meta-Model that formalizes concepts needed to answer three fundamental questions, what, why and how. This Meta-Model contextualizes the data, the analytical tasks and the supporting visualizations. The applicability of the proposal is shown through a demonstration case focused on the genome domain. The results highlight how useful visualizations are derived from the specified domain questions.
Thursday October 31, 2024 11:30am - 12:00pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

12:00pm EDT

A Universal Prompting Strategy for Extracting Process Model Information from Natural Language Text using Large Language Models
Thursday October 31, 2024 12:00pm - 12:30pm EDT
Over the past decade, extensive research efforts have been dedicated to the extraction of information from textual process descriptions. Despite the remarkable progress witnessed in natural language processing (NLP), information extraction within the Business Process Management domain remains predominantly reliant on rule-based systems and machine learning methodologies. A key reason for this is the data scarcity in the domain, which has so far prevented the successful application of deep learning techniques. However, rapid advancements in generative large language models (LLMs) allow solving many NLP tasks with very high quality, with only a few examples (few-shot), or even just a description of the problem (zero-shot).Therefore, we systematically investigate the potential of LLMs to solve the task of information extraction from textual process descriptions, targeting the detection of process elements, such as activities and actors, and relations between them. Based on a novel prompting strategy, we show that LLMs are able to outperformstate-of-the-art machine learning approaches with absolute performance improvements of up to 8\% $F_1$ score across three different datasets.We evaluate our novel prompting strategy on eight different LLMs, showingit is universally applicable, while also analyzing the impact of certain promptparts on extraction quality. The number of example texts, the specificity of definitions, and the rigour of format instructions are identified as key for improving the accuracy of extracted information. Our code, prompts, and data is available at https://anonymous.4open.science/r/llm-process-generation-2140/README.md and our model generation proof-of-concept at https://anonymous.4open.science/r/pet-to-bpmn-poc-B465/README.md.
Thursday October 31, 2024 12:00pm - 12:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

12:00pm EDT

Integrating Business and Process Analytics for Enhanced Data-Driven Decision-Making
Thursday October 31, 2024 12:00pm - 12:30pm EDT
By integrating Business Analytics and Process Analytics, organizations can gain a deeper understanding of the relationship between process inefficiencies and business outcomes, leading to improved data-driven decision-making. This integration, however, is often overlooked, with limited methodological guidance for systematically combining these two analytical domains. Motivated by this challenge, this paper proposes a methodological approach for the identification of analytical visualizations that allow the aforementioned integration to be achieved systematically. The proposed approach is tested in a running example, which includes an instantiation of a Data Warehouse system for supporting data integration and analysis for business process analytics.
Thursday October 31, 2024 12:00pm - 12:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

12:30pm EDT

Agent System Event Data: Concepts, Dimensions, Applications
Thursday October 31, 2024 12:30pm - 1:00pm EDT
Event data is a collection of recorded events that capture performed actions and observed states of business processes supported by information systems. It describes the times of event occurrences, event types, event attributes, and process cases of events identified by one or more objects the events relate to. Process mining uses event data to analyze and improve the processes in organizations. These processes are often performed by actors or agents, such as employees, resources, and systems, in different roles within organizations. In this paper, we present Agent System Event Data (ASED), a new type of event data that describes business processes as interactions of agents. ASED provides a new scope for analyzing individual agents involved in multiple processes, interactions of agents, and systems of agents that enact the processes. We formalize ASED as a conceptual data model, discuss its dimensional data modeling aspects, and argue that event data, in general, benefits from dimensional representation. We review existing event data types and discuss the complementary nature of existing models and ASED. Finally, we validate ASED by demonstrating its ability to express existing business process compliance rules, significantly expanding the scope of compliance analysis addressed by existing data models.
Thursday October 31, 2024 12:30pm - 1:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

12:30pm EDT

The Ontology of Fields as a Foundation for Conceptual Modeling
Thursday October 31, 2024 12:30pm - 1:00pm EDT
As a discipline matures, its research communities often need to reflect on the epistemological underpinnings of their research and discuss how the discipline can, and should, advance. This type of reflection is often found in the natural sciences, where scientists question the nature of their fields, the standards of rigor applied, as well as the research impact. As Systems Analysis and Design (SAND) researchers, we should continuously reflect on and discuss how to conduct SAND research to stay relevant and in synch with ongoing advances in information technologies.
Among more recent changes in the advances in information technologies is the rise of large systems with ill-defined boundaries. Platforms such as Google and Facebook exhibit fuzzy, ill-defined boundaries because they permeate multiple aspects of digital and real-world interactions, making it challenging to delineate where their influence starts and ends. These platforms extend beyond their primary functions—search and social networking—into areas like advertising, data analytics, e-commerce, and even artificial intelligence, creating a vast interconnected ecosystem. Their services often integrate with third-party applications and websites, further blurring the lines of their operational scope. Additionally, their user base spans global demographics, intertwining with various cultural, social, and economic spheres, complicating the identification of distinct boundaries. Accurately modeling these systems and shaping the progression of their designs and their impacts is becoming increasingly complex and difficult.
Traditionally, SAND subscribes to the ontology of individual substances, wherein the reality is made of stand-alone substances, which can be referred to as things, objects, individuals, or entities (Benovsky, 2008; Bunge, 1977; Harman, 2018). These substances commonly have properties or attributes and undergo change, resulting in events and processes. This view is epitomized by Bunge’s ontology that stipulates that the world is “made up of things” that are substantial individuals (Bunge, 1977).
To better understand and design systems with fuzzy boundaries, such as Facebook or Google, we suggest exploring alternative ontological approaches. The traditional ontology of individual substances, while valuable in many contexts, may face limitations when applied to these complex, interconnected systems. Interestingly, similar challenges have been observed in philosophy and natural sciences. Following advances in physics, the claim emerged that “there are no particles, there are only fields” (Hobson, 2013, p. 211; emphasis added). This conclusion was drawn as a result of the accumulation of evidence from diverse scientific disciplines. Unfortunately, unlike many famous ontologies of substance, no established ontology of fields has been developed, with this being a stated goal in modern philosophy (Peuquet et al., 1998). We synthesize the ideas proposed as part of the debate on the ontology of fields and propose foundational concepts of ontology of fields as foundations for SAND scholarship.
The concept of a field is pivotal in both modern science and philosophy, serving as a useful model to understand reality. By adopting the scientific notion of a field, we can broadly conceptualize various aspects of existence. This approach helps to relate the concept of the field to other essential ideas in SAND, such as objects and processes. In this context, a field is defined as any physical or conceptual entity that exhibits different values across space and time, resulting from the oscillations that constitute and sustain these fields.
Fields can be understood by examining the properties at individual points within them. For example, a specific point in the sky modeled as a field may have properties like air temperature, humidity, and chemical composition. Many fields possess multiple properties, which can be represented in a hyperplane— a conceptual space with numerous dimensions corresponding to different property types. The variations in these properties manifest as modeling patterns such as peaks, plateaus, and valleys, where peaks often indicate regions with extreme concentrations of mass, energy, or charge.
The boundaries of fields are typically ambiguous and difficult to measure, often leading to challenges in defining where a field begins or ends. These fuzzy boundaries are a common issue in identifying the limits of natural features like mountains, neighborhoods, or even countries. Determining these boundaries often relies on conventions, which can lead to debates, such as whether Mount Elbrus belongs to Europe or Asia. Fuzzy boundaries can offer explanations for complex phenomena, like the wave-particle duality of light, where particles are viewed as quantized peaks within a field. We have these same types of fuzzy boundaries in our SAND efforts.
By incorporating ideas from physics and geography, ontology of fields offers fresh perspectives on understanding the diverse range of information systems today. The ontology of fields provides a more flexible approach to SAND than was previously possible. This new approach allows for the description of information systems that can be built for a single individual (object) compared to those that are built for many people in (a field). Field-like aspects of information systems may have fuzzy boundaries. Previously, more traditional frameworks (see Siau et al., 2022) are less suitable for the current nature of information systems that are present in today’s society (e.g., intelligent transportation systems).
While the ontology of fields will supplement the existing frameworks, approaches, and ontology of SAND, it does require new ways of thinking about and designing systems. It provides new vocabulary and conceptual tools for understanding and designing modern information systems. Current and future technologies will benefit from the ontology of fields to provide a comprehensive understanding of their composition.
Applying field theory to information systems design encourages a holistic view, considering the entire ecosystem rather than primarily isolated components. Describing systems as fields with characteristics such as peaks and valleys (of mountains) helps in understanding and managing their nature, providing a new lens for various types of engagement.
Many modern applications are inherently field-like, with fuzzy boundaries, multiple stakeholders, and public-private partnerships that blur traditional organizational lines. Viewing these systems through the lens of a field ontology acknowledges their interconnected nature. The ontology of fields is intended to additionally address the need for flexible representations.
This ER Forum submission presents a research endeavor (i.e., using an ontology of fields for SAND). Although still in its early stages, it could potentially develop into a major stream for SAND and conceptual modeling research and perhaps serve as a foundation for impactful research. Future research will expand and apply the ontology of fields to demonstrate its feasibility and effectiveness.
Thursday October 31, 2024 12:30pm - 1:00pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

1:00pm EDT

Lunch
Thursday October 31, 2024 1:00pm - 2:00pm EDT
Thursday October 31, 2024 1:00pm - 2:00pm EDT
SEIber Café

2:00pm EDT

The Role-Artifact-Function Framework for Understanding Digital Identity Models
Thursday October 31, 2024 2:00pm - 2:30pm EDT
Identity and Access Management (IAM) is a crucial aspect of online interactions, and researchers and practitioners have been exploring this area since the early days of the Internet. Over the years, many digital identity protocols and standards were created. While OAuth 2.0 and OpenID Connect (OIDC) are currently the most widely used and mature protocols, the emerging concept of Self-Sovereign Identity (SSI) and its underlying protocols promise greater user privacy and control. Although much emphasis has been placed on new protocols and implementations, the ontological and formal understanding of digital identity models still needs to be addressed. In this paper, we make three contributions: (i) introduce the Role-Artifact-Function (RAF) framework, which comprises a meta-metamodel and a metamodel; (ii) describe and compare digital identity models through the instantiation of the RAF metamodel; and (iii) discuss and arrive at intriguing conclusions, such as the conceptual equivalence between the well-established family of x509-based protocols and the nascent technologies behind SSI.
Thursday October 31, 2024 2:00pm - 2:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

2:00pm EDT

Managing Trade-offs in the Nested Iterative Cycles of Responsible AI
Thursday October 31, 2024 2:00pm - 2:30pm EDT
Dealing with iterative cycles in machine learning (ML) development presents several challenges because decisions made in one cycle can have effects on subsequent cycles. Goal-oriented conceptual modeling can be used to identify and analyze conflict areas between design decision points by way of refining goals, the alternative tasks that can achieve those goals, and softgoals which those tasks contribute to. Decision-making for ML development involves complex tradeoffs across various iterative cycles, involving conflicts and tensions among business, technical, and Responsible AI goals. Each of these iterative cycles consist of evaluation results that vary within each stage and from stage to stage. This paper investigates the distribution of Responsible AI decisions and tradeoffs along different ML iterative cycles and how they interact with each other. We propose three goal modeling constructs for processes with iterative decision-making that are contingent on observed intermediate results, such as those in ML development: Sensors, Actuators, and Iterative Loops. The goal modeling presented in this paper supports design reasoning about where to place Sensors and Actuators in nested iterative cycles, and to consider which decision criteria are applicable at which level of nesting.
Thursday October 31, 2024 2:00pm - 2:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

2:30pm EDT

Ontological Foundations of Resilience
Thursday October 31, 2024 2:30pm - 3:00pm EDT
Amid rising global challenges, resilience is attracting increased interest across various disciplines. However, significant ambiguities, vagueness, and variations in its definitions present notable obstacles to interdisciplinary communication and practical application. These semantic issues negatively impact not only the applications that rely on these definitions but also undermine the integrity of the conceptual models built from them, rendering them unable to ensure interoperability and clarity for their intended users. Recognizing the need for a robust and clear definition of resilience, this work addresses the complexity inherent in the concept by performing an ontological analysis using the Unified Foundational Ontology (UFO) and creating a sound core ontology model with OntoUML, UFO’s related conceptual modeling language. This research unfolds the conceptualization of resilience by investigating its fundamental categories and related concepts. We explore essential aspects of resilience, examining whether it preexists disturbances or is developed in response to them. We also identify its relational dependencies, establish how it is actualized, and determine the circumstances under which it becomes perceptible. Following our analysis of the ontological nature of resilience, the paper establishes an ontologically well-founded definition of this concept. To illustrate the practical application of our theoretical findings, we present a specific case in the field of production management.
Thursday October 31, 2024 2:30pm - 3:00pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

2:30pm EDT

A Framework for Conceptual Model Augmented Generative Artificial Intelligence
Thursday October 31, 2024 2:30pm - 3:00pm EDT
The advent of generative artificial intelligence, and in particular large language models, has opened up new possibilities for information processing in a multitude of domains. Nevertheless, it is essential to validate their output in order to ensure its validity within the specified context. This is due to their nature as probabilistic models of language, which may lead to the generation of inaccuracies or non-existent facts commonly known as hallucinations. As a solution, we propose a framework and a prompt structure for the validation of the results of generative artificial intelligence in formats that are more human-comprehensible through the use of conceptual models. We denote this as conceptual model augmented generative artificial intelligence (CMAG). We illustrate the approach through application examples in the domains of data management, knowledge graphs and cultural heritage, and software engineering.
Thursday October 31, 2024 2:30pm - 3:00pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

3:00pm EDT

Conceptual modelling method for digital twins
Thursday October 31, 2024 3:00pm - 3:30pm EDT
Effective conceptual modelling is critical for accurately representing the physical world in digital twin implementations, yet existing methods often lack comprehensive guidance or overlook key aspects like communication modelling. This research proposes the Entity-Relationship Digital Twin (ERDT) modelling method tailored for industrial digital twins. Building on established Entity-Relationship (ER) models, ERDT extends them with constructs for handling historical data, and defining security interfaces and data flows between physical and virtual components. The method provides a systematic process covering digital twin objective setting, physical component selection, conceptual entity definition, interface specification, and communication modelling. It emphasizes principles like encapsulation and separation of concerns to enhance maintainability. Evaluated against ISO 25010 quality criteria, ERDT demonstrates strengths in functional appropriateness, usability, portability, and maintainability while identifying opportunities for further improvement. A logistics case study illustrates applying the method to a real-world industrial scenario. By bridging conceptual modelling with software engineering best practices, ERDT facilitates developing robust, maintainable digital twins.
Thursday October 31, 2024 3:00pm - 3:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213

3:00pm EDT

Comparison of Dependencies for Human-Robot Interaction Types
Thursday October 31, 2024 3:00pm - 3:30pm EDT
Typically, four types of human-robot interaction are distinguished: Coexistence, Synchronization, Cooperation, and Collaboration. They differ in the degree of interaction between human and robot and have therefore high impact on safety requirements for a system. Leading to human-robot cooperation and collaboration systems rarely being introduced in industrial practice due to the high risks associated with them. An underlying problem for this, is the lack of understanding for the differences and necessities of these interaction types on a conceptual level. In this paper, we investigate the differences between the four human-robot interaction types with respect to the dependencies between human and robot. Although in all interaction types humans and robots depend on each other, we show that these dependencies are very different in nature. For future use cases, this analysis helps in binding safety risks to concrete properties of the human-robot dependencies.
Thursday October 31, 2024 3:00pm - 3:30pm EDT
SEI Training Room 1201 4500 Fifth Avenue, Pittsburgh, PA 15213

3:30pm EDT

Afternoon Break
Thursday October 31, 2024 3:30pm - 4:00pm EDT
Thursday October 31, 2024 3:30pm - 4:00pm EDT
SEIber Café

4:00pm EDT

Closing
Thursday October 31, 2024 4:00pm - 4:30pm EDT
Thursday October 31, 2024 4:00pm - 4:30pm EDT
SEI Jordan Auditorium 4500 Fifth Avenue, Pittsburgh, PA 15213