What am I currently working on?
Master thesis about "Developing spatial knowledge graphs for enhanced visualization and analysis of supply chains:
A geoinformatics approach towards sustainability and resilience"
Reduction of data silos
Using knowledge graphs to comprehensively map dependencies in supply chains and environmental risks and to take a holistic approach to avoiding data silos for an overall picture of supply chain risks.
Proactive dialogue with companies
For my work, I am super happy that deuter and bluesign have agreed to work with me alongside esri as a technology partner to get a realistic insight into the real world of supply chains, due diligence and challenges. Also working with WWF on georisks in supply chains.
Battling the jungle of supply chain data
What supply chain data currently exists, especially at the spatial scale? Inclusion of external and internal company data for comprehensive risk management at all levels of the supply chain. How can geopolitical, climatic and social risks be incorporated?
Risk management master thesis
Coordinating communication with all partners, working out the challenges and requirements of the project. Implementing the technical solution, developing data models and practical applications for the project partners. Understanding the current supply chain laws and of course writing the research paper.
Why am I doing this?
As a self-proclaimed data nerd, I thrive on creatively and logically engaging with data and its structures. Currently, there is a pressing need for solutions that enhance supply chain transparency, a topic that has gained significant attention and urgency. Graphs, particularly Spatial Knowledge Graphs, are emerging as a key trend in data science due to their ability to reveal complex relationships beyond just coordinates.
I am keen to explore the practical applications and potential of these graphs, especially in the realm of geospatial data, where relationships between geo phenomena offer valuable insights. Furthermore, my interest is aroused by the historical context of global trade in Augsburg using the example of the Fugger and Welser families and also the question of how we can trade more responsibly on a global scale today. By leveraging advanced geoinformatics, I aim to contribute to the ongoing conversation about supply chain sustainability and resilience.
How did I start?
I began my project by conducting an online data search, working with supplier lists that I found on company websites and the Open Supply Hub. Using this data, I built initial graphs and enriched them with additional risk information.
With the help of my colleague Daniela, I connected with WWF and spoke with Liam, who provided invaluable support. He shared WWF's extensive experience in sustainable supply chain consulting and guided me on using their WWF Risk Filter Suite. This tool allowed me to enrich factory locations with data not just at the country level but also at the river basin level. Liam's insights and inspiration were crucial in identifying key factors and considerations for my project.
How did I further concretise this?
Then I started to concretise the problem I want to investigate and how I could do this with graphs:
Problem: Companies face multiple challenges, including ensuring compliance with international standards and minimizing environmental risks and geopolitical uncertainties. Traditional data management systems are often unable to adequately visualize and analyze the complex and dynamic relationships within modern supply chains.
Solution: This project implements Spatial Knowledge Graphs for the comprehensive, dynamic and spatially oriented visualization of supply chains. These graphs integrate detailed geospatial data that goes far beyond simple location information and captures environmental influences, risks and interactions between different nodes in the supply chain.
Advantages:
-
Improved risk management: By integrating and providing an overview of all supply chain data, the approach enables early detection of and response to potential risks.
-
More informed decision-making: The clear visualization of supply chain relationships enables more strategic and informed decision-making and communication.
-
Sustainability and compliance: The model supports companies in complying with legal requirements and implementing sustainable practices, resulting in an improved market position and increased customer satisfaction.
How did I proceed from there?
I decided to focus on textile supply chains because some textile companies share information about their suppliers, driven by consumer demand and global reporting on production conditions. I was fortunate that Fabian from Bluesign responded to my email request. Bluesign, which also has an office in Augsburg :), is known for its stringent sustainability standards, particularly in chemical processes during production. They ensure that all production steps and individual parts up to the finished product meet their high standards, and they work closely with companies in the outdoor fashion industry.
Another significant development was deuter's positive response from Natalie and Marco to my collaboration request. As a long-time Deuter customer and someone who grew up in Gersthofen, (small town in the north of Augsburg), where deuter is based, this was personally gratifying. Growing up with deuter products made this collaboration particularly meaningful to me.
Deuter is doing a great job in their CSR team to fulfil the due diligence obligations towards their suppliers in their supply chain. Furthermore, deuter is also a longtime bluesign system partner.
How do I build a Knoweldge Graph from real supply chain data?
I received detailed information from deuter regarding their risk assessment processes, how they gather information about their suppliers, and how they evaluate risks for countries, products, and facilities. This insight was incredibly fascinating to me. Additionally, I obtained data about the materials used in Deuter’s products, their production locations, and additional information from their facilities. My task was to understand this data and build data structures and datasets from it, particularly because it included many text blocks. I needed to identify relationships within the data and find a way to model them effectively.
To enrich the supply chain data, I incorporated risk information from the WWF Risk Filter Suite, travel warnings from the Foreign Office, geodata on potential natural disasters, data on geopolitical events and anti-shipping attacks. I also wanted to try to model the sustainability reports from Deuter’s website using Neo4j’s graph builder, to get some extra information and to see if this might also be a way to model the sustainability reports as graphs very fastly. Additionally, I included headlines from the news to obtain up-to-date information on social risks. Since there were no existing datasets for this, I employed natural language processing (NLP) to categorize news as risky or not. hankfully, today's technology allows me to to process news in different languages, capturing headlines and events that might not reach us in Europe.


Graphbuilder Sustainability Report deuter

Graph Country Information with Text Chunks

Graph Emissions Information
What does the graph currently look like?
The knowledge graph construction started from the materials, establishing relationships to identify which suppliers purchase which materials and which other factories are involved in producing these materials. Each factory and supplier is geolocated, tied to a specific country, province, and nearby seascape. These seascapes are evaluated for geo-risks concerning water and biodiversity. The graph includes information from Tier 1 suppliers, detailing their locations and purchasing relationships.
For both Tier 1 and Tier 2 suppliers, deuter conducts risk evaluations at the country level, which are integrated into the graph and linked to source materials. Articles from these source evaluations are also included, allowing for keyword searches within the graph to discover connections and detailed information within the articles. The graph illustrates how materials are used to create products, each with varying risks that impact vulnerable groups and different stages of production.

Additionally, the graph is enriched with live geo-risk information to identify areas currently affected by ongoing natural events such as floods, earthquakes, or fires. This dynamic data integration enhances the graph’s utility in providing real-time risk assessments.
What can I now do with this graph?
With the constructed knowledge graph, I can perform visual exploration to uncover insights about the supply chain. Different visualization techniques help to detect hierarchies and clusters within the graph, making it easier to understand complex relationships. I can use centrality measures to identify critical nodes that play a significant role in the network. Additionally, I am able to conduct in-depth analyses for specific use cases, providing detailed insights into supply chain dynamics and risk factors. This enables more informed decision-making and strategic planning based on the interconnected data within the graph.
What happens if an entire area is affected by an earthquake, for example, and production can no longer take place there? How does this outage affect my supply chain?
Small analysis of the centrality of the graph and relationships in the supply chain
Exkurs: What are my thoughts on problems around making supply chains more transparent?
One of the biggest challenges in the industry, from my persepctive, is identifying the precise location of a factory. The structure of addresses is not standardized, and no geocoding algorithm consistently works. Often, only large industry parks are given as locations, which is usually sufficient for assessing geo-risks and understanding the risks in the area. Additionally, names are often spelled differently, complicating the search further. For example, Google Maps was effective for locating factories in Vietnam due to job hunting comments, but China and South Korea proved to be very challenging for identifying coordinates based on addresses and company names.
A potential solution might be Open Supply Hub (OSH), where addresses and coordinates are verified. However, OSH's search algorithm is subpar, requiring precise spelling of names, which is particularly problematic for asian sources due to language barriers. Most companies lack
knowledge about the locations of their suppliers, extending beyond Tier 1 levels. Available data is often handwritten by suppliers themselves, leading to inconsistencies and missing information that might be self-evident locally but not shared further.
Cultural differences must also be considered. For instance, worker representation in Europe is
highly regarded, but in other countries initiatives like that might be corrupt, primarily serving as a means for some people to extract money and move on to the next company.
The EU's push for anonymous complaint systems faces resistance as workers fear repercussions. It takes a long time to establish trust with suppliers and local workers. Especially in rapidly changing supply relationships, this is a problem that cannot be solved so easily due to the fast pace of life. In addition, there are often language barriers and the level of education is very low. The local people affected cannot read or write and do not know their rights, nor do they often know who they are actually producing for.
Information available to German/European companies about supply chain risks or local conditions is often limited to PDFs from trade associations, lacking comprehensive data sets. Furthermore, the middle segments of supply chains are frequently overlooked. In textiles, the focus is primarily on seamstresses, while in food, it's on cultivation, ignoring intermediary stages. Different stages of supply chains carry varied risks; for example, early stages in the textile industry involve fewer people but greater exposure to hazardous chemicals, whereas later stages impact more people but have less visible environmental effects.
Environmental impacts on production, such as earthquakes, floods, and heat, and vulnerable
groups, like women or migrant workers, also need special consideration. Logistic chains are significantly influenced by environmental conditions, exemplified by the El Niño phenomenon affecting water levels in the Panama Canal.
Comprehensive risk and resilience assessments for supply chains are rarely conducted in practice. Many companies rely on inadequate tools for managing complex supply chain information. Companies report difficulties in accurately presenting the data they collect due to its multi-dimensional and interconnected nature.
Collaboration between research and business is limited, particularly in assessing supply chain
resilience with real data. EU regulations are challenging to implement, as suppliers are reluctant to disclose detailed supply chain data due to fears of being bypassed. The intention to address
deficiencies without simply replacing suppliers is often misunderstood or mistrusted.
While graphs are recognized as a powerful tool, there is a general reluctance to engage with or
understand them. Companies face significant IT and data challenges, often lacking in-house
expertise to tackle technical issues.
Conclusion right now: Spatial Knowledge Graphs offer a promising approach to overcoming the
complex challenges in supply chain management. By leveraging geospatial data and advanced
visualization techniques, companies can achieve greater sustainability and resilience in their
supply chains, ensuring compliance with international regulations and improving overall
operational efficiency.
Especially the resilience can be evaluated very easily. Also, data silos are a huge issue that could be overcome by using graphs.
Talks and Speeches


Showcase: Nachhaltiges Lieferkettenmanagement | Esri Konferenz Schweiz 2024

10 - SAE - Leonie Engemann.mp4






