Can you help us unlock faster, smarter vulnerability insights?
When machinery is procured to support national security and defence it has to be thoroughly checked for security vulnerabilities — and those vulnerabilities need to be understood and addressed. The work depends on highly skilled security researchers who assess vulnerabilities and advise on mitigations. But before the expert analysis can begin, there’s a significant bottleneck: finding, indexing and understanding the vast amount of open-source technical information that exists about complex industrial machinery.
HMGCC Co-Creation are launching this challenge to develop a software tool to Technology Readiness Level 6, that works without an internet connection and can assist a security researcher to index, search and understand vast quantities of data faster, enabling faster decision making.
HMGCC Co-Creation will provide funding for time, materials, overheads and other indirect expenses for successful applicants, with a total budget of up to £60k.
-
This challenge is open to sole innovators, industry, academic and research organisations of all types and sizes. There is no requirement for security clearances.
Solution providers or direct collaboration from countries listed by the UK government under trade sanctions and/or arms embargoes are not eligible for HMGCC CoCreation challenges.
Technology themes: artificial intelligence, app development, cybersecurity, data science and engineering, information technology, machine learning, software development, threat modelling, vulnerability research.
-
National security organisations undertake sensitive activities but also depend on complex supply chains to acquire and maintain the technology they need to operate. As part of that, security researchers carry out detailed tear-downs examining software, hardware and data components to identify possible vulnerabilities.
The first stage of this process is research. When a security researcher is tasked to examine a new product, particularly in the context of industrial control systems, they need to draw on open-source information at a micro-component level, such as technical specifications, datasheets, schematics and technical forum discussions. It is laborious and takes time that could be better spent on the analysis itself.
This challenge is about changing that. We believe that human-machine teaming offers a real opportunity to reduce the research burden. Specifically, we are looking for a system that can do three things:
- Index both structured and unstructured technical information about a product and its components.
- Generate a clear technical summary of the product and its individual components.
- Allow the researcher to ask natural-language questions about the product and explore the information interactively, adapting their line of investigation as new information emerges.
-
Industrial control systems can be highly complex and thus time consuming to index and query related information. Complexities can arise from the following:
- Products vary significantly, meaning there is no one size fits all.
- There can be multiple product versions with varied components and software updates.
- Security researchers rely on their experience, processes and trusted sources such as information directly from the vendor and trusted online forums.
- A chain of trust is formed from:
- Physical components such as filters, fuses, processors and memory sensors.
- Software across a range of forms including source code or binary for multiple different processors and operating systems in the same product.
-
Alicia is an experienced security researcher with a focus on industrial control systems. She has been tasked to assess an industrial additive manufacturing machine. The machine will be used in a manufacturing facility without an internet connection, to build critical, classified, components for national security and defence operations. Any vulnerabilities must be understood and mitigated.
She begins with the vendor manual, supplied in paper copy and PDF. Using the wiring diagram and schematic she starts to investigate the hardware architecture, including interfaces and components and microprocessors. She sources datasheets online for each component and also finds photos of various teardowns.
She starts to pull-out all the available code supplied by the vendor. She consults online forums, some trusted and some new to her.
Alicia starts to build up a large library of information on the product and its components. She drags and drops each bit of information into her ‘tear down assistant’ tool.
As she builds this library of information, she naturally starts to learn about how the machine works, but she also needs to be able to call back on this vast amount of information efficiently. An intelligent, easy to use search and summary capability is essential.
When she wants to explore the machine’s interfaces, she types a query into the tool and receives a conversational response, backed by a reliable source. She builds on this with follow-up questions, each time receiving a well-grounded answer, citing sources. Where answers are not clear, this is highlighted with alternative theories. As the assistant starts to learn Alicia’s behaviour, it adapts to her needs.
This operates more than just a search tool but is more like a personal assistant who really understands the subject matter, and Alicia.
-
This challenge focuses on building a standalone software tool that can ingest relevant open-source information, compile it into a searchable library, support natural language queries across multi-modal formats and provide conversational intelligent and well-informed answers. We would like to see proposals which don’t just focus on off the shelf Retrieval-Augmented Generation systems.
After the 12-week project, the final deliverable should be a software tool meeting the stated requirements for testing in-house at HMGCC.
Essential requirements:
- The tool must have the ability to understand system architecture of a selected machine. A non-exhaustive list of components to understand are the physical interface interactions, data interfaces and protocols.
- Have an ability to check and validate responses before publishing, to prevent erroneous information and hallucinations.
- Characterise from multimedia inputs, such as including manuals, schematics, datasheets, corporate databases, images, code, handwritten annotations.
- Verify information by listing sources and cross checking against high confidence data such as industry publications, academic research and manufacturer documentation.
- Flag a confidence score and if more source data is required.
- The solution should be capable of operating on a laptop without an internet connection, allowing users to characterise complex systems and identify vulnerabilities in environments with limited or no connectivity.
- Provide an easy to search and intelligent function to query the dataset in a chat-like manner.
- Keep a memory of queries so conversations can be continued over several weeks without repetition of prompts.
Desirable requirements:
- Build a profile of the user and adapt to their needs, for example to present information in preferred formats and even proactively provide information that is frequently requested.
- Ability to translate and index non-English data sources (e.g. datasheets and forum posts).
- Recognise and mitigate cultural biases to ensure a nuanced understanding.
- Ensure the software tool remains up-to-date when offline. Consider in a future iteration how the solution may incorporate a mechanism for periodic updates of the core tool and its indexing/search algorithms.
Constraints:
- The tool must work without an internet connection.
Not required:
- For this challenge, the system does not need to autonomously identify or search for source data (e.g. datasheets, schematics and forum posts). Test data will be provided.
-
Competition opens: Monday 16 March 2026
Briefing Call: 10am Friday 17 April 2026 (join the briefing call on MS Teams)
Clarifying questions deadline: Friday 17 April 2026
Clarifying questions published: Tuesday 28 April 2026
Competition closes: Thursday 7 May 2026
Applicants notified: Friday 22 May 2026
Pitch Day: Tuesday 2 June 2026
Pitch Day outcome: Monday 8 June 2026
Commercial onboarding begins: Friday 12 June 2026 (the successful solution provider will be expected to have availability for a one-hour onboarding call via MS Teams)
Target project kick-off: July 2026
For any queries, please email Co-Creation@dstl.gov.uk and cocreation@hmgcc.gov.uk. You can also contact Innovate UK Business Connect’s Security and Defence team.