This section focuses on the stakeholders who use the data for various purposes.
Most existing OD systems are neither user-driven nor balance demand-supply matching. Although users' role is assumed to be critical for the success of OD ecosystems, suppliers drive current ecosys-tems (Gascó-Hernández et al., 2018; Sieber & Johnson, 2015; Van Loenen, 2018; Crusoe et al., 2019). This means that provisioning data from governments to end users is often made through a one-way data portal or platform (Evans & Campos, 2013) without differentiating between various types of users (including cross-border users) or providing data in the way that the data supplier perceives as satisfying (see European Commission, 2020c). Even more, the published data may have no value or are not directly usable (Rhind, 2014). OD providers do not close (enable) the feedback loop – a mech-anism for demand-supply matching – and hence are unaware of the actual use and users of their data, and what data and in what form the users need the data to create value (Harrison et al., 2012; Susha, 2015). That is, they only know that opening datasets increases “usage numbers”, i.e. number of downloads (van Loenen et al, 2018).
CircularExisting OD systems are often linear. In linear OD ecosystems, OD users typically capture value without adding value to the system themselves. At the moment, only very few stakeholders in the ecosystem profit from OD, and usually, the ones who profit are not the ones who invested most effort, time, and money (Pollock, 2011; Welle Donker & Van Loenen, 2018). As a result of this unfair distribution of value in the OD ecosystem, stakeholders may be reluctant to participate. An OD eco-system is circular (BSI Group, no year) when data is opened by data providers and then processed so that intermediaries, value-adding resellers, and enrichers can keep processed data in use for as long as possible. Users extract the maximum value from derived data/services. Finally, closing the cycle, stakeholders provide additional value to the ecosystem (Alexopoulos et al., 2014). For exam-ple, they can deliver value-added products based on the OD or evaluate data usage processes with data providers (Charalabidis, 2016). The necessary type of ecosystem is circular in nature, building upon a complex network of value generated by different participants that are creating valuable in-formation and products and services. However, note that a circle that is too small can harm circular OD systems because prosumers are then focused on serving data to themselves.
InclusiveMany current OD systems are exclusive. Research and practice mainly focus on a minimal selec-tion of OD providers and user groups, namely governments as data providers and companies as OD re-users. At the same time, many more stakeholders exist (Van Loenen et al., 2018). Consequently, government data and non-government data (e.g., from businesses, citizens, researchers, and others) are often not integrated and do not cross borders. Moreover, not all stakeholders (e.g., data provid-ers, intermediaries, value-adding resellers, enrichers, facilitators, end-users) from any nature (com-mercial, government, scientific, citizen) participate in OD ecosystem processes (eu2020.de, 2020; Eu-ropean Commission 2020).
Skill-basedExisting OD systems usually depend on best effort. There is a lack of skilled people to use OD. Re-search indicates that many businesses and governments do not have the knowledge and skills to decide on and implement effective strategies of making the best use of their data (Barbero et al., 2018; Davies et al., 2019). This is already reflected in the digital gap within the EU public administra-tions and the significant variations in the quality of digital public services and which is heavily in-fluenced by the discrepancies in digital skills and competences (Chinn et al., 2020; European Union, 2020). There are currently not enough people who have the right data skills (European Commission, 2020b; IDC & Open Evidence, 2017). This problem is likely to get even worse in the future (Deloitte, 2017). Despite the relevance of soft skills (Laker and Powell, 2011) when considering technological changes (Snape, 2017), the skills required are a higher education degree in economics, mathematics, physics, or other relevant scientific disciplines, plus familiarity with the industry concerned (Econ-omist Intelligence Unit, 2012). The combination of all these skills is difficult to obtain (Deloitte, 2017).
Using the ODECO paradigm, we can group the identified research gaps into four categories based on ODECO lenses: User-Driven, Circular, Inclusive, and Skill-based. However, some of the identified gaps are horizontal in nature, which do not belong to the ODECO lenses but cover the entire open data ecosystem.
The identified research gaps (challenges) in Figure 4 have been categorized based on open data ecosystem drivers, aiming to find solutions through these drivers. For example, OD drivers address these challenges by providing potential solutions, as described in the given below Table.
Some research gaps are cross-cutting and require continuous efforts to be resolved, such as infrastructure, resources, governance, and skills. Specifically, the user-driven approach in an open data ecosystem enhances adaptability, scalability, and innovation by ensuring data is discoverable, high-quality, and efficiently shared with stakeholders as per their needs.
Standardizing the classification of datasets ensures consistency and improves data discoverability. Adopting established classification frameworks, such as those used by the European Data Portal, facilitates interoperability and alignment with broader data ecosystems. Thematic categorization enhances the usability of datasets, enabling more efficient search, retrieval, and analysis by end users. Moreover, enhancing metadata quality for improved data discoverability to maximize the impact of open data, policymakers should enforce rigorous metadata requirements, ensuring datasets include comprehensive descriptions, provenance, and structured classifications. This will improve data discoverability, support machine-readable formats, and enable more effective data utilization by researchers, businesses, and the public sector.
Governments should prioritize the adoption of Linked Open Data principles to strengthen semantic interoperability. By leveraging ontologies such as SKOS and Wikidata, data portals can interlink datasets more effectively, allowing for better contextualization, cross-domain analysis, and enhanced knowledge extraction. Governments and organizations should establish mandatory compliance with widely accepted interoperability standards such as DCAT, FOAF, and Schema.org. Ensuring adherence to these standards will facilitate seamless data exchange across platforms, enhance data usability, and promote a more integrated open data ecosystem. This will strengthen open data Interoperability Through Standardization.
A dedicated governance framework should be implemented to oversee the technical validation of open data, ensuring data consistency, reliability, and compliance with best practices. This framework should include automated validation tools, periodic audits, and collaborative stakeholder engagement to maintain high data quality standards.
To enhance user engagement and data quality, policymakers should mandate the establishment of feedback mechanisms such as user surveys, dataset ratings, and discussion forums on open data platforms. These tools allow users to report issues, suggest improvements, and engage in continuous dialogue with data providers. Integrating AI-driven sentiment analysis and real-time dashboards can help improve platform responsiveness and trust, creating a dynamic and user-responsive ecosystem that fosters continuous data quality improvement and broader adoption.
Governments should adopt standardized interfaces for open data portals that streamline the data publication process. A uniform framework for data entry, metadata documentation, and access management will reduce inconsistencies and ensure datasets are discoverable and interoperable across platforms. Automation tools like data validation and version control can enhance data quality, while open APIs and export functionalities will maximize data sharing and integration, ultimately improving accessibility and encouraging broader participation in the open data ecosystem.
Enhancing data usability requires offering diverse access points, including various data formats, APIs, and user-friendly interfaces that accommodate both technical and non-technical users. Applying FAIR (Findable, Accessible, Interoperable, and Reusable) principles, open standards, and data quality profiling techniques ensures greater accessibility and engagement. By enabling multiple entry points, users can retrieve and interact with data more efficiently, fostering broader adoption and utilization of open data. Policymakers should encourage the adoption of technical openness in data publication by adhering to machine-readable formats, following Open API specifications, and utilizing technical standards that improve data integration across systems. Offering tools such as API explorers and fostering collaboration on technical solutions will ensure that datasets are accessible and reusable. A methodology to quantify the technical openness of datasets should be developed to ensure that open data platforms meet the highest standards of data interoperability. Ensuring inclusivity in open data platforms involves designing intuitive interfaces, offering interactive visualizations, and providing multilingual support. Features such as screen reader compatibility, high-contrast visuals, and accessible navigation cater to diverse user needs, including those with disabilities. By reducing usability barriers, open data becomes more accessible to a wider audience, driving engagement and effective utilization.
Encouraging private entities, research institutions, and civil society organizations to share data requires addressing legal, technical, and organizational barriers. Establishing clear licensing frameworks, anonymization techniques, and secure data-sharing mechanisms can help build trust among non-government data holders. Open-source platforms can further support seamless collaboration by ensuring interoperability and compliance with existing data standards. Partnerships and coordination efforts should also be strengthened to promote contributions from diverse stakeholders.
Fostering collaboration among government agencies, private sector actors, researchers, and the public necessitates the development of shared objectives and values. Organizing stakeholder consultations, workshops, and collaborative forums can help align goals and create a common vision for open data initiatives. Promoting a culture of transparency, innovation, and community-driven solutions strengthens long-term commitment to open data practices.
Addressing local challenges through open data requires conducting needs assessments and actively engaging with local stakeholders. By identifying specific problems and collaborating with communities, policymakers can develop targeted data-driven interventions. Building sustainable data-sharing networks and fostering partnerships with local organizations help ensure that open data initiatives effectively respond to regional priorities and constraints.
Bridging the gap between subject matter experts and data scientists enhances the quality and applicability of open data solutions. Structured initiatives such as hackathons, interdisciplinary workshops, and collaborative research projects provide opportunities for knowledge exchange. Additionally, offering incentives like grants and training programs encourages long-term engagement and fosters innovation in data-driven decision-making.
To create a more inclusive open data ecosystem, governments should ensure that marginalized groups, such as small-scale data contributors and NGOs, have the resources and opportunities to engage meaningfully in data-sharing activities. This can be achieved by creating accessible, multilingual feedback channels, offering capacity-building programs, and incentivizing participation through training and support. By adopting collaborative governance models and fostering community-driven data stewardship, policymakers can bridge power imbalances and ensure that the open data ecosystem is equitable and sustainable.
To enhance the accessibility of open data for non-technical users, governments should support the development and distribution of low-code data analysis tools. These platforms will allow users with varying levels of expertise to analyze open data and generate insights. Policymakers should also provide training for educators and intermediaries on how to use these tools effectively, ensuring that a wide range of stakeholders can participate in the data-driven decision-making process. Moreover, Governments should encourage the development and distribution of data augmentation and validation tools that support various dataset types. These tools, developed by intermediaries within the open data ecosystem, should be adaptable to different data types and intended uses. By enhancing data quality through these tools, governments can ensure that open data remains accurate, relevant, and usable, further improving its value to stakeholders across sectors.