World Wide Web

World Wide Web: Internet and Web Information Systems (WWW) is an international, archival, peer-reviewed journal that covers all aspects of the Web, including ...

List of Papers (Total 176)

Determining modified versions of social media images

Apr 2025 | He, Qijun, Umair, Muhammad, Bouguettaya, Athman, et al.

Social media platforms usually contain several modified versions of an image. This proliferation of versions calls the trustworthiness of social media images into question. We propose a novel framework to find modified versions of social media images using only their metadata. We consider several aspects to determine if an image is a modified version of another image. These aspects include topic of...

Causal integration in graph neural networks toward enhanced classification: benchmarking and advancements for robust performance

Apr 2025 | Job, Simi, Tao, Xiaohui, Cai, Taotao, et al.

The expansion of Graph Neural Networks (GNNs) has highlighted the importance of evaluating their performance in real-world scenarios. However, existing evaluation frameworks often overlook the integration of causality, a critical component that is essential for more robust evaluation of GNNs. To address this gap, we present a benchmark study that systematically compares standard...

JAL: an algebra for JSON query optimization

Mar 2025 | Langerak, Anne Jasmijn, Frasincar, Flavius, Klinkhamer, Jasmijn

As databases become larger and less structured, the JavaScript Object Notation (JSON) data format has risen in usage compared to other data formats like XML. At the same time, while extracting data from these large datasets efficiently is of obvious importance, there has been far less research regarding the optimization of JSON queries than there has relating to the querying of...

Guest Editorial: Special issue on “Neuro-Symbolic Intelligence: large Language Model enabled Knowledge Engineering”

Jan 2025 | Wang, Haofen, Khan, Arijit, Liu, Jun, et al.

Integration and innovation of blockchain in Web3.0: current status and standardization prospects

Dec 2024 | Xiangjuan, Jia, Xinwei, Fang, Yijie, Zhang, et al.

In the Web3.0 era, which does not rely on any centralized organization and emphasizes user control, security, trustworthiness and the importance of data privacy, blockchain plays a key role. Its decentralization, security and trustworthiness and other characteristics have become Building the infrastructure of trusted interconnection and value interconnection in the Web3.0 era has...

Special issue on the 15th International Conference on Management of Digital EcoSystems (MEDES 2023) and the 27th International Database Engineered Applications Symposium (IDEAS 2023)

Dec 2024 | Tekli, Joe, Benslimane, Djamal, Chbeir, Richard, et al.

Use of prompt-based learning for code-mixed and code-switched text classification

Sep 2024 | Udawatta, Pasindu, Udayangana, Indunil, Gamage, Chathulanka, et al.

Code-mixing and code-switching (CMCS) are prevalent phenomena observed in social media conversations and various other modes of communication. When developing applications such as sentiment analysers and hate-speech detectors that operate on this social media data, CMCS text poses challenges. Recent studies have demonstrated that prompt-based learning of pre-trained language...

FSSDroid: Feature subset selection for Android malware detection

Jul 2024 | Polatidis, Nikolaos, Kapetanakis, Stelios, Trovati, Marcello, et al.

Android malware has become an increasingly important threat to individuals, organizations, and society, posing significant risks to data security, privacy, and infrastructure. As malware evolves in sophistication and complexity, the detection and mitigation of these malicious software instances have become more challenging and time consuming since the required number of features...

Hierarchical adaptive evolution framework for privacy-preserving data publishing

Jul 2024 | You, Mingshan, Ge, Yong-Feng, Wang, Kate, et al.

The growing need for data publication and the escalating concerns regarding data privacy have led to a surge in interest in Privacy-Preserving Data Publishing (PPDP) across research, industry, and government sectors. Despite its significance, PPDP remains a challenging NP-hard problem, particularly when dealing with complex datasets, often rendering traditional traversal search...

Editorial on the Special Issue of the World Wide Web journal with selected papers from the 22nd International Conference on Web Information Systems Engineering (WISE)

Jul 2024 | Chbeir, Richard, Huang, Helen, Manolopoulos, Yannis, et al.

When large language models meet personalization: perspectives of challenges and opportunities

Jun 2024 | Chen, Jin, Liu, Zheng, Huang, Xu, et al.

The advent of large language models marks a revolutionary breakthrough in artificial intelligence. With the unprecedented scale of training and model parameters, the capability of large language models has been dramatically improved, leading to human-like performances in understanding, language synthesizing, common-sense reasoning, etc. Such a major leap forward in general AI...

Using knowledge graphs for audio retrieval: a case study on copyright infringement detection

Jun 2024 | Montanaro, Marco, Rinaldi, Antonio Maria, Russo, Cristiano, et al.

Identifying cases of intellectual property violation in multimedia files poses significant challenges for the Internet infrastructure, especially when dealing with extensive document collections. Typically, techniques used to tackle such issues can be categorized into either of two groups: proactive and reactive approaches. This article introduces an approach combining both...

Cloud storage cost: a taxonomy and survey

May 2024 | Khan, Akif Quddus, Matskin, Mihhail, Prodan, Radu, et al.

Cloud service providers offer application providers virtually infinite storage and computing resources, while providing cost-efficiency and various other quality of service (QoS) properties through a storage-as-a-service (StaaS) approach. Organizations also use multi-cloud or hybrid solutions by combining multiple public and/or private cloud service providers to avoid vendor...

A heterogeneous graph-based semi-supervised learning framework for access control decision-making

May 2024 | Yin, Jiao, Chen, Guihong, Hong, Wei, et al.

For modern information systems, robust access control mechanisms are vital in safeguarding data integrity and ensuring the entire system’s security. This paper proposes a novel semi-supervised learning framework that leverages heterogeneous graph neural network-based embedding to encapsulate both the intricate relationships within the organizational structure and interactions...

The medium is the message: toxicity declines in structured vs unstructured online deliberations

May 2024 | Klein, Mark, Majdoubi, Nouhayla

Humanity needs to deliberate effectively at scale about highly complex and contentious problems. Current online deliberation tools, such as email, chatrooms, and forums, are, however, plagued by levels of discussion toxicity that deeply undercut the willingness and ability of the participants to engage in thoughtful, meaningful deliberations. This has led many organizations to...

OntoMedRec: Logically-pretrained model-agnostic ontology encoders for medication recommendation

Apr 2024 | Tan, Weicong, Wang, Weiqing, Zhou, Xin, et al.

Recommending medications with electronic health records (EHRs) is a challenging task for data-driven clinical decision support systems. Most existing models learn representations for medical concepts based on EHRs and make recommendations with the learnt representations. However, most medications appear in EHR datasets only a limited number of times (the frequency distribution of medications...

Efficient processing of coverage centrality queries on road networks

Apr 2024 | Xu, Yehong, Zhang, Mengxuan, Wu, Ruizhong, et al.

Coverage centrality is an important metric for evaluating vertex importance in road networks. However, current solutions have to compute the coverage centrality of all the vertices together, which wastes resources, especially when the centrality of only some vertices is required. In addition, they adapt poorly to dynamic scenarios because of their computational inefficiency. In...

Adaptive retrofitting for industrial machines: utilizing webassembly and peer-to-peer connectivity on the edge

Jan 2024 | Nakakaze, Otoya, Koren, István, Brillowski, Florian, et al.

Leveraging previously untapped data sources offers significant potential for value creation in the manufacturing sector. However, asset-heavy shop floors, extended machine replacement cycles, and equipment diversity necessitate considerable investments for achieving smart manufacturing, which can be particularly challenging for small businesses. Retrofitting presents a viable...

Privacy-preserving data publishing: an information-driven distributed genetic algorithm

Jan 2024 | Ge, Yong-Feng, Wang, Hua, Cao, Jinli, et al.

The privacy-preserving data publishing (PPDP) problem has gained substantial attention from research communities, industries, and governments due to the increasing requirements for data publishing and concerns about data privacy. However, achieving a balance between preserving privacy and maintaining data quality remains a challenging task in PPDP. This paper presents an...

Enhancing bitcoin transaction confirmation prediction: a hybrid model combining neural networks and XGBoost

Dec 2023 | Zhang, Limeng, Zhou, Rui, Liu, Qing, et al.

With Bitcoin universally recognized as the most popular cryptocurrency, more Bitcoin transactions are expected to be added to the Bitcoin blockchain system. As a result, many transactions can encounter different confirmation delays. It therefore becomes vital to help a user understand (if possible) how long it may take for a transaction to be confirmed in...

Entity alignment via graph neural networks: a component-level study

Nov 2023 | Shu, Yanfeng, Zhang, Ji, Huang, Guangyan, et al.

Entity alignment plays an essential role in the integration of knowledge graphs (KGs) as it seeks to identify entities that refer to the same real-world objects across different KGs. Recent research has primarily centred on embedding-based approaches. Among these approaches, there is a growing interest in graph neural networks (GNNs) due to their ability to capture complex...

Death comes but why: A multi-task memory-fused prediction for accurate and explainable illness severity in ICUs

Nov 2023 | Chen, Weitong, Zhang, Wei Emma, Yue, Lin

Predicting the severity of an illness is crucial in intensive care units (ICUs) if a patient’s life is to be saved. The existing prediction methods often fail to provide sufficient evidence for time-critical decisions required in dynamic and changing ICU environments. In this research, a new method called MM-RNN (multi-task memory-fused recurrent neural network) was developed to...

KC-GEE: knowledge-based conditioning for generative event extraction

Oct 2023 | Wu, Tongtong, Shiri, Fatemeh, Kang, Jingqi, et al.

Event extraction is an important but challenging task. Many existing techniques decompose it into event and argument detection/classification subtasks, which are complex structured prediction problems. Generation-based extraction techniques lessen the complexity of the problem formulation and are able to leverage the reasoning capabilities of large pretrained language models...

FPGN: follower prediction framework for infectious disease prevention

Sep 2023 | Yu, Jianke, Zhang, Xianhang, Wang, Hanchen, et al.

In recent years, preventing the widespread transmission of infectious diseases in communities has been a research hot spot. Tracing close contact with infected individuals is one of the most severe problems. In this work, we present a model called Follower Prediction Graph Network (FPGN) to identify high-risk visitors, a task known as follower prediction. The model is...

Efficient continuous kNN join over dynamic high-dimensional data

Sep 2023 | Ukey, Nimish, Zhang, Guangjian, Yang, Zhengyi, et al.

Given a user dataset $\boldsymbol{U}$ and an object dataset $\boldsymbol{I}$, a kNN join query in high-dimensional space returns the $\boldsymbol{k}$ nearest neighbors of each object in dataset $\boldsymbol{U}$ from the object dataset $\boldsymbol{I}$. The kNN join is a basic and necessary operation in many applications, such as databases, data mining, computer vision, multi-media...
