Advanced search    

Search: Data Mining and Knowledge Discovery

462,486 papers found. Showing first 1,000 results.
Use AND, OR, NOT, +word, -word, "long phrase", (parentheses) to fine-tune your search.

Leveraging language representation for materials exploration and discovery

Data-driven approaches to materials exploration and discovery are building momentum due to emerging advances in machine learning. However, parsimonious representations of crystals for navigating the ... , towards functional materials design and discovery. Representing materials in the format of natural language enables effective utilization of materials science knowledge learnt from evergrowing unstructured

Species delimitation, discovery and conservation in a tiger beetle species complex despite discordant genetic data

characters, as well as genetic or other data. Tiger beetles are charismatic, of conservation concern, and much work has been done on the morphological delineation of species and subspecies, but few of these ... subspecies validity despite discordant data. The discovery and description of new biodiversity during an era of rapid species declines is vital for all the life sciences, especially because biodiversity is

MatKG: An autonomously generated knowledge graph in Material Science

In this paper, we present MatKG, a knowledge graph in materials science that offers a repository of entities and relationships extracted from scientific literature. Using advanced natural language ... comprehensive and integrated approach to organizing and sharing materials data is needed to enable effective data-driven discovery and to advance the field. One promising solution is the use of Knowledge graphs

GIS-based non-grain cultivated land susceptibility prediction using data mining methods

The purpose of the present study is to predict and draw up non-grain cultivated land (NCL) susceptibility map based on optimized Extreme Gradient Boosting (XGBoost) model using the Particle Swarm ... takes Chenggu County, Hanzhong City, Shaanxi Province, as the research area. Based on the natural driving factors of NCL and utilizing various data mining methods, we quantitatively predict the spatial

Integrating imaging and genomic data for the discovery of distinct glioblastoma subtypes: a joint learning approach

characterizing this heterogeneity. In this study, we developed unsupervised joint machine learning between radiomic and genomic data, thereby identifying distinct glioblastoma subtypes. A retrospective cohort of ... Vol.:(0123456789) www.nature.com/scientificreports/ Characteristics Discovery cohort Replication cohort No. of patients, n (%) 285 286 With imaging and genomic data 131 (45.96) 115 (40.21

Cluster analysis and visualisation of electronic health records data to identify undiagnosed patients with rare genetic diseases

and mining. Data mining, in the form of cluster analysis and visualisation, was performed on a database containing deidentified health records of 1.28 million patients across 3 major hospitals in ... previously. This proof-ofconcept study showed that it is possible to perform mining on EHR data albeit with some challenges and limitations. Rare genetic diseases affect 5–8% of the population and account for

Statistical detection of selfish mining in proof-of-work blockchain systems

, Ethereum and Bitcoin Cash. Our method is based on the realisation that selfish mining behaviour will cause identifiable anomalies in the statistics of miner’s successive blocks discovery. Secondly, we apply ... learning framework, called SquirRL, to evaluate both single and multiple agent selfish mining attacks in Bitcoin, Monacoin and Litecoin, The empirical data they scraped is the estimated hourly total hash

Mining the impact of social media information on public green consumption attitudes: a framework based on ELM and text data mining

employs the Elaboration Likelihood Model (ELM) and text data mining to examine how information strategies from government, businesses, and media influence consumer attitudes toward green consumption. The ... the combination of ELM theory and text data mining techniques to monitor public attitude change, applicable not only to green consumption but also to other fields. - Introduction 2021 i Driven by the

Automating data analysis for hydrogen/deuterium exchange mass spectrometry using data-independent acquisition methodology

enables true auto-curation of HX data by mining a rich set of deuterated fragments, generated by collisional-induced dissociation (CID), to simultaneously confirm the peptide ID and authenticate MS1-based ... suitable number of peptides in the sample workup process. The method has been reviewed extensively in recent years4–12. The standardization of experimental and data reporting protocols have improved the 1

Towards a practical use of text mining approaches in electrodiagnostic data

Healthcare professionals produce abounding textual data in their daily clinical practice. Text mining can yield valuable insights from unstructured data. Extracting insights from multiple information ... ( 2008 ). 11. Luque , C. et al. An advanced review on text mining in medicine . Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 9 ( 3 ), e1302 ( 2019 ). 12. Jensen , P. B. , Jensen , L

Discovery and structural mechanism of DNA endonucleases guided by RAGATH-18-derived RNAs

CRISPR-Cas systems and IS200/IS605 transposon-associated TnpBs have been utilized for the development of genome editing technologies. Using bioinformatics analysis and biochemical experiments, here ... revolution in the field of genome editing.12–16 A comprehensive understanding of bacterial defense systems and the discovery of specific molecular components within them may provide new insights into overcoming

Mosaic integration and knowledge transfer of single-cell multimodal data with MIDAS

atlas to new data. Applications in mosaic integration, pseudotime analysis and cross-tissue knowledge transfer on bone marrow mosaic datasets demonstrate the versatility and superiority of MIDAS. MIDAS is ... multimodal data. Such an atlas can serve as an encyclopedia, allowing researchers the ability to transfer knowledge to their new data and in-house studies13–15. Several methods for single-cell multimodal

AI and the democratization of knowledge

interpreted and used by a broad community comprising both non-experts and AI (Fig. 1). Thus, the transformations of data to knowledge should be made with the explicit aim of democratizing the data and knowledge ... , and we should always ask of any data set: which transformations are required to produce a knowledge representation that is usable by as large a community as possible, including machines? Training AI to

Construction of knowledge constraints: a case study of 3D structural modeling

by knowledge. Specifically, we focus on utilizing knowledge rule reasoning technology to extract topological semantic knowledge from interpretive data and employ knowledge inference to derive ... ) conversion of geological data into constraints of a topology knowledge graph, (2) mining of entity and relationship information in geological data through knowledge reasoning, and (3) expert determination of

De novo and somatic structural variant discovery with SVision-pro

Long-read-based de novo and somatic structural variant (SV) discovery remains challenging, necessitating genomic comparison between samples. We developed SVision-pro, a neural-network-based instance ... comparative SV detection and genotyping, addressing the challenges in de novo and somatic SV discovery from long-read data. SVision-pro visually compares genomic features encoded from sequencing alignments

Modeling multiple sclerosis using mobile and wearable sensor data

as well as their ability to distinguish people with MS (PwMS) from healthy controls, recognize MS disability and fatigue levels. To this end, we formalize clinical knowledge and derive behavioral ... Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM KDD 2016), pp. 785–794 (2016). 58. Géron, A. Hands-on machine learning with scikit-learn and tensorflow

Data encoding for healthcare data democratization and information leakage prevention

The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that ... data . In: International Conference on Knowledge Discovery and Data Mining . 245 - 250 ( PMLR , 2001 ). 20. Vempala , S. S. The Random Projection Method Vol. 65 . (American Mathematical Soc ., 2005 ). 21

The OREGANO knowledge graph for computational drug repurposing

Drug repositioning is a faster and more affordable solution than traditional drug discovery approaches. From this perspective, computational drug repositioning using knowledge graphs is a very ... promising direction. Knowledge graphs constructed from drug data and information can be used to generate hypotheses (molecule/drug - target links) through link prediction using machine learning algorithms

Characteristics of surrounding rock damage and control technology of a facing-mining excavating roadway in north Shaanxi mining area

investigates the deformation and damage characteristics of the surrounding rock in different stages using FLAC 3D numerical simulation, taking the facing-mining excavating roadway of this coal mine as the ... stability is conducted on-site, showing that the roadway is significantly affected by mining at the 50 m point ahead of the working face. Based on the numerical simulation and on-site monitoring results, the

A comprehensive historical and geolocalized database of mining activities in Canada

This paper introduces the MinCan database that presents mine-level estimates for the Canadian mining industry with a persistent annual coverage between 1950 and 2022. These estimates are based on ... provincial and territorial data over secondary sources like Mindat and non-experts in the collection of information. Lastly, Natural Resources Canada hold additional mining statistics exist but are aggregated