Advanced search    

Search: authors:"Xiaoqian Jiang"

17 papers found.
Use AND, OR, NOT, +word, -word, "long phrase", (parentheses) to fine-tune your search.

FORESEE: Fully Outsourced secuRe gEnome Study basEd on homomorphic Encryption

Background The increasing availability of genome data motivates massive research studies in personalized treatment and precision medicine. Public cloud services provide a flexible way to mitigate the storage and computation burden in conducting genome-wide association studies (GWAS). However, data privacy has been widely concerned when sharing the sensitive information in a cloud...

Protecting genomic data analytics in the cloud: state of the art and opportunities

The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect...

Influences of Dominance and Evolution of Sex in Finite Diploid Populations

Most eukaryotes reproduce sexually. Although the benefits of sex in diploids mainly stem from recombination and segregation, the relative effects of recombination and segregation are relatively less known. In this study, we adopt an infinite loci model to illustrate how dominance coefficient of mutations affects the above-mentioned genetic events. However, we assume mutational...

Privacy-preserving GWAS analysis on federated genomic datasets

Background The biomedical community benefits from the increasing availability of genomic data to support meaningful scientific research, e.g., Genome-Wide Association Studies (GWAS). However, high quality GWAS usually requires a large amount of samples, which can grow beyond the capability of a single institution. Federated genomic data analysis holds the promise of enabling...

Grid multi-category response logistic models

Background Multi-category response models are very important complements to binary logistic models in medical decision-making. Decomposing model construction by aggregating computation developed at different sites is necessary when data cannot be moved outside institutions due to privacy or other concerns. Such decomposition makes it possible to conduct grid computing to protect...

A community assessment of privacy preserving techniques for human genomes

To answer the need for the rigorous protection of biomedical data, we organized the Critical Assessment of Data Privacy and Protection initiative as a community effort to evaluate privacy-preserving dissemination techniques for biomedical data. We focused on the challenge of sharing aggregate human genomic data (e.g., allele frequencies) in a way that preserves the privacy of the...

GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda

Background Non-coding sequences such as microRNAs have important roles in disease processes. Computational microRNA target identification (CMTI) is becoming increasingly important since traditional experimental methods for target identification pose many difficulties. These methods are time-consuming, costly, and often need guidance from computational methods to narrow down...

Differentially private distributed logistic regression using private and public data

Background Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes...

Evolution of the F-Box Gene Family in Euarchontoglires: Gene Number Variation and Selection Patterns

F-box proteins are substrate adaptors used by the SKP1–CUL1–F-box protein (SCF) complex, a type of E3 ubiquitin ligase complex in the ubiquitin proteasome system (UPS). SCF-mediated ubiquitylation regulates proteolysis of hundreds of cellular proteins involved in key signaling and disease systems. However, our knowledge of the evolution of the F-box gene family in...

A new computational strategy for predicting essential genes

Background Determination of the minimum gene set for cellular life is one of the central goals in biology. Genome-wide essential gene identification has progressed rapidly in certain bacterial species; however, it remains difficult to achieve in most eukaryotic species. Several computational models have recently been developed to integrate gene features and used as alternatives...

WebGLORE: a Web service for Grid LOgistic REgression

WebGLORE is a free web service that enables privacy-preserving construction of a global logistic regression model from distributed datasets that are sensitive. It only transfers aggregated local statistics (from participants) through Hypertext Transfer Protocol Secure to a trusted server, where the global model is synthesized. WebGLORE seamlessly integrates AJAX, JAVA Applet...

DNA-COMPACT: DNA COMpression Based on a Pattern-Aware Contextual Modeling Technique

Genome data are becoming increasingly important for modern medicine. As the rate of increase in DNA sequencing outstrips the rate of increase in disk storage capacity, the storage and data transferring of large genome data are becoming important concerns for biomedical researchers. We propose a two-pass lossless genome compression algorithm, which highlights the synthesis of...

Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): An Algorithm for Joint Optimization of Discrimination and Calibration

Historically, probabilistic models for decision support have focused on discrimination, e.g., minimizing the ranking error of predicted outcomes. Unfortunately, these models ignore another important aspect, calibration, which indicates the magnitude of correctness of model predictions. Using discrimination and calibration simultaneously can be helpful for many clinical decisions...

Patterns of nucleotides that flank substitutions in human orthologous genes

Background Sequence context is an important aspect of base mutagenesis, and three-base periodicity is an intrinsic property of coding sequences. However, how three-base periodicity is influenced in the vicinity of substitutions is still unclear. The effect of context on mutagenesis should be revealed in the usage of nucleotides that flank substitutions. Relative entropy (also...

The Influence of Deleterious Mutations on Adaptation in Asexual Populations

We study the dynamics of adaptation in asexual populations that undergo both beneficial and deleterious mutations. In particular, how the deleterious mutations affect the fixation of beneficial mutations was investigated. Using extensive Monte Carlo simulations, we find that in the “strong-selection weak mutation (SSWM)” regime or in the “clonal interference (CI)” regime...

Impacts of mutation effects and population size on mutation rate in asexual populations: a simulation study

Background In any natural population, mutation is the primary source of genetic variation required for evolutionary novelty and adaptation. Nevertheless, most mutations, especially those with phenotypic effects, are harmful and are consequently removed by natural selection. For this reason, under natural selection, an organism will evolve to a lower mutation rate. Overall, the...

Conservation and Evolution in and among SRF- and MEF2-Type MADS Domains and Their Binding Sites

Serum response factor (SRF) and myocyte enhancer factor 2 (MEF2) represent two types of members of the MCM1, AGAMOUS, DEFICIENS, and SRF (MADS)-box transcription factor family present in animals and fungi. Each type has distinct biological functions, which are reflected by the distinct specificities of the proteins bound to their cognate DNA-binding sites and activated by their...