Quantitative-Concentration
Description
The “quantitative-concentration” module takes phenotype/bioactivity data in which the minimum inhibitory concentration (MIC) is specified per sample, i.e. the lowest concentration/dilution at which a phenotypic signal was still observed.
Samples that are inactive at every tested concentration are specified with 0. Multiple measurements can be specified for each sample, and multiple assays can be provided. The algorithm works as follows:
- Duplicate measurements per sample are averaged using either the mean or median.
- The areas of molecular features detected in more than three samples are z-transformed.
- MIC values are converted to their reciprocals (1 / measurement) or left as zero if the concentration was zero, then z-transformed.
- Transformed feature areas and MIC values are correlated using Pearson correlation.
- The resulting p-values are corrected for multiple hypothesis testing using a user-specified correction method (e.g. Bonferroni).
- Features that exceed user-defined thresholds for both correlation coefficient and adjusted p-value are classified as bioactivity-associated.
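The first four steps can be illustrated with a minimal sketch that uses only the Python standard library. It is not FERMO's implementation: the function names and the sample data are hypothetical, and the z-transformation here assumes the population standard deviation. The p-value calculation and the multiple-testing correction are left out.

```python
from math import sqrt
from statistics import mean, median, pstdev


def summarize(replicates, method="mean"):
    """Collapse the replicate MIC measurements of one sample into one value."""
    return median(replicates) if method == "median" else mean(replicates)


def reciprocal(mic):
    """Invert a MIC value; 0 (inactive sample) stays 0."""
    return 0.0 if mic == 0 else 1.0 / mic


def zscore(values):
    """Z-transform using the population standard deviation (an assumption)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical data: replicate MICs per sample and the areas of one feature.
# The feature is detected in four samples, i.e. in more than three.
mics = {
    "s1": [1.0, 1.0],
    "s2": [2.0, 2.0],
    "s3": [4.0, 4.0],
    "s4": [8.0, 8.0],
    "s5": [0.0, 0.0],  # inactive at every concentration
}
areas = {"s1": 8000.0, "s2": 4000.0, "s3": 2000.0, "s4": 1000.0, "s5": 0.0}

samples = sorted(mics)
activity = zscore([reciprocal(summarize(mics[s])) for s in samples])
abundance = zscore([areas[s] for s in samples])
r = pearson(abundance, activity)
```

In this constructed example the area is exactly proportional to the reciprocal MIC, so the coefficient `r` comes out as 1 up to floating-point error. Taking the reciprocal is what turns "low MIC, high area" into a positive correlation.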
Limitations
- This method assumes that the prerequisites with regard to sample reproducibility are met (see Input/Output).
- This method assumes a negative linear relationship between the phenotype (minimum inhibitory concentration) and the concentration of a feature (its area): the lower the minimum inhibitory concentration, the higher the concentration of the feature.
- This method does not take into account any synergistic or quenching effects.
Parameters
Key | Possible Values | Default |
activate_module | true, false | false |
sample_avg | mean, median | mean |
value | area | area |
algorithm | pearson | pearson |
fdr_corr | bonferroni, sidak, holm-sidak, holm, simes-hochberg, hommel, fdr_bh, fdr_by, fdr_tsbh, fdr_tsbky | bonferroni |
p_val_cutoff | 0.0-1.0 | 0.05 |
coeff_cutoff | 0.0-1.0 | 0.7 |
Explanation
sample_avg
: specifies the algorithm used to summarize multiple measurements per sample for the same assay. Possible algorithms are mean and median.
value
: specifies the value per feature to be correlated with concentration. Only area is currently allowed.
algorithm
: specifies the statistical algorithm to use. Only pearson is currently allowed.
fdr_corr
: the method used for false-discovery-rate correction. FERMO uses the statsmodels library for this purpose; please see its documentation for information on the different algorithms.
p_val_cutoff
: Maximum FDR-corrected p-value to consider, with zero disabling cutoff filtering for both p-value and coefficient.
coeff_cutoff
: Minimum correlation coefficient to consider, with zero disabling cutoff filtering for both p-value and coefficient.
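How the two cutoffs might interact with the correction step can be sketched as follows. This is an illustration under stated assumptions, not FERMO's code: `bonferroni` is a hand-written stand-in for the statsmodels correction, `select_features` and the input values are hypothetical, the comparisons are assumed to be inclusive, and a zero in either cutoff is read as switching off filtering altogether.

```python
def bonferroni(p_values):
    """Bonferroni correction: multiply each p-value by the number of tests, cap at 1."""
    m = len(p_values)
    return [min(p * m, 1.0) for p in p_values]


def select_features(results, p_val_cutoff=0.05, coeff_cutoff=0.7):
    """Return the IDs of features that pass both cutoffs.

    results maps a feature ID to a (correlation coefficient, raw p-value) pair.
    Assumption: a cutoff of zero disables filtering for both criteria.
    """
    ids = list(results)
    adjusted = bonferroni([results[i][1] for i in ids])
    if p_val_cutoff == 0 or coeff_cutoff == 0:
        return ids
    return [
        i
        for i, p_adj in zip(ids, adjusted)
        if results[i][0] >= coeff_cutoff and p_adj <= p_val_cutoff
    ]


# Hypothetical correlation results for three features
results = {
    "f1": (0.95, 0.001),   # strong correlation, small p-value
    "f2": (0.80, 0.04),    # p-value no longer passes after correction
    "f3": (0.30, 0.0001),  # coefficient below the cutoff
}

selected = select_features(results)
unfiltered = select_features(results, p_val_cutoff=0)
```

With the default cutoffs only `f1` is retained: the corrected p-value of `f2` rises to 0.12, and `f3` fails the coefficient cutoff despite its small p-value.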