ISSN 2410-7751 (Print)
ISSN 2410-776X (Online)
Biotechnologia Acta V. 14, No 1, 2021
Р. 38-45, Bibliography 22, English
Universal Decimal Classification: 577.218
https://doi.org/10.15407/biotech14.01.038
O. Lykhenko, A. Frolova, M. Obolenska
Institute of Molecular Biology and Genetics of the National Academy of the Sciences of Ukraine, Kyiv
The purpose of the study was to provide the pipeline for processing of publicly available unprocessed data on gene expression via integration and differential gene expression analysis.
Data collection from open gene expression databases, normalization and integration into a single expression matrix in accordance with metadata and determination of differentially expressed genes were fulfilled. To demonstrate all stages of data processing and integrative analysis, there were used the data from gene expression in the human placenta from the first and second trimesters of normal pregnancy.
The source code for the integrative analysis was written in the R programming language and publicly available as a repository on GitHub. Four clusters of functionally enriched differentially expressed genes were identified for the human placenta in the interval between the first and second trimester of pregnancy.
Immune processes, developmental processes, vasculogenesis and angiogenesis, signaling and the processes associated with zinc ions varied in the considered interval between the first and second trimester of placental development. The proposed sequence of actions for integrative analysis could be applied to any data obtained by microarray technology.
Key words: microarray, transcript, integrative analysis, Bayesian empirical method, meta-analysis, differentially expressed genes, placenta.
© Palladin Institute of Biochemistry of National Academy of Sciences of Ukraine, 2021
References
1. Taminau J., Lazar C., Meganck S., Now? A. Comparison of merging and meta-analysis as alternative approaches for integrative gene expression analysis. ISRN Bioinform. 2014,V. 2014, P. 345106. https://doi.org/10.1155/2014/345106
2. Uitert M., Moerland P. D., Enquobahrie D. A., Laivuori H., Joris A. M. van der Post, RisStalpers C., Afink G. B. Meta-analysis of placental transcriptome data identifies a novel molecular pathway related to preeclampsia. PLoS One. 2015, V. 10, P. e0132468. https://doi.org/10.1371/journal.pone.0132468
3. Cosmin L., Meganck S., Taminau J., Steenhoff D., Coletta A., Molter C., WeissSol?s D. Y., Duque R., Bersini H., Now? A. Batch effect removal methods for microarray gene expression data integration: a survey. Briefings Bioinf. 2013, 14 (4), 469–490. https://doi.org/10.1093/bib/bbs037
4. Turnbull A. K., Kitchen R. R., Larionov A. A., Renshaw L., Dixon J., Sims A. H. Direct integration of intensity-level data from Affymetrix and Illumina microarrays improves statistical power for robust reanalysis. BMC Medical Genomics. 2012, 5 (1), 35. https://doi.org/10.1186/1755-8794-5-35
5. Tseng G. C., Ghosh D., Feingold E. Comprehensive literature review and statistical considerations for microarray metaanalysis. Nucleic acids research. 2012, 40 (9), 3785–99. https://doi.org/10.1093/nar/gkr1265
6. Lykhenko O., Frolova A., Obolenska M. Creation of gene expression database on preeclampsiaaffected human placenta. Biopolym. Cell. 2017, 33 (6), 442–452. https://doi.org/10.7124/bc.000967
7. Gautier L., Cope L., Bolstad B. M., Irizarry R. A. Affy — analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004, 20 (3), 307–315 https://doi.org/10.1093/bioinformatics/btg405
8. Irizarry R. A., Bolstad B. M., Collin F., Cope L. M., Hobbs B., Speed T. P. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2002, 31 (4), e15. https://doi.org/10.1093/nar/gng015
9. Zhao Yaxing, Limsoon Wong, Wilson Wen Bin Goh. How to do quantile normalization correctly for gene expression data analyses. Sci. Rep. 2020, 10 (15534), 1–11.https://doi.org/10.1038/s41598-020-72664-6
10. Frolova A. O, Bondarenko V. S., Obolens ka M. Yu. Cross-platform integration of experimental microarrays and its effect on the value of gene expression in the analysis of human breast cancer samples. Medychna informatyka ta inzheneriia. 2016, No 2, P. 5–14.
11. Sandberg R., Larsson O. Improved precision and accuracy for microarrays using updated probe set definitions. BMC Bioinf. 2007, 8 (1), 48. https://doi.org/10.1186/1471-2105-8-48
12. Johnson W. Evan, Cheng Li, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007, 8 (1), 118– 127. https://doi.org/10.1093/biostatistics/kxj037
13. Clifton Vicki. Sex-based functional features of human placentae. Zdorove zhenshchyni. 2011, No 4, P. 24–29. (In Ukrainian).
14. Buckberry Sam, Stephen J. Bent, Tina Bianco-Miotto, Claire T. Roberts, Author Notes. massiR: a method for predicting the sex of samples in gene expression microarray datasets. Bioinformatics. 2014, 30 (14), 2084–2085. https://doi.org/10.1093/bioinformatics/btu161
15. Venables W. N., Ripley B. D. Modern Applied Statistics with S. Springer Springer-Verlag. New York. 2002, 562 p. https://doi.org/10.1007/978-0-387-21706-2
16. Van Der Maaten, Hinton L. J. P., Hinton G. E. Visualizing High-Dimensional Data Using t-SNE. J. Machine Learning Res. 2008, V. 9, P. 2579–2605.
17. Smyth G. K. Bioinformatics and Computational Biology Solutions Using R and Bioconductor. limma: Linear Models for Microarray Data. Springer. 2005, P. 397–420. https://doi.org/10.1007/0-387- 29362-0_23
18. Szklarczyk D., Gable A. L., Lyon D., Junge A., Wyder S., Huerta-Cepas J., Simonovic M., Doncheva N. T., Morris J. H., Bork P. STRING v11: protein– protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019, 47 (Database), D607–D613. https://doi.org/10.1093/nar/gky1131
19. Albonici L., Benvenuto M., Focaccetti C., Cifaldi L., Miele M.T., Limana F., Manzari V., Bei R. PlGF Immunological Impact during Pregnancy. Int. J. Mol. Sci. 2020, 21 (22), 8714. https://doi.org/10.3390/ijms21228714
20. Scott G. F. Morphogenesis and Cell Adhesion. Development Biology, 6th edition. Sunderland (MA): Sinauer Associates. 2000, P. 325–335.
21. Wessels I., Maywald M., Rink L. Zinc as a Gatekeeper of Immune Function. Nutrients. 2017, 9 (12), 1286. https://doi.org/10.3390/ nu9121286 22. Williams R. J. Zinc: what is its role in biology? Endeavour. 1984, 8 (2), 65–70. https://doi. org/10.1016/0160-9327(84)9004=0-1