Novel ChIP-seq simulating program with superior versatility: IsChIP

Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) is recognized as an extremely powerful tool to study the interaction of numerous transcription factors and other chromatin-associated proteins with DNA. The core problem in the optimization of ChIP-seq protocol and the following computational data analysis is that a 'true' pattern of binding events for a given protein factor is unknown. Computer simulation of the ChIP-seq process based on 'a-priory known binding template' can contribute to a drastically reduce the number of wet lab experiments and finally help achieve radical optimization of the entire processing pipeline. We present a newly developed ChIP-sequencing simulation algorithm implemented in the novel software, in silico ChIP-seq (isChIP). We demonstrate that isChIP closely approximates real ChIP-seq protocols and is able to model data similar to those obtained from experimental sequencing. We validated isChIP using publicly available datasets generated for well-characterized transcription factors Oct4 and Sox2. Although the novel software is compatible with the Illumina protocols by default, it can also successfully perform simulations with a number of alternative sequencing platforms such as Roche454, Ion Torrent and SOLiD as well as model ChIP -Exo. The versatility of isChIP was demonstrated through modelling a wide range of binding events, including those of transcription factors and chromatin modifiers. We also performed a comparative analysis against a few existing ChIP-seq simulators and showed the fundamental superiority of our model. Due to its ability to utilize known binding templates, isChIP can potentially be employed to help investigators choose the most appropriate analytical software through benchmarking of available ChIP-seq programs and optimize the experimental parameters of ChIP-seq protocol. isChIP software is freely available at https://github.com/fnaumenko/isChIP. © 2020 The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Authors
Subkhankulova T.1 , Naumenko F.2 , Tolmachov O.E.3 , Orlov Y.L. 4
Publisher
Oxford University Press
Number of issue
4
Language
English
Status
Published
Number
bbaa352
Volume
22
Year
2021
Organizations
  • 1 University College, London, WC1E 6BT, United Kingdom
  • 2 Novosibirsk State University, Russian Federation
  • 3 Imperial College London, UK and University of Greenwich, United Kingdom
  • 4 Digital Health Institute, I.M. Sechenov First Moscow State Med. Univ., and Sr. Scientist at Agrarian and Technol. Institute, Peoples' Friendship University of Russia (RUDN University), Russian Federation
Keywords
binding site/domain; binding template; ChIP-seq; ChIP-seq modelling/simulation; fragment size distribution; peak-calling programs; single cell ChIP-seq simulation; transcription factor
Date of creation
16.12.2021
Date of change
15.03.2022
Short link
https://repository.rudn.ru/en/records/article/record/76747/
Share

Other records