Investigation of Drought and Salinity Tolerance Related Genes and their Regulatory Mechanisms in Arabidopsis ( Arabidopsis thaliana )

Results: Under drought stress, 2558 gene accessions in root and 3691 in shoot tissues had significantly differential expression with respect to control condition. Likewise, under salinity stress 9078 gene accessions in root and 5785 in shoot tissues were discriminated between stressed and non-stressed conditions. Furthermore, the transcription regulatory activity of differentially expressed genes was mainly due to hormone, light, circadian and stress responsive cis-acting regulatory elements among which ABRE, ERE, P-box, TATC-box, CGTCA-motif, GARE-motif, TGACG-motif, GAG-motif, GA-motif, GATAmotif, TCT-motif, GT1-motif, Box 4, G-Box, I-box, LAMP-element, Sp1, MBS, TC-rich repeats, TCA-element and HSE were the most important elements in the identified up-regulated genes.


INTRODUCTION
Plants respond to abiotic stresses in various ways to survive under harsh environmental conditions.Plants have acquired various stress tolerance mechanisms including physiological and biochemical changes allowing adaptation to In Arabidopsis, the AtGenExpress project (http://www.arabidopsis.org/portals/expression/microarray/ATGenExpress.jsp) has collected thousands of transcript profiles on the basis of the Affymetrix ATH1 GeneChip that are now publicly available.This contribution has enabled the discovery of candidate genes on the basis of expression profiles in various tissues, developmental stages, and environmental conditions.Transcriptome analysis technologies have advanced to the point where high-through-put DNA sequencers and high-density microarrays such as tiling arrays are readily available [1].The results of several studies indicated the usefulness of the microarray technology to the analysis of expression profiles in response to abiotic stresses, such as drought, cold and high-salinity [19 -24].
In silico, identification and characterization of the genes in various organisms under different conditions are important due to growing data in the databases [25].According to the importance of bioinformatic tools to analyze current mass biology data to identify key genes and their regulatory pattern to achieve abiotic stresses tolerance mechanism, herein we assessed the most important and common genes contributing to the key factors in response to abiotic stress in model plant Arabidopsis thaliana that can be used in practical plant breeding programs for generated new abiotic stress tolerant staple food crops.Another important aim of this study was to characterize the regulatory mode of abiotic tolerance responsible genes.

Microarray Data Mining
The comparative analysis of the microarray data from different tissues of Arabidopsis under drought and salinity stresses was conducted.The Affymetrix raw data (.CEL) files from two independent experiments -drought stress time course (Unpublished data from Kudla Group, ME00338 (Expression Set: 1007966668)) and salt stress time course (Unpublished data from Kudla Group, ME00328 (Expression Set: 1007966888))-were retrieved from the TAIR dataset (https://www.arabidopsis.org/).The collected data were analyzed by FlexArray software version 1.6.3.In the first step, CEL data were normalized by Robust Multiarray Average (RMA) algorithm [26] and in the second step the RMA signal values were transformed into log 2 .In order to select Differentially Expressed Genes (DGEs), the RMA expression values were analyzed by two-sample student's t-test and the up-regulated genes with more than 1symmetrical raw fold changes, and down-regulated genes of less than 0-symmetrical raw fold changes were selected for further data analysis.p-values were adjusted in false discovery rate as less than <0.05.

Comparing Gene List
To compare results of microarray data analysis, the Venn diagram was used to identify the overlapped of the resulting genes (differentially expressed genes) and representing interactions among these gene sets in different tissues under both salinity and drought stresses conditions.The Venn diagram could be available in effective and free web application at http://bioinformatics.psb.ugent.be/webtools/Venn/.

Functional Annotation Analysis
Gene Ontology (GO) analysis was conducted with AgriGO database tool (http://bioinfo.cau.edu.cn/agriGO/).Singular Enrichment Analysis (SEA) was done to complete GO analysis of Arabidopsis thaliana species and the Affymetrix ATH1 genome Array was selected as a reference background.In the next step, the Fisher and Yekutieli (FDR under dependency) methods were chosen for statistical test and multi-test adjustment at 0.05 significance level, respectively.Finally, the results were described in three categories including, molecular function, biological processes and cellular components [27].

Cis-Acting Regulatory Element Analysis (CAREs Analysis)
Based on the GO analysis, the complete nucleotide sequence of 13 gene accession numbers (representative of 21 known genes from binding category) involved in transcription regulatory activity under salinity and drought stress conditions were downloaded from the National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/)database.For CAREs analysis, the 1.5 kbp of 5´ upstream of each gene was retrieved from Phytozome (http://www.phytozome.net/)database and they were subjected to the PlantCare (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/)database to identify common CAREs in the responsive genes for both salinity and drought stress conditions [28].

Identification of Differentially Expressed Genes
The availability of a large volume of microarray data provides an opportunity to study the gene expression at wholegenome, gene family or single gene level and to identify the gene(s) involved in various biological processes [29].
The results of t-test for the microarray data demonstrated 2558 and 3691 gene accessions were differentially expressed in response to drought stress in root and shoot tissue, respectively.For salt stress, 9078 gene accessions in root and 5785 gene accessions in shoot tissue were differentially expressed.
Differences and cross-talk of gene expression among drought and salinity stress responses were analyzed based on Venn diagram analysis.As shown in Fig. (1), 683, 1226, 2276 and 1634 gene accessions were identified as drought root-, drought shoot-, salinity root-and salinity shoot-induced genes with higher than one-fold change induction, respectively (Fig. 1).The list and expression data of up-regulated genes in response to drought and salinity stress are available as a supplementary material (Table S1).A number of 450 gene accessions were induced in response to both drought and salinity in shoot tissue, whereas only 224 gene accessions were induced by both drought and salinity stress in root tissue (Fig. 1).These results indicate the existence of greater crosstalk between drought and salinity stress signaling processes in shoot tissue, than those in root tissue.Analysis of overlapping on the Venn diagram showed that 49 gene accessions were induced under all two stresses (Table 1), of which 0.47% (49 out of 10467 of the gene accessions) were responded to drought stress regulated by salinity stress.Therefore, it can be concluded that there is a strong correlation between the responses of plant to drought and salinity stress.Analysis of stress-down-regulated as well as stress-up-regulated genes is important in understanding molecular responses to abiotic stresses [22].In the present study, a total of 635, 553, 3762 and 1186 gene accessions were identified as drought root-, drought shoot-, salinity root-and salinity shoot-down-regulated genes by means of microarray analysis (Fig. 2).The list and expression data on these drought and salinity stress-down-regulated gene accessions are available as a supplementary material (Table S2).A total 358 down-regulated gene accessions were observed in shoot tissue under both drought and salinity and expression of 362 number of gene accessions were decreased by both drought and salinity stress in root tissue (Fig. 2).The results indicated that 0.25% (27 out of 10645) of DEGs from both tissues showed possible crosstalk in response to drought and salinity stresses (Table 2).Based on the result presented in Figs.(1 and 2), the number of down-regulated genes was higher than the number of up-regulated ones indicating most of Arabidopsis genes have been switched off under drought and salinity stresses.

Functional Annotation Analysis of DGEs
The GO analysis of up-regulated common genes (49 gene accessions, Table 1) of root and shoot tissues under salinity and drought stress revealed that DEGs products were dissected into three distinct categories: biological process, molecular function and cellular component (Fig. 3).In the biological process category, the overrepresented genes belonged to the metabolic (42.86%) and cell (40.82%) processes (Fig. 3).In molecular function process, the largest group with 46.94% and 38.78% related to the binding and catalytic activity, respectively (Fig. 3).In the cellular component process, the largest group belonged to the cell (46.94%) and cell part (46.94%).To identify the functional key genes, we used non-repetitive genes distinct in each functional category.In metabolic process, the genes AT5G10770, AT2G01460, AT1G19190, AT5G03490 and AT1G51210 were identified to contribute to proteolysis, kinase, hydrolase and UDP-glucosyltransferase activity under both salinity and drought stresses.Proteome analysis has contributed significantly to understanding the plant response to abiotic stress through changes in the abundance of proteases [30].For instance, up-regulation of different subunits of the 20S proteasome has been detected in desiccation-tolerant maize embryos during desiccation [31], leaves of drought-treated Medicago sativa plants [32], wheat stems under drought [33], and during dehydration of desiccation tolerant grass Sporobolus stapfianus [34].On the other hand, in response to dehydration, proteins involved in proteolysis carried out by the proteasome were found to be expressed to constant levels in nuclei in the desiccation tolerant Xerophyta viscosa, probably to help maintain minimum viability in cells under stress [35].Specific proteolysis is essential for the activation of the many enzymes and regulatory proteins during plant development and diverse environmental changes [36].
There are different products of genes with kinase activity, which typically makes them useful for abiotic stress tolerance.Among the protein kinases involved in stress signal transduction in plants are those common to all eukaryotic organisms, i.e.Mitogen-Activated Protein Kinases (MAPKs) [36 -38], Glycogen Synthase Kinase3 (GSK3) [40 -41], S6 Kinase (S6K) [42], Calcium-Dependent Protein Kinases (CDPKs) [43 -45] and most of SNF1-related kinases (SnRKs).Also, SnRK2 family members are plant-specific serine/threonine kinases involved in plant response to abiotic stresses and abscisic acid (ABA)-dependent plant development [46].The results of GO analysis indicated that in metabolic process, the gene AT1G19190 had hydrolase activity and controls alpha/beta-Hydrolases superfamily protein expression.Recently, a novel α/β-Hydrolase gene IbMas containing maspardin domain from α/β -hydrolase superfamily has been discovered [47].The biological functions of α/β-hydrolase fold enzymes in organisms are widely variable and include biosynthesis, metabolism, signal transduction and gene regulation [48].To date, several of α/β-hydrolase fold enzymes (esterase, phospholipase D and OsPOP 5 ) have been shown to be involved in plant salt tolerance [49 -52].
For binding category HSFC1, AT3G24518, CID12, CID11, GATA10, GATA11, APC1, APC3, RHC2A, PDCB4, APRR2, GRP2, GRP2B, psaC, AT5G10770, NBR1, AT1G13190, AT5G55670, TIM23-1, TIM23-2 and TIM23-3 genes had important effect on DNA duplex unwinding, transcription factor activity, protein binding, RNA binding, mRNA binding and DNA binding protein.Among these genes, the role of AT3G24518 is noticeable because it codes a noncoding RNA.Accordingly, this gene could reflect various epigenetic regulation mechanisms i.e. histone modification and chromatin remodeling to induce a number of genes against different environmental stimuli.Furthermore, transcription factors like DRE-Binding Protein (DREB), C-repeat-Binding Factor (CBF), ABA-Binding Factor (ABF), myelocytomatosis oncogene (MYC) and myeloblastosis oncogene (MYB) [53 -55] have important role on regulation of the expression of environmental stress-inducible genes.DREBs (dehydration responsive element binding) are important plant Transcription Factors (TFs) regulating the expression of many stress-inducible genes mostly in an ABAindependent manner and play a critical role in improving the abiotic stress tolerance of plants by interacting with a DRE/CRT cis-element present in the promoter region of various abiotic stress-responsive genes [56].The protein binding is a critical part for the regulation of the genes against abiotic stresses [57].
RNA-binding proteins are important in cases such as post-transcriptional gene regulation mechanism.Most of RNA-binding proteins are plant specific with specific function.Definitely, recent research on RNA-binding proteins indicated that, beside their regulation function, they are also important for adaptation of plants to various environments.Their function is associated with the regulation of pre-mRNA splicing, polyadenylation, RNA stability and RNA export [58].The DREB TFs contain a highly conserved AP2/ERF DNA-binding domain across the plant kingdom including Arabidopsis, rice, soybean, chickpea, tomato, tobacco, and millets [56].
Phosphorylation may be necessary for the activation of proteins under drought-stress conditions, thus enhancing the DNA-binding activity of several transcription regulators [59].The functional gene group for DNA binding activity, plays a key role in providing tolerance to multiple stresses, generally in an ABA-independent manner through DRE/ CRT cis-elements and the AP2/ERF DNA binding domain [56].

Identification of Cares in the Promoter Site of Transcription Factors
Transcription Factors (TFs) have a critical role in response to unregular and harsh environmental condition such as abiotic stresses through regulation of gene expression.According to the results of GO analysis, 21 genes including HSFC1, AT3G24518, CID12, CID11, GATA10, GATA11, APC1, APC3, RHC2A, PDCB4, APRR2, GRP2, GRP2B, psaC, AT5G10770, NBR1, AT1G13190, AT5G55670, TIM23-1, TIM23-2 and TIM23-3 had significant role in binding activity as TF elements.In order to identify the presence of most presented CAREs that regulated the expression of the salinity and drought responsive genes in Arabidopsis, a genomic sequence of 1.5 kbp from aforementioned genes were used in the promoter analysis.The most present CAREs in the promoter region are shown in Table (3).CAREs functions were predominantly associated with light response, hormonal regulation and stress response that were detected in the upstream of DGEs under drought and salinity stress conditions in Arabidopsis (Table 3).The hormonal regulatory elements included Methyl jasmonate (MeJA), Gibberellin (GA), Ethylene (ET) and abscisic acid (ABA) responsive such as ABRE, ERE, P-box, TATC-box, CGTCA-motif, GARE-motif and TGACG-motif elements were presented in the majority of DGEs under salinity and drought stress conditions (Table 3).Plants perceive and respond adaptively to abiotic stress imposed by salt, cold, drought and wounding and the adaptive process is controlled mainly by the phytohormone, abscisic acid (ABA), which acts as an endogenous messenger in the regulation of the plant's water status [60].ABA is generated as a signal during a plant's life cycle to control developmental processes such as seed germination.The ABA function can target specifically guard cells for the induction of stomatal closure but may also involve in signaling pathway for adjustment towards severe water shortage conditions.Since various stresses induce ABA synthesis, it is considered as a plant stress hormone [61,62].Various transcription factors are known to regulate the ABA-responsive gene expression [61,63].ABA induced expression often relies on the presence of cisacting element called ABRE element (ABA-responsive element) [6,8,64].ABRE element regulates the dehydration and salinity responses in model plants like Arabidopsis and rice [65].In addition, this element has been involved in abiotic stress responses in Arabidopsis [66] and Brassica napus [67].Jasmonates (JAs) [Jasmonic Acid (JA) and methyl jasmonates (MeJAs)] are known to participate in various physiological processes [68].An important phytohormone, JA (Jasmonic Acid) and its methyl ester, methyl jasmonates (MeJAs), are derivatives of the fatty acid metabolism [69 -71].According to the results of this study, TGACG and CGTCA motives are involved in Methyl jasmonate (MeJA) response that affects the regulation of plant defense against biotic stresses.In a study the role of these motives in abiotic stress tolerance has been highlighted [72].The recently published research indicated both JA and MeJA promote plant defense against different stresses [73].In the case of ethylene (ET), the Ethylene Response Factor Protein, JERF3, has been demonstrated to activate the expression of oxidative genes, resulting in lower accumulation of ROS and, in turn, enhanced adaptation to drought, freezing, and salt stresses in tobacco [74].The effect of ethylene as stress hormones controlling adaptation to abiotic and biotic stress has been reported [75,76].In the present study, the results indicated that these elements interfere with drought and salinity stress responses.The presence of these hormonal regulatory elements in the majority of expressed genes helps to expand our knowledge to better understand the function of hormone responsive genes under different abiotic stresses in Arabidopsis and consequently in other plant species.The central role of the gibberellin (GA) class of growth hormones in the response to abiotic stress is becoming increasingly evident [77].Reduction in GA levels and signaling has been shown to contribute to plant growth restriction on exposure to several stresses, including cold, salt and osmotic stress.Conversely, increased GA biosynthesis and signaling promote growth in plant escape responses to shading and submergence.In several cases, GA signaling has also been linked to stress tolerance [77].The interaction of the GAsignaling molecule -DELLA-with the components of the signaling pathways for the stress hormone jasmonic acid indicates the role of an additional mechanism by which GA signaling may integrate multiple hormone signaling pathways in response to abiotic stresses [77].

Cis-Acting Regulatory Element (CARE) Function
The results of this study revealed that the light responsive elements such as GAG-motif, GA-motif, GATA-motif, TCT-motif, GT1-motif, Box 4, G-Box, G-box, I-box, LAMP-element and Sp1 were located in the promoter region of expressed salinity and drought responsive genes (Table 3).The result of CARE analysis demonstrated that the light responsive elements were mainly controlled by the responsive genes.These regulatory elements play a fundamental role in the regulation of transcriptional activity [78].GATA-box, G-box, Box-4, GAG, GAP, GA-motif and I-box are known as light responsive cis-elements (LREs).They have been found in the regulatory region of light-regulated genes, with the GATA-motif and I-box being the most prominent among them, apparently essential for light-controlled transcriptional activity [79,80].G-Box element is one of the parts of the response mechanism against abiotic stresses in Arabidopsis [81].GAG motif plays a major role in abiotic stress in tobacco [82].In addition, the role of G-Box element under drought stress on wheat ancestors has been reported [83].
The circadian clock in plants coordinates the responses to multiple simultaneous environmental challenges as the sessile plant cannot avoid.With respect to this issue, several circadian elements were found in drought and salinity responsive genes in this study (Table 3).These circadian CAREs control photoperiod-responsive genes.On the one hand, these genes are involved in opening and stomata closure, regulation of flowering, water use efficiency and response to hormone and light especially in leaves.They affect transpiration and regulate the biological processes and are involved in metabolic and physiological networks that are essential for growth, reproduction and proper response to abiotic and biotic stresses [84 -86].The role of this element to improve Arabidopsis tolerance against salinity, drought and cold stress has been confirmed [87].Therefore, our study confirmed the role of circadian regulatory element in response to salinity and drought stresses in Arabidopsis.
The stress responsive elements such as MBS were presented in the majority of responsive genes (Table 3).MBS is a binding site for MYB transcription factors, which control many abiotic stress responses [88].Genetic and molecular approaches have facilitated extensive functional characterization of MYB domain proteins, particularly the R2R3-type members in various plant species, including rice, maize and soybean.A genome wide comparative analysis of MYB genes and their expression in Arabidopsis and rice have indicated the potential role of several MYB domain proteins in plant stress responses [89].Several members of R2R3-type MYB transcription factors are involved in the regulation of phenylpropanoid pathway and production of various secondary metabolic compounds under abiotic stress conditions in plants.In addition, most studies declared the role of MBS element against drought stress in common Bean [90] and maize [91].
TC-rich repeats and TCA-element were another CAREs that are present in the majority of responsive genes (Table 3).The TC-rich repeats and TCA-element have been associated with stress responses according to previous reports [92 -95].
In the present study, the HSE element has been found in the regulatory region of responsive genes showing a possible role of this CARE in response to salinity and drought stress in Arabidopsis.TF is an HSF that binds to HSE cis-acting elements in the promoter of stress-inducible genes and plays central roles in the acquisition of plant tolerance against abiotic stresses [96].Furthermore, ARE regulatory elements are essential for the induction of anaerobic respiration (Table 3).The role of these elements against abiotic stresses is limited, although our study indicates the possibility of the role of these elements in tolerance against drought and salinity stresses.
Moreover, other elements such as GCN4_motif and Skn-1 motives were present in the majority of responsive genes specifically in endosperm (Table 3).Although our study shows that these elements could be considered as regulatory candidates against abiotic stresses, their function and contribution are still unknown.

CONCLUSION
In the present study, the expression pattern and regulation network of Arabidopsis as genetic model crop under salinity and drought stresses were highlighted.According to higher number of co-expressed gene accessions in shoot (450 genes) with respect to root tissue (224 genes) under both salinity and drought stresses it can be concluded that there was a great crosstalk between drought and salinity stress signaling processes in shoot tissue.Also, there was a strong correlation between the responses of plant to drought and salinity stress.Furthermore, a number of 13 gene accessions had important effect on binding activity such as DNA duplex unwinding and transcription factor activity.Among genes, the role of AT3G24518 was remarkable because it codes a non-coding RNA which suggests some possible epigenetic regulation mechanisms like histone modification and chromatin remodeling.The result of CARE analysis indicated that hormonal regulatory and light responsive elements were the most presented CAREs in the majority of DGEs under salinity and drought stress conditions.Overall, these kinds of studies on model plants and such high-throughput analyses give an insight into the genetic architecture of important plants like cereals under stress condition and consequently help plant breeders to perform comprehensive breeding programs in order to develop abiotic stress-tolerant cultivars.

ETHICS APPROVAL AND CONSENT TO PARTICIPATE
Not applicable.

HUMAN AND ANIMAL RIGHTS
No Animals/Humans were used for studies that are base of this research.

CONSENT FOR PUBLICATION
Not applicable.

Fig. ( 1 )
Fig. (1).Venn diagram of the number of up-regulated genes differentially expressed in response to drought and salinity stresses in Arabidopsis.

Fig. ( 2
Fig. (2).Venn diagram of the number of down-regulated genes differentially expressed in response to drought and salinity stresses in Arabidopsis.

Fig. ( 3 )
Fig. (3).Functional categorization of common genes in different tissues (root and shoot) of Arabidopsis in response to drought and salinity stresses.

5UTR
Py-rich stretch cis-acting element conferring high transcription levels ABRE cis-acting element involved in the abscisic acid responsiveness ARE cis-acting regulatory element essential for the anaerobic induction Box 4 part of a conserved DNA module involved in light responsiveness Investigation of Drought and Salinity The Open Bioinformatics Journal, 2018, Volume 11 21 Cis-Acting Regulatory Element (CARE) Function CAAT-box common cis-acting element in promoter and enhancer regions CGTCA-motif cis-acting regulatory element involved in the MeJA-responsiveness circadian cis-acting regulatory element involved in circadian control ERE ethylene-responsive element GAG-motif part of a light responsive element GA-motif part of a light responsive element GATA-motif part of a light responsive element GARE-motif gibberellin-responsive element G-Box cis-acting regulatory element involved in light responsiveness G-box cis-acting regulatory element involved in light responsiveness GT1-motif light responsive element GCN4_motif cis-regulatory element involved in endosperm expression HSE cis-acting element involved in heat stress responsiveness I-box part of a light responsive element LAMP-element part of a light responsive element MBS MYB binding site involved in drought-inducibility P-box gibberellin-responsive element Skn-1-motif cis-acting regulatory element required for endosperm expression Sp1 light responsive element TATA-box core promoter element around -30 of transcription start TATC-box cis-acting element involved in gibberellin-responsiveness TCA-element cis-acting element involved in salicylic acid responsiveness TC-rich repeats cis-acting element involved in defense and stress responsiveness TCT-motif part of a light responsive element TGACG-motif cis-acting regulatory element involved in the MeJA-responsiveness