SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

PREDICTIVE COMPARATIVE QSAR ANALYSIS OF
AS 5-NITROFURAN-2-YL DERIVATIVES MYCO
BACTERIUM TUBERCULOSIS H37RV
INHIBITORS
Doreswamy1 and Chanabasayya .M. Vastrad2
1

Department of Computer Science Mangalore University , Mangalagangotri-574 199, Karnataka,
INDIA
2
Department of Computer Science Mangalore University , Mangalagangotri-574 199, Karnataka,
INDIA

ABSTRACT
Antitubercular activity of 5-nitrofuran-2-yl Derivatives series were subjected to Quantitative Structure
Activity Relationship (QSAR) Analysis with an effort to derive and understand a correlation between the
biological activity as response variable and different molecular descriptors as independent variables.
QSAR models are built using 40 molecular descriptor dataset. Different statistical regression expressions
were got using Partial Least Squares (PLS) ,Multiple Linear Regression (MLR) and Principal Component
Regression (PCR) techniques. The among these technique, Partial Least Square Regression (PLS)
technique has shown very promising result as compared to MLR technique A QSAR model was build by a
training set of 30 molecules with correlation coefficient (	‫ ݎ‬ଶ ) of 0.8484 , significant cross validated
	 ଶ
correlation coefficient (‫ ݍ‬ଶ ) is 0.0939, ‫ ݐݏ݁ݐ	ܨ‬is 48.5187, ‫ ݎ‬ଶ 		for external test set (‫ ) ݎ_݀݁ݎ݌‬is -0.5604,
coefficient of correlation of predicted data set (‫ ݎ_݀݁ݎ݌‬ଶ ‫ )݁ݏ‬is 0.7252 and degree of freedom is 26 by
Partial Least Squares Regression technique.

KEYWORDS
TB, MLR , PLS , PCR , LOO

1. INTRODUCTION
Tuberculosis in humans is generally caused by mycobacterium tuberculosis(TB). The desease is
spread by respirable droplets generated during effective expiratory manoeuvres such as coughing.
TB desease can be either active or latent[1] . The World Health Organization (WHO) asses that
within the next twenty years about thirty million people will be troubled with the bacillus [2-3].
The analytic management of TB has depends dully on a limited number of drugs such as
Isonicotinic acid, Hydrazide,Rifadin, Rimactane ,Myambutol ,Streptomycin, Ethionamide,
Pyrazinamide, Fluroquinolones etc [4]. Still with the origin of these special chemical drugs the
DOI: 10.5121/hiij.2013.2404

47
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

spread of TB has not been eradicated completely because of delayed treatment programmes
.There is now recognition that new drugs to treat TB are necessarily required, particularly for use
in shorter medication procedure than are possible with the current agents and which can be
engaged to treat multi-drug resistant and hidden disease[5].
5-nitrofuran-2-yl shows effective in vitro and in vivo antimycobacterial activity [6]. There is also
a great effort to find and develop newer, 5-nitrofuran-2-yl, and some of them might have value in
the remedy of TB [7]. Chemo informatics[26] and computer-aided drug design (CADD) are
likely to contribute to a possible solution for the dangerous situation regarding this infectious
disease by assisting in the swift identification of new effective anti-TB agents. The other way for
overcoming the absence of empirical analysis for biological systems is depends on the activity to
develop quantitative structure activity relationship (QSAR) [8] . QSAR models are mathematical
expressions formulating a relationship between chemical structures and biological activities.
These models have different capability, which is providing a deeper knowledge about the process
of biological activity. In the first step of a usual QSAR study one needs to find a set of molecular
descriptors with the higher influence on the biological activity of interest [9]. A broad scope of
molecular descriptors[10] has been used in QSAR modeling. These molecular descriptors[11]
have been categorised into different classes, including constitutional, geometrical, topological,
quantum chemical and so on. Using such an way one could predict the activities of newly
formulated compounds before a conclusion is being made whether these compounds should be
truly synthesized and tested. We examine the performance of Partial Least Squares(PLS) based
QSAR models with the results produced by Multi Linear Regression(MLR ) and Principal
Component Regression (PCR) methods to discover basic requirements for additional bettered
antitubercular activity.

2. MATERIALS AND METHODS
2.1 MOLECULAR DESCRIPTOR DATA SETS
A set of fourty molecule compounds relates to derivatives for mycobacterium TB(H37Rv)
inhibitors were taken from large antitubercular drug molecule databases[12] using substructure
mining tool Schrodinger Canvas 2010(Trial version) [13]. All molecules were handled by the
Vlife MDS [14] - 2D coordinates of atoms were recalculated counter ions and salts were
eliminated from molecular structures, molecules were neutralized, mesomerized and aromatized.
Data sets were then refined from duplicates. The 2D-QSAR models were produced using a
training set of thirty molecules. Predictive ability of the models was assessed by a test set of ten
molecules with consistently distributed biological activities. The observed selection of test set
molecules was made by seeing the fact that test set molecules shows a range of biological activity
similar to the training set. The actual and predicted biological activities of the training and test
set molecules are given in Table 1.

48
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

Sl
no
1

Compound

IC50a(µg/ml)

PIC50b
Obsr
Pred

Residual

4.41

5.3820

0.027

6.41

5.193

5.2282

0.0352

3.29

5.482

5.2282

0.2538

8.7

5.060

5.2282

0.1682

11.38

4.943

6.0161

1.0731

6.83

5.165

5.3820

0.217

2.53

5.596

5.3820

0.214

-4.35

5.361

5.3820

0.021

-0.95

H3 C

2

5.355

6.022

5.3820

0.64

-4.12

5.385

5.3820

0.003

-7.8

5.062

5.3820

0.32

CH 3
HN

O

O

3

4
H3 C
HN

O

O

5

6

7

H3 C

8

O
CH 3
S
Cl

N

O
O

HN

O

CH

9

O
O

O
S

N

H3 C

O

O
HN
H3C

O

O

CH

10

O
O

O
S

N

H3 C

O

O
HN
H3C

O

O

11

H3 C

O
O
CH 3
S
N

O
O

H3C
HN

O

49
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

12
-0.82

6.086

6.0867

0.0007

4.01

5.317

5.2282

0.0888

3.9

5.408

5.2282

0.1798

-5.44

5.264

5.2282

0.0358

0.15

6.823

5.3820

1.441

6.79

5.168

5.3820

0.214

9.26

5.033

5.3820

0.349

-3.15

5.501

5.3820

0.119

6.04

5.218

5.5359

0.3179

-3.47

5.459

5.3820

0.077

13
CH 3
O
O
CH 3
H3 C

S
O

N
O
O

HN

H
N

O
O

14

15

16

17

18

19

20

21

50
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

22
-4.23

5.373

5.3820

0.009

-2.48

5.605

5.3820

0.223

-1.65

5.782

5.3820

0.4

-3.62

5.441

5.6625

0.2215

6.2

5.207

5.3820

0.175

98.23

4.007

4.0090

0.002

-5.11

5.291

5.2282

0.0628

1.15

5.939

5.2282

0.7108

-6.59

5.181

5.2282

0.0472

-0.08

7.096

7.0986

0.0026

23

24

O

25

CH 3

O
O

NH

O

O

O

O

26

27

28

29

O

30
H2 N

N
N

O

O

N

31
H3 C

O

O
O

51
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

32

O

N
O

0.15

6.823

6.7122

0.1108

-2.55

5.593

5.3820

0.211

-1.05

5.978

5.0744

0.9036

-9.38

5.027

5.2282

0.2012

-4.8

5.318

5.0744

0.2436

10.73

4.969

5.0744

0.1054

6.25

5.204

5.2282

0.0242

4.26

5.370

5.2282

0.1418

-8.01

N

5.096

5.2282

0.1322

O

33

O

F

HN

O

34

35

H3 C
O
HS

N
Cl
Cl
HN
Cl
HN

O

O

36

37

38

CH 3

O

HN

O

O

39

40

Table 1 Molecular structure with Observed and Predicted activity of 5-nitrofuran-2-yl used in
training and test set using Model-1 (PLS)
Expt. = Experimental activity, Pred. = Predicted
activity IC50a = Compound concentration in micro mole required to inhibit growth by 50%
52
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

PIC50b = -Log (IC50 × 10ି଺): Training data set developed using model 1 (PLS) and Test set
is light blue shaded.

2.2 BIOLOGICAL OBSERVED ACTIVITY DATA
For the evolution of QSAR models of 5-nitrofuran-2-yl ,of processes antitubercular activity in
terms of half maximum inhibitory concentration IC50 (µM) versus (H37Rv) strains were took
from the antitubercular drug molecule databases[12]. The IC50 activity data contains only
molecules that have at least exhibited some activity. The biological activity data (IC50) were
transformed in to pIC50 according to the formula pIC50 = (−log	(IC50	 × 10ି଺)) was used as
response values, thus correlating the data linear to the free energy change.

2.3 DESCRIPTOR CALCULATION FOR MOLECULAR DATASET
The VLife MDS tool used for the computation of various molecular descriptors containing
topological index (J), connectivity index (x), radius of gyration (RG), moment of inertia, Wiener
index(W), balabian index(J), centric index, hosoya index (Z), information based indices, XlogP,
logP, hydrophobicity, elemental count, path count, chain count, pathcluster count, molecular
connectivity index (chi), kappa values, electro topological state indices, electrostatic surface
properties, dipole moment, polar surface area(PSA), alignment independent descriptor
(AI)[11,14. The calculated molecular descriptors were gathered in a data matrix. The
preprocessing for the generated molecular descriptors was done by removing invariable (constant
column) and cross-correlated descriptors (with r = 0.99). which happen in total 156, 125 and 162
molecular descriptors for MLR, PCR and PLS accordingly to be used for QSAR analysis.

2.4 CREATION OF TRAINING AND TEST SET
The dataset of forty molecular descriptors is split s into training and test set by Sphere Exclusion
(SE)[15-16] technique. In this technique initially data set splits into training and test set using
sphere exclusion technique. In this technique variance value provides an idea to handle training
and test set size. It needs to be adapted by trial and error until a desired split of training and test
set is acquired. Increase in dissimilarity value results in increase in number of molecules in the
test set. This technique is used for MLR, PCR and PLS models with pIC50 activity data as
response variable and various 2D molecular descriptors computed for the molecules as
independent variables.

2.5 MODEL VALIDATION
Model validation [17-18] is a essential manner of quantitative structure–activity relationship
(QSAR) modelling. This is done to test the internal stability and predictive capability of the
QSAR models. These three QSAR models were validated by the following method.
2.5.1 INTERNAL MODEL VALIDATION
Internal model validation was carried out using leave-one-out (LOO-Qଶ ) method. For calculating
qଶ , each sample in the training set was eliminated once and the activity of the eliminated sample
was predicted by using the model developed by the remaining samples. The Qଶ computed using
the expression which explains the internal strength of a model.
53
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013
∑(ଢ଼

Qଶ = 1 − ∑(ଢ଼ ౦౨౛ౚ	

ି	ଢ଼౥ౘ౩ )మ 				

(1)

మ
౥ౘ౩ 	ି	ଢ଼ౣ౛౗౤ ) 				

In Eq. (1), Y୮୰ୣୢ and Y୭ୠୱ indicate predicted and observed activity values accordingly and Y୫ୣୟ୬
signify mean activity value. A model is considered acceptable when the value of Qଶ exceeds 0.5.
2.5.2 EXTERNAL MODEL VALIDATION
External model validation, the activity of each sample in the test set was predicted using the
model created by the training set. The pred_r ଶ value is computed as follows.

pred_r ଶ =

∑(ଢ଼౦౨౛ౚ(౪౛౩౪) ି	ଢ଼౪౛౩౪ )మ 				
		∑(ଢ଼౪౨౗౟౤ ି	ଢ଼ౣ౛౗౤(౪౨౗౟౤) )మ 				

(2)

In Eq (2) Y୮୰ୣୢ(୲ୣୱ୲) and Y୲ୣୱ୲ indicate predicted and observed activity values for the test set and
Y୲୰ୟ୧୬ indicates mean activity value of the training set. For the predictive QSAR model, the value
of pred_r ଶ must be more than 0.5.
2.5.3 RANDOMIZATION TEST
Randomization test or Y-scrambling is key mean of statistical validation. To assess the statistical
importance of the QSAR model for the dataset, one tail hypothesis testing is used. The strength
of the models for training sets was tested by examining these models to those derived for random
datasets. Random sets were produced by rearranging the activities of the samples in the training
set. The statistical model was determined using different randomly reorganize activities (random
sets) with the chosen molecular descriptors and the equivalent Qଶ were computed. The
importance of the models for that reason obtained was developed based on a computed Zୱୡ୭୰ୣ.
A Z score value is calculated by the following equation:
(୦	ି		ஜ)	

																																																						Zୱୡ୭୰ୣ 	=		
																																																						
		

஢

(3)

Where ℎ is the Qଶ value computed for the dataset, µ the mean Qଶ , and is its σ standard deviation
calculated for various iterations using models build by different random datasets. The probability
(a) of importance of randomization test is derived by comparing Zୱୡ୭୰ୣ 	value with Zୱୡ୭୰ୣ critical
value as stated, if Zୱୡ୭୰ୣ value is less than 4.0; otherwise it is computed by the expression as
given in the literature. For example, a Zୱୡ୭୰ୣ value more than 3.10 proposes that there is a
probability (a) of smaller than 0.001 that the QSAR model build for the dataset is random. The
randomization test proposes that all the created models have a probability of less than 1% that the
model is produced by chance.

2.6 MULTIPLE LINEAR REGRESSION (MLR) ANALYSIS
MLR technique used for modelling linear relationship between a response variable Y (pIC50) and
independent variables X (2D molecular descriptors). MLR is based on least squares technique: the
model is fit such that sum-of-squares of differences of actual and a predicted values are
minimized. MLR estimates the regression coefficients (	r ଶ ) by applying least squares fitting
54
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

technique. The model builds a relationship in the form of a straight line (linear) that best
estimates all the individual data points. In regression analysis, conditional mean of response
variable (pIC50) Y	depends on (molecular descriptors)X. MLR analysis add to this idea to include
more than one independent variables. Regression expression takes the form.
(4)
Y = 	 bଵ xଵ + bଶ xଶ + bଷ xଷ + c
where Y	is a response variable, ‘b’s are regression coefficients for corresponding ‘x’s are
molecular descriptors(independent variables), ‘c’ is a regression constant or intercept [19,25].

2.7 PRINCIPAL COMPONENT REGRESSION (PCR) ANALYSIS
Principal Component Regression (PCR) is a regression technique that uses principal component
analysis(PCA) when evaluating regression coefficients. PCR presents a technique for finding
structure in datasets. Its object is to group correlated variables, replacing the earlier descriptors
by new set called principal components (PCs). These PC’s are uncorrelated and are developed as
a simple linear aggregation of earlier variables. It moves the data into a new set of axes such that
first few axes indicates most of the variations within the data. First PC (PC1) is expressed in the
direction of maximum variance of the whole dataset. Second PC (PC2) is the direction that
defines the maximum variance in orthogonal subspace to PC1. Consequent components are taken
orthogonal to the particular formerly chosen and defines best of remaining variance, by locating
the data on new set of axes, it can points major fundamental structures certainly. Value of each
point, when moved to a given axis, is called the PC value. PCA chooses a new set of axes for the
data. These are chosen in decreasing order of variance within the data. The aim of principal
component PCR is the computation of values of a response variable on the basis of chosen PCs of
independent variables [21].

2.8 PARTIAL LEAST SQUARES (PLS) REGRESSION ANALYSIS
PLS is a well known regression technique which can be used to correlate one or more response
variable (Y)	to various independent variables(X) . PLS relates a matrix Y of response variables to
a matrix X of molecular descriptors. PLS is useful in conditions where the number of molecular
descriptors( independent variables) exceeds the number of samples, when X data contain
colinearties or when N is less than 5M, where N	is number of samples and M is number of
response variables. PLS builds orthogonal components using existing correlations between
independent variables and corresponding outputs while also keeping most of the variance of
independent variables. Major aim of PLS regression is to predict the activity (Y) from X and to
define their common frame[22,23] . PLS is probably the least contrary of various multivariate
extensions of MLR model. PLS is a technique for constructing predictive models when factors
are many and highly collinear.

2.9 EVALUATION OF THE QSAR MODELS
The created QSAR models are computed using the following statistical parameters: N (Number
of samples in regression); K (Number of independent variables(molecular descriptors)); DF
(Degree of freedom); optimum component ( number of optimums); r ଶ		 ( the squared correlation
coefficient); F test (Fischer’s Value) for statistical importance; qଶ (cross-validated correlation
55
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

coefficient); pred_r ଶ (	r ଶ for external test set); Zୱୡ୭୰ୣ ( Z score computed by the randomization
test); Best_ran_r ଶ 	 (maximal r ଶ		 value in the randomization test)	;	Best_ran_qଶ 	(maximal qଶ value
in the randomization test) ; α	( statistical importance parameter obtained by the randomization
test). The correlation coefficient r ଶ is a respective standard of fit by the regression expression. It
expressed the part of the variation in the observed data is analyzed by the regression. Despite, a
QSAR models are examined to be predictive, if the following prerequistes are satisfied: r ଶ > 0.6 ,
qଶ > 0.6 and pred_r ଶ > 0.5 [24] . The F-test indicates the ratio of variance described by the
model and variance due to the error in the regression. High values of the F-test indicate that
model is statistically meaningful. The reduced standard error of pred_r ଶ se	, qଶ _se and r ଶ _se
demonstrates actual value of the fitness of the model. The cross-correlation extent was set at 0.5.

3. RESULTS
Taining set of 30 and 10 of test set of 5-nitrofuran-2-yl having different substitution were
employed.

3.1 CREATION OF QSAR MODELS
3.1.1 PARTIAL LEAST SQUARES (PLS) REGRESSION ANALYSIS
The molecular descriptors were applied to PLS technique to developQSAR models by using
simulated anealing variable selection mode. PLS model is having following QSAR Eq.(5) with
five descriptors.
pIC50	 =
	1.8704(StsCcount) + 4.0747(chi5chain) − 0.6865(SaaaCcount) + 0.7046(SssScount) −
0.1538	(SdssCcount) + 4.9478
(5)
Table 2 Statistical parameters of PLS, MLR And PCR
Parameters
N
DF
rଶ

PLS
40
26
0.8484

MLR
40
24
0.8484

PCR
40
28
0.3289

qଶ
F-test
best_ran_r	ଶ	
best_ran_qଶ	
Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ
	Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ

0.0939
48.5187
0.56429
−0.03892
3.43122
1.59111

0.0932
26.8725
0.28620
−0.08598
8.11471
1.31886

−5.3805
13.7231
0.32891
−2.93782
2.81258
−2.47533

α_ran_r ଶ
α_ran_q	ଶ

0.00100
0.10000

0.00000
0.10000

0.01000
99.00000

													r ଶ _se

0.2277

0.2370

0.4617

q _se
pred_r ଶ

0.5568
−0.5604

0.5797
−0.5616

1.4237
−0.0734

pred_r ଶ se

0.7252

0.7255

0.6015

ଶ

56
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

The above analysis directs to the improvement of statistically meaningful QSAR model, which
allows understanding of the molecular properties/features that play an key role in governing the
variation in the activities. In addition, this QSAR study allowed examining influence of very
simple and easy-to-compute molecular descriptors in discovering biological activities, which
could shed light on the important factors that may aid in design of new potent molecules.
All the parameters and their significance, which contributed to the specific
inhibitory activity in the generated model is discussed below.

antitubercular

1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a
single bond exist in the molecule. Positive Contribution of this descriptor to the model is
31.72%.
2.chi5chain: This descriptor signifies a retention index for five membered ring. Positive
Contribution of this descriptor to the model is 21.99%.
3. SaaaCcount: This descriptor defines the total number of carbon connected with three aromatic
bonds. Negative Contribution of this descriptor to the model is -23.28%.
4. SssScount: This descriptor indicates the total number of sulphur atom attached with two
single bonds. Positive Contributions of this descriptor to the model is 11.95%.
5. SdssCcount: This descriptor defines the total number of carbon connected with one double
and two single bond. Negative Contribution of this descriptor to the model is -11.06%.

Figure 1 Observed vs. Predicted activities for training and test set molecular descriptors by Partial Least
Square model. (A) Training set (Red dots) (B) Test Set (Blue dots).

The PLS model gave correlation coefficient (	r ଶ ) of 0.8484, significant cross validated correlation
coefficient (	qଶ ) of 0.0939, F-test of 48.5187 and degree of freedom 26. The model is validated
by α_ran_r ଶ = 0.00100, α_ran_q	ଶ = 0.10000, best_ran_r	ଶ	 = 	0.56429, best_ran_qଶ	 = -0.03892,
Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ = 3.43122 and Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ =1.59111. The randomization test proposes that the
created model have a probability of smaller than 1% that the model is build by chance. Statistical
57
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

data is presented in Table 2. The graph of observed vs. predicted activity is demonstrated in
Figure 1. The descriptors which contribute for the QSAR model is demonstrated in Figure 2.

Figure 2 Percentage contribution of each descriptor in created PLS model describing variation in the
activity

3.1.2 MULTIPLE LINEAR REGRESSION (MLR) ANALYSIS
The QSAR analysis by Multiple Linear Regression method with simulated annealing variable
selection technique, the final QSAR model is created having five descriptors is shown in Eq. (6).
pIC50	 =
1.8681(±	0.2421)StsCcount + 4.0722(±	0.7599)chi5chain −
0.6879(±	0.1139)SaaaCcount + 0.7033(±	0.2393)SssScount −
0.1548	(±	0.0265)SdssCcount + 4.9497

(6)

MLR Model has a correlation coefficient (	r ଶ ) of 0.8484, significant cross validated correlation
coefficient (	qଶ ) of 0.0932, F test of 26.8725 and degree of freedom 24. The model is validated
by α_ran_r ଶ = 0.00000 , α_ran_q	ଶ = 0.10000, best_ran_r	ଶ	 = 0.28620, best_ran_qଶ =
−0.08598	,	Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ = 	8.11471					andZୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ = 1.31886	
The randomization test proposes that the created model have a probability of smaller than 1%
that the model is build by chance. The observed and predicted values with residual values are
demonstrated in Table 1.Statistical data is demonstrated in Table 2.The graph of observed vs.
predicted activity demonstrated is in Figure 3. The descriptors which contribute for the QSAR
model are demonstrated in Figure 4. All the parameters and their significance, which contributed
to the specific antitubercular inhibitory activity in the generated models are explained below.

Figure 3 Observed vs. Predicted activities for training and test set molecular descriptors from the Multiple
Linear Regression model. (A) Training set (Red dots) (B) Test Set (Blue dots).
58
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a
single bond present in the molecule. Positive Contribution of this descriptor to the model is
31.67%.
2.chi5chain: This descriptor signifies a retention index for five membered ring. Positive
Contribution of this descriptor to the model is 21.97%.
3. SaaaCcount: This descriptor defines the total number of carbon connected with three aromatic
bonds. Negative Contribution of this descriptor to the model is -23.32%.
4. SssScount: This descriptor indicates the total number of sulphur atom attached with two
single bonds. Positive Contributions of this descriptor to the model is 11.92%.
5. SdssCcount: This descriptor defines the total number of carbon connected with one double
and two single bond. Negative Contribution of this descriptor to the model is -11.12%.

Figure 4 Percentage contribution of each descriptor in created MLR model describing variation in the
activity.

3.1.3 PRINCIPAL COMPONENT REGRESSION (PCR) ANALYSIS
The molecular descriptors were applied to under goes PCR technique to create QSAR model with
Simulated anealining variable selection mode by using PCR model. The final QSAR model is
Eq. (7) was created having one descriptor as follows.
pIC50	 = 1.7397StsCcount + 5.3563

(7)

The PCR model gave correlation coefficient (	r ଶ ) is 0.3289, significant cross validated
correlation coefficient (	qଶ ) of -5.3805, F test of 13.7231 and degree of freedom 28. The model is
validated by α_ran_r ଶ = 0.01000, α_ran_q	ଶ 	= 99.00000, best_ran_r	ଶ	 = 0.32891, best_ran_qଶ	
=-0.13938 , Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ 	= 2.81258 and Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ 	=-2.47533. The randomization test proposes
that the created model have a probability of smaller than 1% that the model is build by chance.
59
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

Statistical data is demonstrated in Table 2. The graph of observed vs. predicted activity is in
demonstrated Figure 5 .The descriptors which contribute for the QSAR model is demonstrated in
Figure 6.

Figure 5 Observed vs. Predicted activities for training and test set molecular descriptors by Principal
Component Regression model. A) Training set (Red dots) B) Test Set (Blue dots).

All the parameters and their significance, which contributed to the specific
inhibitory activity in the generated models are discussed here.

antitubercular

1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a
single bond present in the molecule. Positive Contribution of this descriptor to the model is
100%.

Figure 6 Percentage contribution of each descriptor in developed PCR model describing variation in the
activity
60
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013

4. CONCLUSION
The 2D QSAR analysis were conducted with a series of 5-nitrofuran-2-yl derivatives for
mycobacterium tuberculosis(H37Rv) inhibitors , and some useful predictive models were
obtained. The physicochemical molecular descriptors were found to have an key role in
governing the change in activity. The statistical parameters demonstrate the estimation power of
QSAR model for the molecular descriptor data set from which it has been determined and
evaluate it only internally. The overall performance of prediction was found to be around 84% in
case of PLS and MLR. Among the three 2D-QSAR models (MLR, PCR, and PLS), the results of
PLS and MLR analysis showed significant predictive power and reliability as compare to PCR
technique.

ACKNOWLEDGEMENTS
The Authors are thankful to Dr Mahesh .B. Palkar Department of Pharmaceutical Chemistry
K.L.E Pharmacy College Hubli

REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]T

[7]

[8]
[9]
[10]
[11]
[12]
[13]
[14]

“Tuberculosis”, Centers for Disease Control and Prevention1600 Clifton Rd. Atlanta, GA 30333,
USA http://www.cdc.gov/tb/topic/basics/default.htm
“Weekly Epidemiological Record (WER)”,WHO annual report on global TB control – summary
http://www.who.int/wer/2003/wer7815/en/index.html
Phyllis C. Braun, PhD and John D. Zoidis, MD, “Update on Drug-Resistant Pathogens: Mechanisms
of Resistance, Emerging Strains”, http://www.rtmagazine.com/issues/articles/2004-01_01.asp
“Tuberculosis
management”,
From
Wikipedia,
the
free
encyclopaedia
http://en.wikipedia.org/wiki/Tuberculosis_management
”Multidrug-resistant tuberculosis (MDR-TB)” , From World Health Organization
http://www.who.int/tb/challenges/mdr/en/
awari NR, Bairwa R, Ray MK, Rajan MG, Degani MS. Bioorg Med Chem Lett. (2010 ),“Design,
synthesis,and biological evaluation of 4-(5-nitrofuran-2-yl)prop-2-en-1-one derivatives as potent
antitubercular agents.” ,1;20(21):6175-8. doi: 10.1016/j.bmcl.2010.08.127. Epub 2010 Sep 16.
Sriram D, Yogeeswari P, Dhakla P, Senthilkumar P, Banerjee D, Manjashetty TH (2009),“5Nitrofuran-2-ylderivatives: synthesis and inhibitory activities against growing and dormant
mycobacterium species.” , Bioorganic & medicinal chemistry letters 19:4 pg 1152-4
R. Karbakhsh1,* and R. Sabet (2011),“Application of different chemometric tools in QSAR study of
azoloadamantanes against influenza A virus”, Research in Pharmaceutical Sciences; 6(1): 23-33
“Molecular
Descriptors“
,The
Free
Online
resource
http://www.moleculardescriptors.eu/tutorials/what_is.htm
“Molecular Descriptors Guide” , Version 1.0.2 Copyright [2008] US Environamental Protection
agency
“Streamline Drug Discovery with CDD colabrative web based software “ ,
https://www.collaborativedrug.com/ ( Accesed in May-june [2012] )
”Canv
as”,
A
comprehensive
cheminformatics
computing
environment
http://www.schrodinger.com/products/14/23/
“VlifeMDS”,
Integrated
platform
for
Computer
Aided
Drug
Design
(CADD)
http://www.vlifesciences.com/products/VLifeMDS/Product_VLifeMDS.php
“Sphere Exclusion Method for set selection” ,Rajarshi Guha Penn State University
http://rguha.net/writing/pres/tropsha.pdf
61
Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013
[15] Mary Ann Liebert,(1999), “Dissimilarity-Based Algorithms for Selecting Structurally Diverse Sets of
Compounds” , JOURNAL OF COMPUTATIONAL BIOLOGY Volume 6, Numbers 3/4, Inc. Pp.
447–457
[16] Tropsha, A.; Gramatica, P.; Gombar,V.K.(2003),”The importance of being earnest: Validation is the
absolute essential for successful application and interpretation of QSPR models.”, QSAR Comb. Sci. ,
22, 69-77
[17] Partha Pratim Roy, Somnath Paul, Indrani Mitra and Kunal Roy(2009) ,“On Two Novel Parameters
for Validation of Predictive QSAR Models” , Molecules , 14, 1660-1701 ISSN 1420-3049
[18] “Multile linear Regression”, http://www.ltrr.arizona.edu/~dmeko/notes_11.pdf
[19] Dr. Frank Dieterle ,“Variable Selection by Simulated Annealing”,
http://www.frankdieterle.de/phd/2_8_6.html
[20] Hwang , Dan Nettleton ,“Principal Components Regression With Data-Choosen Components and
related methods” , J.T. Gene www.math.cornell.edu/~hwang/pcr.pdf
[21] Herv´e Abdi1 ,“Partial Least Squares(PLS) Regression ” ,The University of Texas at Dallas
[22] Randall D. Tobias, “An introduction to partial least squares Regression”, SAS Institute Inc., Carry,
NC www.ats.ucla.edu/stat/sas/library/pls.pdf
[23] Golbraikh .A , and A. Tropsha, (2002),”Predictive QSAR modeing based on diversity of sampling
of experimental datasets for the training and test set selection “, J. Comp Aided . Mol Design ,
16:357-366.
[24] “Influence of observations on the misclassification probability in quadratic discriminant analysis”
,https://lirias.kuleuven.be/bitstream/123456789/85608/1/qda.pdf
[25] Craig A. James, “An introduction to the Computer Science and Chemistry of Chemical Information
Systems” eMolecules, Inc. http://www.emolecules.com/doc/cheminformatics-101.php

Authors
Doreswamy received B.Sc degree in Computer Science and M.Sc Degree in
Computer Science from University of Mysore in 1993 and 1995 respectively. Ph.D
degree in Computer Science from Mangalore University in the year 2007. After
completion of his Post-Graduation Degree, he subsequently joined and served
asLecturer in Computer Science at St. Joseph’s College, Bangalore from 1996
1999.Then he has elevated to the position Reader in Computer Science at Mangalor
Universityin year 2003. He was the Chairman of the Department of Post-Graduate Studies and research in
computer science from 2003-2005 and from 2009-2008 and served at varies capacitiesin Mangalore
University at present he is the Chairman of Board of Studies and Professor in Computer Science of
Mangalore University. His areas of Research interests include Data Mining and Knowledge Discovery,
Artificial Intelligence and Expert Systems, Bioinformatics ,Molecular modelling and simulation
,Computational Intelligence ,Nanotechnology, Image Processing and Pattern recognition. He has been
granted a Major Research project entitled “Scientific Knowledge Discovery Systems (SKDS) for
Advanced Engineering Materials Design Applications” from the funding agency University Grant
Commission, New Delhi , India. He has been published about 30 contributed peer reviewed Papers at
national/International Journal and Conferences. He received SHIKSHA RATTAN PURASKAR for his
outstanding achievements in the year 2009 and RASTRIYA VIDYA SARASWATHI AWARD for
outstanding achievement in chosen field of activity in the year 2010.
Chanabasayya.M.Vastrad received B.E. degree and M.Tech. degree in the year
2001and 2006 respectively. Currently working towards his Ph.D Degree in Computer
Scienceand Technology under the guidance of Dr. Doreswamy in the Department of
Post-Graduate Studies and Research in Computer Science , Mangalore University.

62

Weitere ähnliche Inhalte

Was ist angesagt?

QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...inventionjournals
 
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...IOSRJAC
 
Publications 1 to dr. mark a. babizhayev
Publications 1 to dr. mark a. babizhayev Publications 1 to dr. mark a. babizhayev
Publications 1 to dr. mark a. babizhayev Mark Babizhayev
 
Publications 1 to Dr. Mark A. Babizhayev
Publications 1 to Dr. Mark A. Babizhayev Publications 1 to Dr. Mark A. Babizhayev
Publications 1 to Dr. Mark A. Babizhayev Mark Babizhayev
 
Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisAlexander Decker
 
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...IOSR Journals
 
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...Ratnakaram Venkata Nadh
 
Rational drug design
Rational drug designRational drug design
Rational drug designNaresh Juttu
 
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...CSCJournals
 
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...bioejjournal
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...bioejjournal
 
Statistical Optimization of Keratinase Production from Marine Fungus
Statistical Optimization of Keratinase Production from Marine FungusStatistical Optimization of Keratinase Production from Marine Fungus
Statistical Optimization of Keratinase Production from Marine FungusIJERA Editor
 
Research Avenues in Drug discovery of natural products
Research Avenues in Drug discovery of natural productsResearch Avenues in Drug discovery of natural products
Research Avenues in Drug discovery of natural productsDevakumar Jain
 
quantitative structure activity relationship studies of anti proliferative ac...
quantitative structure activity relationship studies of anti proliferative ac...quantitative structure activity relationship studies of anti proliferative ac...
quantitative structure activity relationship studies of anti proliferative ac...IJEAB
 
Ulipristal acetate determination using MBTH
Ulipristal acetate determination using MBTHUlipristal acetate determination using MBTH
Ulipristal acetate determination using MBTHRatnakaram Venkata Nadh
 

Was ist angesagt? (18)

QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
QSAR Studies of the Inhibitory Activity of a Series of Substituted Indole and...
 
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...
Studies on Anti-Inflammation Activity of Phenols Using Newly Introduced Balab...
 
Publications 1 to dr. mark a. babizhayev
Publications 1 to dr. mark a. babizhayev Publications 1 to dr. mark a. babizhayev
Publications 1 to dr. mark a. babizhayev
 
Publications 1 to Dr. Mark A. Babizhayev
Publications 1 to Dr. Mark A. Babizhayev Publications 1 to Dr. Mark A. Babizhayev
Publications 1 to Dr. Mark A. Babizhayev
 
Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasis
 
IAPST 2011
IAPST 2011IAPST 2011
IAPST 2011
 
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...
Synthesis and Anti-inflammatory activity of 2-amino-6-methoxy benzothiazole d...
 
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...
Novel Hybrid Molecules of Quinazoline Chalcone Derivatives: Synthesis and Stu...
 
my article
my articlemy article
my article
 
Rational drug design
Rational drug designRational drug design
Rational drug design
 
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...
Inhibition of Aldose Activity by Essential Phytochemicals of Cymbopogon citra...
 
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...Qsar studies on gallic acid derivatives and molecular docking studies of bace...
Qsar studies on gallic acid derivatives and molecular docking studies of bace...
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
 
Statistical Optimization of Keratinase Production from Marine Fungus
Statistical Optimization of Keratinase Production from Marine FungusStatistical Optimization of Keratinase Production from Marine Fungus
Statistical Optimization of Keratinase Production from Marine Fungus
 
Drug discovery anthony crasto
Drug discovery  anthony crastoDrug discovery  anthony crasto
Drug discovery anthony crasto
 
Research Avenues in Drug discovery of natural products
Research Avenues in Drug discovery of natural productsResearch Avenues in Drug discovery of natural products
Research Avenues in Drug discovery of natural products
 
quantitative structure activity relationship studies of anti proliferative ac...
quantitative structure activity relationship studies of anti proliferative ac...quantitative structure activity relationship studies of anti proliferative ac...
quantitative structure activity relationship studies of anti proliferative ac...
 
Ulipristal acetate determination using MBTH
Ulipristal acetate determination using MBTHUlipristal acetate determination using MBTH
Ulipristal acetate determination using MBTH
 

Andere mochten auch

Copyrights & education
Copyrights & educationCopyrights & education
Copyrights & educationajethridge
 
Copyrights & education r3
Copyrights & education r3Copyrights & education r3
Copyrights & education r3ajethridge
 
Uts miicrc sector sustainability
Uts miicrc sector sustainabilityUts miicrc sector sustainability
Uts miicrc sector sustainabilityUTSBusinessSchool
 
Lesson 5 labor
Lesson 5 labor Lesson 5 labor
Lesson 5 labor tonyabur
 
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEW
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEWINTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEW
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEWhiij
 
Overview of critical factors affecting medical user interfaces in intensive c...
Overview of critical factors affecting medical user interfaces in intensive c...Overview of critical factors affecting medical user interfaces in intensive c...
Overview of critical factors affecting medical user interfaces in intensive c...hiij
 
Lesson 4 urbanization
Lesson 4 urbanizationLesson 4 urbanization
Lesson 4 urbanizationtonyabur
 
Tre presentation 8 revised
Tre presentation 8 revisedTre presentation 8 revised
Tre presentation 8 revisedleroylacy12
 
Public Libraries News: "Riding the tiger by its tail"
Public Libraries News: "Riding the tiger by its tail"Public Libraries News: "Riding the tiger by its tail"
Public Libraries News: "Riding the tiger by its tail"Public Libraries News
 
Cat recicladora asfalto_rm-500
Cat recicladora asfalto_rm-500Cat recicladora asfalto_rm-500
Cat recicladora asfalto_rm-500Jaime Santos
 
SCA Digital Q3 C14 Engagement Metrics
SCA Digital Q3 C14 Engagement MetricsSCA Digital Q3 C14 Engagement Metrics
SCA Digital Q3 C14 Engagement MetricsDarren Kerry
 
Uberlina
UberlinaUberlina
Uberlinabaiarin
 
самая классная классная жизнь!
самая классная классная жизнь!самая классная классная жизнь!
самая классная классная жизнь!Julia Moshkova
 

Andere mochten auch (16)

Copyrights & education
Copyrights & educationCopyrights & education
Copyrights & education
 
Copyrights & education r3
Copyrights & education r3Copyrights & education r3
Copyrights & education r3
 
Uts miicrc sector sustainability
Uts miicrc sector sustainabilityUts miicrc sector sustainability
Uts miicrc sector sustainability
 
Sap 0 rkom
Sap 0 rkomSap 0 rkom
Sap 0 rkom
 
Lesson 5 labor
Lesson 5 labor Lesson 5 labor
Lesson 5 labor
 
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEW
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEWINTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEW
INTEGRATED, RELIABLE AND CLOUD-BASED PERSONAL HEALTH RECORD: A SCOPING REVIEW
 
Overview of critical factors affecting medical user interfaces in intensive c...
Overview of critical factors affecting medical user interfaces in intensive c...Overview of critical factors affecting medical user interfaces in intensive c...
Overview of critical factors affecting medical user interfaces in intensive c...
 
Lesson 4 urbanization
Lesson 4 urbanizationLesson 4 urbanization
Lesson 4 urbanization
 
Cенгара
CенгараCенгара
Cенгара
 
Tre presentation 8 revised
Tre presentation 8 revisedTre presentation 8 revised
Tre presentation 8 revised
 
ใบงาน3
ใบงาน3ใบงาน3
ใบงาน3
 
Public Libraries News: "Riding the tiger by its tail"
Public Libraries News: "Riding the tiger by its tail"Public Libraries News: "Riding the tiger by its tail"
Public Libraries News: "Riding the tiger by its tail"
 
Cat recicladora asfalto_rm-500
Cat recicladora asfalto_rm-500Cat recicladora asfalto_rm-500
Cat recicladora asfalto_rm-500
 
SCA Digital Q3 C14 Engagement Metrics
SCA Digital Q3 C14 Engagement MetricsSCA Digital Q3 C14 Engagement Metrics
SCA Digital Q3 C14 Engagement Metrics
 
Uberlina
UberlinaUberlina
Uberlina
 
самая классная классная жизнь!
самая классная классная жизнь!самая классная классная жизнь!
самая классная классная жизнь!
 

Ähnlich wie Predictive comparative qsar analysis of as 5 nitrofuran-2-yl derivatives myco bacterium tuberculosis h37 rv inhibitors

PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...
PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...
PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...hiij
 
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...iosrjce
 
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic CompoundsEffect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic CompoundsIOSR Journals
 
IRJET- Insilico Studies of Molecular Property and Bioactivity of Organic ...
IRJET-  	  Insilico Studies of Molecular Property and Bioactivity of Organic ...IRJET-  	  Insilico Studies of Molecular Property and Bioactivity of Organic ...
IRJET- Insilico Studies of Molecular Property and Bioactivity of Organic ...IRJET Journal
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Sunghwan Kim
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...bioejjournal
 
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPsWALEBUBLÉ
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologySean Ekins
 
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...IOSR Journals
 
IUPHAR/BPS Guide to Pharmacology in 2018
IUPHAR/BPS Guide to Pharmacology in 2018IUPHAR/BPS Guide to Pharmacology in 2018
IUPHAR/BPS Guide to Pharmacology in 2018Guide to PHARMACOLOGY
 
Raypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxRaypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxDrRajeshDas
 
Raypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxRaypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxDrRajeshDas
 
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...BIOLOGICAL FORUM
 
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...Pavan Kumar
 

Ähnlich wie Predictive comparative qsar analysis of as 5 nitrofuran-2-yl derivatives myco bacterium tuberculosis h37 rv inhibitors (20)

PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...
PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...
PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO...
 
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...
A Semi-empirical based QSAR study of indole휷- Diketo acid, Diketo acid and Ca...
 
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic CompoundsEffect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds
Effect of 3D parameters on Antifungal Activities of Some Heterocyclic Compounds
 
IRJET- Insilico Studies of Molecular Property and Bioactivity of Organic ...
IRJET-  	  Insilico Studies of Molecular Property and Bioactivity of Organic ...IRJET-  	  Insilico Studies of Molecular Property and Bioactivity of Organic ...
IRJET- Insilico Studies of Molecular Property and Bioactivity of Organic ...
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
 
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
Qsar Studies on Gallic Acid Derivatives and Molecular Docking Studies of Bace...
 
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs
2017 - Plausible Bioindicators of Biological Nitrogen Removal Process in WWTPs
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational Toxicology
 
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...
QSAR studies of some anilinoquinolines for their antitumor activity as EGFR i...
 
IUPHAR/BPS Guide to Pharmacology in 2018
IUPHAR/BPS Guide to Pharmacology in 2018IUPHAR/BPS Guide to Pharmacology in 2018
IUPHAR/BPS Guide to Pharmacology in 2018
 
Raypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxRaypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptx
 
Raypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptxRaypur(NEW) QSAR.pptx
Raypur(NEW) QSAR.pptx
 
IAJPR LAVANYA
IAJPR LAVANYAIAJPR LAVANYA
IAJPR LAVANYA
 
Ijpar 18 22
Ijpar 18 22Ijpar 18 22
Ijpar 18 22
 
Ijpar 18 22
Ijpar 18 22Ijpar 18 22
Ijpar 18 22
 
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...
7 synthesis, characterisation and antimicrobial activity of schiff base of 7 ...
 
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...
Exploration of a potential FtsZ inhibitors as new scaffolds by Ligand and Str...
 
Qsar ppt
Qsar pptQsar ppt
Qsar ppt
 
iGEM UCSD 2015 Poster
iGEM UCSD 2015 PosteriGEM UCSD 2015 Poster
iGEM UCSD 2015 Poster
 
Critical Review (drug toxicity testing using organ on chip)
Critical Review (drug toxicity testing using organ on chip)Critical Review (drug toxicity testing using organ on chip)
Critical Review (drug toxicity testing using organ on chip)
 

Mehr von hiij

Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)hiij
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)hiij
 
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...hiij
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)hiij
 
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTS
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTSAUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTS
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTShiij
 
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMSINTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMShiij
 
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...hiij
 
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMSINTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMShiij
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)hiij
 
The Proposed Guidelines for Cloud Computing Migration for South African Rural...
The Proposed Guidelines for Cloud Computing Migration for South African Rural...The Proposed Guidelines for Cloud Computing Migration for South African Rural...
The Proposed Guidelines for Cloud Computing Migration for South African Rural...hiij
 
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATA
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATASUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATA
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATAhiij
 
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEW
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEWAN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEW
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEWhiij
 
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...hiij
 
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...hiij
 
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?hiij
 
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...hiij
 
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...hiij
 
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...hiij
 
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...hiij
 
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...hiij
 

Mehr von hiij (20)

Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)
 
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)
 
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTS
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTSAUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTS
AUTOMATIC AND NON-INVASIVE CONTINUOUS GLUCOSE MONITORING IN PAEDIATRIC PATIENTS
 
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMSINTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
 
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
BRIEF COMMENTARY: USING A LOGIC MODEL TO INTEGRATE PUBLIC HEALTH INFORMATICS ...
 
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMSINTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
INTEGRATING MACHINE LEARNING IN CLINICAL DECISION SUPPORT SYSTEMS
 
Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)Health Informatics - An International Journal (HIIJ)
Health Informatics - An International Journal (HIIJ)
 
The Proposed Guidelines for Cloud Computing Migration for South African Rural...
The Proposed Guidelines for Cloud Computing Migration for South African Rural...The Proposed Guidelines for Cloud Computing Migration for South African Rural...
The Proposed Guidelines for Cloud Computing Migration for South African Rural...
 
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATA
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATASUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATA
SUPPORTING LARGE-SCALE NUTRITION ANALYSIS BASED ON DIETARY SURVEY DATA
 
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEW
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEWAN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEW
AN EHEALTH ADOPTION FRAMEWORK FOR DEVELOPING COUNTRIES: A SYSTEMATIC REVIEW
 
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
 
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
 
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?
CHINESE PHARMACISTS LAW MODIFICATION, HOW TO PROTECT PATIENTS‘INTERESTS?
 
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
GENDER DISPARITYOF TUBERCULOSISBURDENIN LOW-AND MIDDLE-INCOME COUNTRIES: A SY...
 
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...
PERSONALIZED MEDICINE SUPPORT SYSTEM: RESOLVING CONFLICT IN ALLOCATION TO RIS...
 
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
BRIEF COMMUNICATIONS DATA HYGIENE: IMPORTANT STEP IN DECISIONMAKING WITH IMPL...
 
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...
CLINICAL AND SOCIAL DETERMINANTS TO IMPROVE DOD/VETERAN WELL BEING: THE SERVI...
 
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...
Design of RLE Scorer Web Forms and Nursing Students Efficacy in Parenteral Dr...
 

Kürzlich hochgeladen

Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Kürzlich hochgeladen (20)

Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Predictive comparative qsar analysis of as 5 nitrofuran-2-yl derivatives myco bacterium tuberculosis h37 rv inhibitors

  • 1. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 PREDICTIVE COMPARATIVE QSAR ANALYSIS OF AS 5-NITROFURAN-2-YL DERIVATIVES MYCO BACTERIUM TUBERCULOSIS H37RV INHIBITORS Doreswamy1 and Chanabasayya .M. Vastrad2 1 Department of Computer Science Mangalore University , Mangalagangotri-574 199, Karnataka, INDIA 2 Department of Computer Science Mangalore University , Mangalagangotri-574 199, Karnataka, INDIA ABSTRACT Antitubercular activity of 5-nitrofuran-2-yl Derivatives series were subjected to Quantitative Structure Activity Relationship (QSAR) Analysis with an effort to derive and understand a correlation between the biological activity as response variable and different molecular descriptors as independent variables. QSAR models are built using 40 molecular descriptor dataset. Different statistical regression expressions were got using Partial Least Squares (PLS) ,Multiple Linear Regression (MLR) and Principal Component Regression (PCR) techniques. The among these technique, Partial Least Square Regression (PLS) technique has shown very promising result as compared to MLR technique A QSAR model was build by a training set of 30 molecules with correlation coefficient ( ‫ ݎ‬ଶ ) of 0.8484 , significant cross validated ଶ correlation coefficient (‫ ݍ‬ଶ ) is 0.0939, ‫ ݐݏ݁ݐ ܨ‬is 48.5187, ‫ ݎ‬ଶ for external test set (‫ ) ݎ_݀݁ݎ݌‬is -0.5604, coefficient of correlation of predicted data set (‫ ݎ_݀݁ݎ݌‬ଶ ‫ )݁ݏ‬is 0.7252 and degree of freedom is 26 by Partial Least Squares Regression technique. KEYWORDS TB, MLR , PLS , PCR , LOO 1. INTRODUCTION Tuberculosis in humans is generally caused by mycobacterium tuberculosis(TB). The desease is spread by respirable droplets generated during effective expiratory manoeuvres such as coughing. TB desease can be either active or latent[1] . The World Health Organization (WHO) asses that within the next twenty years about thirty million people will be troubled with the bacillus [2-3]. The analytic management of TB has depends dully on a limited number of drugs such as Isonicotinic acid, Hydrazide,Rifadin, Rimactane ,Myambutol ,Streptomycin, Ethionamide, Pyrazinamide, Fluroquinolones etc [4]. Still with the origin of these special chemical drugs the DOI: 10.5121/hiij.2013.2404 47
  • 2. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 spread of TB has not been eradicated completely because of delayed treatment programmes .There is now recognition that new drugs to treat TB are necessarily required, particularly for use in shorter medication procedure than are possible with the current agents and which can be engaged to treat multi-drug resistant and hidden disease[5]. 5-nitrofuran-2-yl shows effective in vitro and in vivo antimycobacterial activity [6]. There is also a great effort to find and develop newer, 5-nitrofuran-2-yl, and some of them might have value in the remedy of TB [7]. Chemo informatics[26] and computer-aided drug design (CADD) are likely to contribute to a possible solution for the dangerous situation regarding this infectious disease by assisting in the swift identification of new effective anti-TB agents. The other way for overcoming the absence of empirical analysis for biological systems is depends on the activity to develop quantitative structure activity relationship (QSAR) [8] . QSAR models are mathematical expressions formulating a relationship between chemical structures and biological activities. These models have different capability, which is providing a deeper knowledge about the process of biological activity. In the first step of a usual QSAR study one needs to find a set of molecular descriptors with the higher influence on the biological activity of interest [9]. A broad scope of molecular descriptors[10] has been used in QSAR modeling. These molecular descriptors[11] have been categorised into different classes, including constitutional, geometrical, topological, quantum chemical and so on. Using such an way one could predict the activities of newly formulated compounds before a conclusion is being made whether these compounds should be truly synthesized and tested. We examine the performance of Partial Least Squares(PLS) based QSAR models with the results produced by Multi Linear Regression(MLR ) and Principal Component Regression (PCR) methods to discover basic requirements for additional bettered antitubercular activity. 2. MATERIALS AND METHODS 2.1 MOLECULAR DESCRIPTOR DATA SETS A set of fourty molecule compounds relates to derivatives for mycobacterium TB(H37Rv) inhibitors were taken from large antitubercular drug molecule databases[12] using substructure mining tool Schrodinger Canvas 2010(Trial version) [13]. All molecules were handled by the Vlife MDS [14] - 2D coordinates of atoms were recalculated counter ions and salts were eliminated from molecular structures, molecules were neutralized, mesomerized and aromatized. Data sets were then refined from duplicates. The 2D-QSAR models were produced using a training set of thirty molecules. Predictive ability of the models was assessed by a test set of ten molecules with consistently distributed biological activities. The observed selection of test set molecules was made by seeing the fact that test set molecules shows a range of biological activity similar to the training set. The actual and predicted biological activities of the training and test set molecules are given in Table 1. 48
  • 3. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 Sl no 1 Compound IC50a(µg/ml) PIC50b Obsr Pred Residual 4.41 5.3820 0.027 6.41 5.193 5.2282 0.0352 3.29 5.482 5.2282 0.2538 8.7 5.060 5.2282 0.1682 11.38 4.943 6.0161 1.0731 6.83 5.165 5.3820 0.217 2.53 5.596 5.3820 0.214 -4.35 5.361 5.3820 0.021 -0.95 H3 C 2 5.355 6.022 5.3820 0.64 -4.12 5.385 5.3820 0.003 -7.8 5.062 5.3820 0.32 CH 3 HN O O 3 4 H3 C HN O O 5 6 7 H3 C 8 O CH 3 S Cl N O O HN O CH 9 O O O S N H3 C O O HN H3C O O CH 10 O O O S N H3 C O O HN H3C O O 11 H3 C O O CH 3 S N O O H3C HN O 49
  • 4. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 12 -0.82 6.086 6.0867 0.0007 4.01 5.317 5.2282 0.0888 3.9 5.408 5.2282 0.1798 -5.44 5.264 5.2282 0.0358 0.15 6.823 5.3820 1.441 6.79 5.168 5.3820 0.214 9.26 5.033 5.3820 0.349 -3.15 5.501 5.3820 0.119 6.04 5.218 5.5359 0.3179 -3.47 5.459 5.3820 0.077 13 CH 3 O O CH 3 H3 C S O N O O HN H N O O 14 15 16 17 18 19 20 21 50
  • 5. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 22 -4.23 5.373 5.3820 0.009 -2.48 5.605 5.3820 0.223 -1.65 5.782 5.3820 0.4 -3.62 5.441 5.6625 0.2215 6.2 5.207 5.3820 0.175 98.23 4.007 4.0090 0.002 -5.11 5.291 5.2282 0.0628 1.15 5.939 5.2282 0.7108 -6.59 5.181 5.2282 0.0472 -0.08 7.096 7.0986 0.0026 23 24 O 25 CH 3 O O NH O O O O 26 27 28 29 O 30 H2 N N N O O N 31 H3 C O O O 51
  • 6. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 32 O N O 0.15 6.823 6.7122 0.1108 -2.55 5.593 5.3820 0.211 -1.05 5.978 5.0744 0.9036 -9.38 5.027 5.2282 0.2012 -4.8 5.318 5.0744 0.2436 10.73 4.969 5.0744 0.1054 6.25 5.204 5.2282 0.0242 4.26 5.370 5.2282 0.1418 -8.01 N 5.096 5.2282 0.1322 O 33 O F HN O 34 35 H3 C O HS N Cl Cl HN Cl HN O O 36 37 38 CH 3 O HN O O 39 40 Table 1 Molecular structure with Observed and Predicted activity of 5-nitrofuran-2-yl used in training and test set using Model-1 (PLS) Expt. = Experimental activity, Pred. = Predicted activity IC50a = Compound concentration in micro mole required to inhibit growth by 50% 52
  • 7. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 PIC50b = -Log (IC50 × 10ି଺): Training data set developed using model 1 (PLS) and Test set is light blue shaded. 2.2 BIOLOGICAL OBSERVED ACTIVITY DATA For the evolution of QSAR models of 5-nitrofuran-2-yl ,of processes antitubercular activity in terms of half maximum inhibitory concentration IC50 (µM) versus (H37Rv) strains were took from the antitubercular drug molecule databases[12]. The IC50 activity data contains only molecules that have at least exhibited some activity. The biological activity data (IC50) were transformed in to pIC50 according to the formula pIC50 = (−log (IC50 × 10ି଺)) was used as response values, thus correlating the data linear to the free energy change. 2.3 DESCRIPTOR CALCULATION FOR MOLECULAR DATASET The VLife MDS tool used for the computation of various molecular descriptors containing topological index (J), connectivity index (x), radius of gyration (RG), moment of inertia, Wiener index(W), balabian index(J), centric index, hosoya index (Z), information based indices, XlogP, logP, hydrophobicity, elemental count, path count, chain count, pathcluster count, molecular connectivity index (chi), kappa values, electro topological state indices, electrostatic surface properties, dipole moment, polar surface area(PSA), alignment independent descriptor (AI)[11,14. The calculated molecular descriptors were gathered in a data matrix. The preprocessing for the generated molecular descriptors was done by removing invariable (constant column) and cross-correlated descriptors (with r = 0.99). which happen in total 156, 125 and 162 molecular descriptors for MLR, PCR and PLS accordingly to be used for QSAR analysis. 2.4 CREATION OF TRAINING AND TEST SET The dataset of forty molecular descriptors is split s into training and test set by Sphere Exclusion (SE)[15-16] technique. In this technique initially data set splits into training and test set using sphere exclusion technique. In this technique variance value provides an idea to handle training and test set size. It needs to be adapted by trial and error until a desired split of training and test set is acquired. Increase in dissimilarity value results in increase in number of molecules in the test set. This technique is used for MLR, PCR and PLS models with pIC50 activity data as response variable and various 2D molecular descriptors computed for the molecules as independent variables. 2.5 MODEL VALIDATION Model validation [17-18] is a essential manner of quantitative structure–activity relationship (QSAR) modelling. This is done to test the internal stability and predictive capability of the QSAR models. These three QSAR models were validated by the following method. 2.5.1 INTERNAL MODEL VALIDATION Internal model validation was carried out using leave-one-out (LOO-Qଶ ) method. For calculating qଶ , each sample in the training set was eliminated once and the activity of the eliminated sample was predicted by using the model developed by the remaining samples. The Qଶ computed using the expression which explains the internal strength of a model. 53
  • 8. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 ∑(ଢ଼ Qଶ = 1 − ∑(ଢ଼ ౦౨౛ౚ ି ଢ଼౥ౘ౩ )మ (1) మ ౥ౘ౩ ି ଢ଼ౣ౛౗౤ ) In Eq. (1), Y୮୰ୣୢ and Y୭ୠୱ indicate predicted and observed activity values accordingly and Y୫ୣୟ୬ signify mean activity value. A model is considered acceptable when the value of Qଶ exceeds 0.5. 2.5.2 EXTERNAL MODEL VALIDATION External model validation, the activity of each sample in the test set was predicted using the model created by the training set. The pred_r ଶ value is computed as follows. pred_r ଶ = ∑(ଢ଼౦౨౛ౚ(౪౛౩౪) ି ଢ଼౪౛౩౪ )మ ∑(ଢ଼౪౨౗౟౤ ି ଢ଼ౣ౛౗౤(౪౨౗౟౤) )మ (2) In Eq (2) Y୮୰ୣୢ(୲ୣୱ୲) and Y୲ୣୱ୲ indicate predicted and observed activity values for the test set and Y୲୰ୟ୧୬ indicates mean activity value of the training set. For the predictive QSAR model, the value of pred_r ଶ must be more than 0.5. 2.5.3 RANDOMIZATION TEST Randomization test or Y-scrambling is key mean of statistical validation. To assess the statistical importance of the QSAR model for the dataset, one tail hypothesis testing is used. The strength of the models for training sets was tested by examining these models to those derived for random datasets. Random sets were produced by rearranging the activities of the samples in the training set. The statistical model was determined using different randomly reorganize activities (random sets) with the chosen molecular descriptors and the equivalent Qଶ were computed. The importance of the models for that reason obtained was developed based on a computed Zୱୡ୭୰ୣ. A Z score value is calculated by the following equation: (୦ ି ஜ) Zୱୡ୭୰ୣ = ஢ (3) Where ℎ is the Qଶ value computed for the dataset, µ the mean Qଶ , and is its σ standard deviation calculated for various iterations using models build by different random datasets. The probability (a) of importance of randomization test is derived by comparing Zୱୡ୭୰ୣ value with Zୱୡ୭୰ୣ critical value as stated, if Zୱୡ୭୰ୣ value is less than 4.0; otherwise it is computed by the expression as given in the literature. For example, a Zୱୡ୭୰ୣ value more than 3.10 proposes that there is a probability (a) of smaller than 0.001 that the QSAR model build for the dataset is random. The randomization test proposes that all the created models have a probability of less than 1% that the model is produced by chance. 2.6 MULTIPLE LINEAR REGRESSION (MLR) ANALYSIS MLR technique used for modelling linear relationship between a response variable Y (pIC50) and independent variables X (2D molecular descriptors). MLR is based on least squares technique: the model is fit such that sum-of-squares of differences of actual and a predicted values are minimized. MLR estimates the regression coefficients ( r ଶ ) by applying least squares fitting 54
  • 9. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 technique. The model builds a relationship in the form of a straight line (linear) that best estimates all the individual data points. In regression analysis, conditional mean of response variable (pIC50) Y depends on (molecular descriptors)X. MLR analysis add to this idea to include more than one independent variables. Regression expression takes the form. (4) Y = bଵ xଵ + bଶ xଶ + bଷ xଷ + c where Y is a response variable, ‘b’s are regression coefficients for corresponding ‘x’s are molecular descriptors(independent variables), ‘c’ is a regression constant or intercept [19,25]. 2.7 PRINCIPAL COMPONENT REGRESSION (PCR) ANALYSIS Principal Component Regression (PCR) is a regression technique that uses principal component analysis(PCA) when evaluating regression coefficients. PCR presents a technique for finding structure in datasets. Its object is to group correlated variables, replacing the earlier descriptors by new set called principal components (PCs). These PC’s are uncorrelated and are developed as a simple linear aggregation of earlier variables. It moves the data into a new set of axes such that first few axes indicates most of the variations within the data. First PC (PC1) is expressed in the direction of maximum variance of the whole dataset. Second PC (PC2) is the direction that defines the maximum variance in orthogonal subspace to PC1. Consequent components are taken orthogonal to the particular formerly chosen and defines best of remaining variance, by locating the data on new set of axes, it can points major fundamental structures certainly. Value of each point, when moved to a given axis, is called the PC value. PCA chooses a new set of axes for the data. These are chosen in decreasing order of variance within the data. The aim of principal component PCR is the computation of values of a response variable on the basis of chosen PCs of independent variables [21]. 2.8 PARTIAL LEAST SQUARES (PLS) REGRESSION ANALYSIS PLS is a well known regression technique which can be used to correlate one or more response variable (Y) to various independent variables(X) . PLS relates a matrix Y of response variables to a matrix X of molecular descriptors. PLS is useful in conditions where the number of molecular descriptors( independent variables) exceeds the number of samples, when X data contain colinearties or when N is less than 5M, where N is number of samples and M is number of response variables. PLS builds orthogonal components using existing correlations between independent variables and corresponding outputs while also keeping most of the variance of independent variables. Major aim of PLS regression is to predict the activity (Y) from X and to define their common frame[22,23] . PLS is probably the least contrary of various multivariate extensions of MLR model. PLS is a technique for constructing predictive models when factors are many and highly collinear. 2.9 EVALUATION OF THE QSAR MODELS The created QSAR models are computed using the following statistical parameters: N (Number of samples in regression); K (Number of independent variables(molecular descriptors)); DF (Degree of freedom); optimum component ( number of optimums); r ଶ ( the squared correlation coefficient); F test (Fischer’s Value) for statistical importance; qଶ (cross-validated correlation 55
  • 10. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 coefficient); pred_r ଶ ( r ଶ for external test set); Zୱୡ୭୰ୣ ( Z score computed by the randomization test); Best_ran_r ଶ (maximal r ଶ value in the randomization test) ; Best_ran_qଶ (maximal qଶ value in the randomization test) ; α ( statistical importance parameter obtained by the randomization test). The correlation coefficient r ଶ is a respective standard of fit by the regression expression. It expressed the part of the variation in the observed data is analyzed by the regression. Despite, a QSAR models are examined to be predictive, if the following prerequistes are satisfied: r ଶ > 0.6 , qଶ > 0.6 and pred_r ଶ > 0.5 [24] . The F-test indicates the ratio of variance described by the model and variance due to the error in the regression. High values of the F-test indicate that model is statistically meaningful. The reduced standard error of pred_r ଶ se , qଶ _se and r ଶ _se demonstrates actual value of the fitness of the model. The cross-correlation extent was set at 0.5. 3. RESULTS Taining set of 30 and 10 of test set of 5-nitrofuran-2-yl having different substitution were employed. 3.1 CREATION OF QSAR MODELS 3.1.1 PARTIAL LEAST SQUARES (PLS) REGRESSION ANALYSIS The molecular descriptors were applied to PLS technique to developQSAR models by using simulated anealing variable selection mode. PLS model is having following QSAR Eq.(5) with five descriptors. pIC50 = 1.8704(StsCcount) + 4.0747(chi5chain) − 0.6865(SaaaCcount) + 0.7046(SssScount) − 0.1538 (SdssCcount) + 4.9478 (5) Table 2 Statistical parameters of PLS, MLR And PCR Parameters N DF rଶ PLS 40 26 0.8484 MLR 40 24 0.8484 PCR 40 28 0.3289 qଶ F-test best_ran_r ଶ best_ran_qଶ Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ 0.0939 48.5187 0.56429 −0.03892 3.43122 1.59111 0.0932 26.8725 0.28620 −0.08598 8.11471 1.31886 −5.3805 13.7231 0.32891 −2.93782 2.81258 −2.47533 α_ran_r ଶ α_ran_q ଶ 0.00100 0.10000 0.00000 0.10000 0.01000 99.00000 r ଶ _se 0.2277 0.2370 0.4617 q _se pred_r ଶ 0.5568 −0.5604 0.5797 −0.5616 1.4237 −0.0734 pred_r ଶ se 0.7252 0.7255 0.6015 ଶ 56
  • 11. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 The above analysis directs to the improvement of statistically meaningful QSAR model, which allows understanding of the molecular properties/features that play an key role in governing the variation in the activities. In addition, this QSAR study allowed examining influence of very simple and easy-to-compute molecular descriptors in discovering biological activities, which could shed light on the important factors that may aid in design of new potent molecules. All the parameters and their significance, which contributed to the specific inhibitory activity in the generated model is discussed below. antitubercular 1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a single bond exist in the molecule. Positive Contribution of this descriptor to the model is 31.72%. 2.chi5chain: This descriptor signifies a retention index for five membered ring. Positive Contribution of this descriptor to the model is 21.99%. 3. SaaaCcount: This descriptor defines the total number of carbon connected with three aromatic bonds. Negative Contribution of this descriptor to the model is -23.28%. 4. SssScount: This descriptor indicates the total number of sulphur atom attached with two single bonds. Positive Contributions of this descriptor to the model is 11.95%. 5. SdssCcount: This descriptor defines the total number of carbon connected with one double and two single bond. Negative Contribution of this descriptor to the model is -11.06%. Figure 1 Observed vs. Predicted activities for training and test set molecular descriptors by Partial Least Square model. (A) Training set (Red dots) (B) Test Set (Blue dots). The PLS model gave correlation coefficient ( r ଶ ) of 0.8484, significant cross validated correlation coefficient ( qଶ ) of 0.0939, F-test of 48.5187 and degree of freedom 26. The model is validated by α_ran_r ଶ = 0.00100, α_ran_q ଶ = 0.10000, best_ran_r ଶ = 0.56429, best_ran_qଶ = -0.03892, Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ = 3.43122 and Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ =1.59111. The randomization test proposes that the created model have a probability of smaller than 1% that the model is build by chance. Statistical 57
  • 12. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 data is presented in Table 2. The graph of observed vs. predicted activity is demonstrated in Figure 1. The descriptors which contribute for the QSAR model is demonstrated in Figure 2. Figure 2 Percentage contribution of each descriptor in created PLS model describing variation in the activity 3.1.2 MULTIPLE LINEAR REGRESSION (MLR) ANALYSIS The QSAR analysis by Multiple Linear Regression method with simulated annealing variable selection technique, the final QSAR model is created having five descriptors is shown in Eq. (6). pIC50 = 1.8681(± 0.2421)StsCcount + 4.0722(± 0.7599)chi5chain − 0.6879(± 0.1139)SaaaCcount + 0.7033(± 0.2393)SssScount − 0.1548 (± 0.0265)SdssCcount + 4.9497 (6) MLR Model has a correlation coefficient ( r ଶ ) of 0.8484, significant cross validated correlation coefficient ( qଶ ) of 0.0932, F test of 26.8725 and degree of freedom 24. The model is validated by α_ran_r ଶ = 0.00000 , α_ran_q ଶ = 0.10000, best_ran_r ଶ = 0.28620, best_ran_qଶ = −0.08598 , Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ = 8.11471 andZୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ = 1.31886 The randomization test proposes that the created model have a probability of smaller than 1% that the model is build by chance. The observed and predicted values with residual values are demonstrated in Table 1.Statistical data is demonstrated in Table 2.The graph of observed vs. predicted activity demonstrated is in Figure 3. The descriptors which contribute for the QSAR model are demonstrated in Figure 4. All the parameters and their significance, which contributed to the specific antitubercular inhibitory activity in the generated models are explained below. Figure 3 Observed vs. Predicted activities for training and test set molecular descriptors from the Multiple Linear Regression model. (A) Training set (Red dots) (B) Test Set (Blue dots). 58
  • 13. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a single bond present in the molecule. Positive Contribution of this descriptor to the model is 31.67%. 2.chi5chain: This descriptor signifies a retention index for five membered ring. Positive Contribution of this descriptor to the model is 21.97%. 3. SaaaCcount: This descriptor defines the total number of carbon connected with three aromatic bonds. Negative Contribution of this descriptor to the model is -23.32%. 4. SssScount: This descriptor indicates the total number of sulphur atom attached with two single bonds. Positive Contributions of this descriptor to the model is 11.92%. 5. SdssCcount: This descriptor defines the total number of carbon connected with one double and two single bond. Negative Contribution of this descriptor to the model is -11.12%. Figure 4 Percentage contribution of each descriptor in created MLR model describing variation in the activity. 3.1.3 PRINCIPAL COMPONENT REGRESSION (PCR) ANALYSIS The molecular descriptors were applied to under goes PCR technique to create QSAR model with Simulated anealining variable selection mode by using PCR model. The final QSAR model is Eq. (7) was created having one descriptor as follows. pIC50 = 1.7397StsCcount + 5.3563 (7) The PCR model gave correlation coefficient ( r ଶ ) is 0.3289, significant cross validated correlation coefficient ( qଶ ) of -5.3805, F test of 13.7231 and degree of freedom 28. The model is validated by α_ran_r ଶ = 0.01000, α_ran_q ଶ = 99.00000, best_ran_r ଶ = 0.32891, best_ran_qଶ =-0.13938 , Zୱୡ୭୰ୣ_୰ୟ୬_୰ ଶ = 2.81258 and Zୱୡ୭୰ୣ_୰ୟ୬_୯ ଶ =-2.47533. The randomization test proposes that the created model have a probability of smaller than 1% that the model is build by chance. 59
  • 14. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 Statistical data is demonstrated in Table 2. The graph of observed vs. predicted activity is in demonstrated Figure 5 .The descriptors which contribute for the QSAR model is demonstrated in Figure 6. Figure 5 Observed vs. Predicted activities for training and test set molecular descriptors by Principal Component Regression model. A) Training set (Red dots) B) Test Set (Blue dots). All the parameters and their significance, which contributed to the specific inhibitory activity in the generated models are discussed here. antitubercular 1. StsCcount: This descriptor indicates the total number of carbon atoms with a triple bond and a single bond present in the molecule. Positive Contribution of this descriptor to the model is 100%. Figure 6 Percentage contribution of each descriptor in developed PCR model describing variation in the activity 60
  • 15. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 4. CONCLUSION The 2D QSAR analysis were conducted with a series of 5-nitrofuran-2-yl derivatives for mycobacterium tuberculosis(H37Rv) inhibitors , and some useful predictive models were obtained. The physicochemical molecular descriptors were found to have an key role in governing the change in activity. The statistical parameters demonstrate the estimation power of QSAR model for the molecular descriptor data set from which it has been determined and evaluate it only internally. The overall performance of prediction was found to be around 84% in case of PLS and MLR. Among the three 2D-QSAR models (MLR, PCR, and PLS), the results of PLS and MLR analysis showed significant predictive power and reliability as compare to PCR technique. ACKNOWLEDGEMENTS The Authors are thankful to Dr Mahesh .B. Palkar Department of Pharmaceutical Chemistry K.L.E Pharmacy College Hubli REFERENCES [1] [2] [3] [4] [5] [6]T [7] [8] [9] [10] [11] [12] [13] [14] “Tuberculosis”, Centers for Disease Control and Prevention1600 Clifton Rd. Atlanta, GA 30333, USA http://www.cdc.gov/tb/topic/basics/default.htm “Weekly Epidemiological Record (WER)”,WHO annual report on global TB control – summary http://www.who.int/wer/2003/wer7815/en/index.html Phyllis C. Braun, PhD and John D. Zoidis, MD, “Update on Drug-Resistant Pathogens: Mechanisms of Resistance, Emerging Strains”, http://www.rtmagazine.com/issues/articles/2004-01_01.asp “Tuberculosis management”, From Wikipedia, the free encyclopaedia http://en.wikipedia.org/wiki/Tuberculosis_management ”Multidrug-resistant tuberculosis (MDR-TB)” , From World Health Organization http://www.who.int/tb/challenges/mdr/en/ awari NR, Bairwa R, Ray MK, Rajan MG, Degani MS. Bioorg Med Chem Lett. (2010 ),“Design, synthesis,and biological evaluation of 4-(5-nitrofuran-2-yl)prop-2-en-1-one derivatives as potent antitubercular agents.” ,1;20(21):6175-8. doi: 10.1016/j.bmcl.2010.08.127. Epub 2010 Sep 16. Sriram D, Yogeeswari P, Dhakla P, Senthilkumar P, Banerjee D, Manjashetty TH (2009),“5Nitrofuran-2-ylderivatives: synthesis and inhibitory activities against growing and dormant mycobacterium species.” , Bioorganic & medicinal chemistry letters 19:4 pg 1152-4 R. Karbakhsh1,* and R. Sabet (2011),“Application of different chemometric tools in QSAR study of azoloadamantanes against influenza A virus”, Research in Pharmaceutical Sciences; 6(1): 23-33 “Molecular Descriptors“ ,The Free Online resource http://www.moleculardescriptors.eu/tutorials/what_is.htm “Molecular Descriptors Guide” , Version 1.0.2 Copyright [2008] US Environamental Protection agency “Streamline Drug Discovery with CDD colabrative web based software “ , https://www.collaborativedrug.com/ ( Accesed in May-june [2012] ) ”Canv as”, A comprehensive cheminformatics computing environment http://www.schrodinger.com/products/14/23/ “VlifeMDS”, Integrated platform for Computer Aided Drug Design (CADD) http://www.vlifesciences.com/products/VLifeMDS/Product_VLifeMDS.php “Sphere Exclusion Method for set selection” ,Rajarshi Guha Penn State University http://rguha.net/writing/pres/tropsha.pdf 61
  • 16. Health Informatics- An International Journal (HIIJ) Vol.2, No.4, November 2013 [15] Mary Ann Liebert,(1999), “Dissimilarity-Based Algorithms for Selecting Structurally Diverse Sets of Compounds” , JOURNAL OF COMPUTATIONAL BIOLOGY Volume 6, Numbers 3/4, Inc. Pp. 447–457 [16] Tropsha, A.; Gramatica, P.; Gombar,V.K.(2003),”The importance of being earnest: Validation is the absolute essential for successful application and interpretation of QSPR models.”, QSAR Comb. Sci. , 22, 69-77 [17] Partha Pratim Roy, Somnath Paul, Indrani Mitra and Kunal Roy(2009) ,“On Two Novel Parameters for Validation of Predictive QSAR Models” , Molecules , 14, 1660-1701 ISSN 1420-3049 [18] “Multile linear Regression”, http://www.ltrr.arizona.edu/~dmeko/notes_11.pdf [19] Dr. Frank Dieterle ,“Variable Selection by Simulated Annealing”, http://www.frankdieterle.de/phd/2_8_6.html [20] Hwang , Dan Nettleton ,“Principal Components Regression With Data-Choosen Components and related methods” , J.T. Gene www.math.cornell.edu/~hwang/pcr.pdf [21] Herv´e Abdi1 ,“Partial Least Squares(PLS) Regression ” ,The University of Texas at Dallas [22] Randall D. Tobias, “An introduction to partial least squares Regression”, SAS Institute Inc., Carry, NC www.ats.ucla.edu/stat/sas/library/pls.pdf [23] Golbraikh .A , and A. Tropsha, (2002),”Predictive QSAR modeing based on diversity of sampling of experimental datasets for the training and test set selection “, J. Comp Aided . Mol Design , 16:357-366. [24] “Influence of observations on the misclassification probability in quadratic discriminant analysis” ,https://lirias.kuleuven.be/bitstream/123456789/85608/1/qda.pdf [25] Craig A. James, “An introduction to the Computer Science and Chemistry of Chemical Information Systems” eMolecules, Inc. http://www.emolecules.com/doc/cheminformatics-101.php Authors Doreswamy received B.Sc degree in Computer Science and M.Sc Degree in Computer Science from University of Mysore in 1993 and 1995 respectively. Ph.D degree in Computer Science from Mangalore University in the year 2007. After completion of his Post-Graduation Degree, he subsequently joined and served asLecturer in Computer Science at St. Joseph’s College, Bangalore from 1996 1999.Then he has elevated to the position Reader in Computer Science at Mangalor Universityin year 2003. He was the Chairman of the Department of Post-Graduate Studies and research in computer science from 2003-2005 and from 2009-2008 and served at varies capacitiesin Mangalore University at present he is the Chairman of Board of Studies and Professor in Computer Science of Mangalore University. His areas of Research interests include Data Mining and Knowledge Discovery, Artificial Intelligence and Expert Systems, Bioinformatics ,Molecular modelling and simulation ,Computational Intelligence ,Nanotechnology, Image Processing and Pattern recognition. He has been granted a Major Research project entitled “Scientific Knowledge Discovery Systems (SKDS) for Advanced Engineering Materials Design Applications” from the funding agency University Grant Commission, New Delhi , India. He has been published about 30 contributed peer reviewed Papers at national/International Journal and Conferences. He received SHIKSHA RATTAN PURASKAR for his outstanding achievements in the year 2009 and RASTRIYA VIDYA SARASWATHI AWARD for outstanding achievement in chosen field of activity in the year 2010. Chanabasayya.M.Vastrad received B.E. degree and M.Tech. degree in the year 2001and 2006 respectively. Currently working towards his Ph.D Degree in Computer Scienceand Technology under the guidance of Dr. Doreswamy in the Department of Post-Graduate Studies and Research in Computer Science , Mangalore University. 62