2. Plan of My work
Collect Information about researchers
Researcher Affiliation
Journal Details
Conference Details
Books Details (Authors and Editors)
Constructing a graph according to their activity.
Defining Problem .
Proposing/ Designing a new technique .
Simulation and Performance Analysis of proposed technique.
2 Anand Bihari
4. Introduction
Many research professional are involved in activities like Publish a Journal
paper, Conference paper, Books, Articles etc.
Two or more than two persons works together .
Our objective
Find the relationship between professionals.
How many professional are working in the same area.
Which person are more active in a particular area.
Which person are suitable for guidance in particular area.
4 Anand Bihari
5. Our Work
So here we analyze research professional activity as mining.
For analyzing research professional activity, we required three types of
Information like
Research Professional Data(Personal information)
Activity
Relationship between Research professional and their activity
We extract data from IEEE, Springer, Science Direct, ACM digital library etc.
5 Anand Bihari
6. Citation
A citation can represent many types of links, such as links between authors,
publication, journals and conferences.
When researchers refers to another author’s works in their own publication work,
they cite it.
A citation index is a compilation of all the cited references form articles
published during a particular year or period.
A citation index allows to determine the researcher impact of publication
according to the number of times it has been cited by other researcher.
Self citation are not included in such citation count.
6 Anand Bihari
8. Citation example(II)
Author Paper Paper cited by Paper citation value Author citation value
A N,Z 2
B M,W 2
P1 6
C 0
D P,Z 2
A N,Z 2
M D 1
P2 N 0 5
O D,W 2
P 0
W M,N,A,C,P 5
X N 1
P3 6
Y 0
8 Z 0 Anand Bihari
9. h-index
Invented by Jorge Hirsh at university of California in 2005.
A popular method to measure the centrality of academic papers.
h-index =number of your papers h that have been cited at least h times.
The use of h-index aims at identifying researchers with more papers and
relevant impact over a period of time.
For any general “set of papers” one can arrange these papers in decreasing
order of the number of citations they received.
The h-index is then the largest rank h = r such that the paper on this rank (and
hence also all papers on rank 1,…,h) has h or more citations.
Hence the papers on ranks h + 1, h + 2, … have not more than h citations.
9 Anand Bihari
10. h-index example
Author Paper Paper Citation Paper citation value Rank h-index of author
Value in desc. Order
A 2 2 1
B 2 2 2
P1 2
C 0 2 3
D 2 0 4
A 2 2 1
M 1 2 2
P2 N 0 1 3 2
O 2 0 4
P 0 0 5
W 5 5 1
X 1 1 2
P3 1
Y 0 0 3
10 Z 0 0 4 Anand Bihari
11. g-index
The g-index is introduced as an improvement of the h-index of Hirsh to
measure the global citation performance of a set of articles.
This set is ranked in decreasing order of the number of citations that they
received, the g-index is the (unique) largest number such that the top g articles
received (together) at least g2 citations.
The g-index of an author is greater or equal to h-index(g>=h).
11 Anand Bihari
12. g-index example(I)
Author Paper TC r ∑TC r2 h-index g-index
A 2 1 2 1
B 2 2 4 4
P1 2 2
C 2 3 6 9
D 0 4 6 16
A 2 1 2 1
M 2 2 4 4
P2 N 1 3 5 9 2 2
O 0 4 5 16
P 0 5 5 25
W 5 1 5 1
X 1 2 6 4
P3 1 2
Y 0 3 6 9
12 Z 0 4 6 16 Anand Bihari
13. g-index example(II)
TC stands for the total number of citations for each paper on rank r = 1,2,... .
∑TC stands for the cumulative number of citations to the papers on rank 1,...,r
(for each r).
The h-index of author P1 is h = 2 and the g-index g = 2 Indeed h = 2 is the
highest rank such that all papers on rank 1,...,h have at least 2 citations (and
hence the papers on rank 3 or higher have not more than 2 citations). Also g =
2 is the highest rank such that the top 2 papers have at least 22 = 4 citations
(here 4>= 4); on rank 3 we have 6 < 32 = 9 citations.
13 Anand Bihari
14. Longevity
Longevity reflects the length of one author’s academic life. We consider the
year when one author publish his/her first paper as the beginning year of his/her
academic life and the last paper as the end year. Then longevity can be defined
as
longevity(A)=YA(A’s last paper) – YA(A’s first paper)
Example:
let author A published first paper in 1997 and the last paper in 2011.
Therefore, longevity of Author A = 2011 – 1997 = 14
14 Anand Bihari
15. Future Work
After designing a graph , we will implement mathematical equation or graph
algorithm to find the meaningful value of a node.
Research person are taken as vertices.
Edge between them having some weight like activity between them.
Find more influence(Active) person in such area.
Find more influence person in the network.
For this we will collect data from a website known as www.arnetminer.org .
Learn Diversity, Sociability, Activity of researcher in social network.
15 Anand Bihari
16. Future Literature Survey
Titles Name of Journal/Conferences Publication
Year
Link creation and profile IEEE International Conference on 2010
alignment in the aNobii Social Computing / IEEE
social network International Conference on
Privacy, Security, Risk and Trust
Design and implementation of a International Conference on 2011
web structure Mining algorithm internet technology and secured
using breadth first search transactions
Strategy for academic search
application
16 Anand Bihari
17. References
Bing Liu “Web Data Mining ” Springer International Edition.
Articles “Evaluating Scientists: Citations, Impact Factor, h-Index, Online
Page Hits and What Else?” written by M Jagadesh Kumar Editor-in-Chief,
IETE Technical Review, Department of Electrical Engineering, IIT, Hauz Khas,
New Delhi-110 016, India
Springers articles “Theory and practise of the g-index” published by-LEO
EGGHE, Universiteit Hasselt (UHasselt), Campus Diepenbeek (Belgium) , Vol.
69, No. 1 (2006) 131–152.
Website : www.arnetminer.org, Wikipedia.
17 Anand Bihari