2916 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 23, NO. 7, JULY 2014
Phase-Based Binarization of Ancient Document
Images: Model and Applications
Hossein Ziaei Nafchi, Reza Farrahi Moghaddam, Member, IEEE, and Mohamed Cheriet, Senior Member, IEEE
Abstract—In this paper, a phase-based binarization model for
ancient document images is proposed, as well as a postprocessing
method that can improve any binarization method and a ground
truth generation tool. Three feature maps derived from the phase
information of an input document image constitute the core of
this binarization model. These features are the maximum moment
of phase congruency covariance, a locally weighted mean phase
angle, and a phase preserved denoised image. The proposed
model consists of three standard steps: 1) preprocessing; 2) main
binarization; and 3) postprocessing. In the preprocessing and
main binarization steps, the features used are mainly phase
derived, while in the postprocessing step, specialized adaptive
Gaussian and median filters are considered. One of the outputs
of the binarization step, which shows high recall performance, is
used in a proposed postprocessing method to improve the performance
of other binarization methodologies. Finally, we develop
a ground truth generation tool, called PhaseGT, to simplify
and speed up the ground truth generation process for ancient
document images. The comprehensive experimental results on the
DIBCO’09, H-DIBCO’10, DIBCO’11, H-DIBCO’12, DIBCO’13,
PHIBD’12, and BICKLEY DIARY data sets show the robustness
of the proposed binarization method on various types of
degradation and document images.
Index Terms—Historical document binarization, phase-derived
features, ground truthing, document enhancement.
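To make the notion of a "high recall" output concrete, the following is a minimal sketch of pixel-wise precision, recall, and F-measure for binary maps, together with one possible way a high-recall map could be used to clean up another method's output. The function names, the toy arrays, and the choice of a plain logical AND are assumptions made for illustration; the abstract does not specify how the proposed postprocessing combines the two maps.

```python
def precision_recall_f(pred, truth):
    """Pixel-wise precision, recall, and F-measure for binary maps (1 = text)."""
    tp = fp = fn = 0
    for row_p, row_t in zip(pred, truth):
        for p, t in zip(row_p, row_t):
            if p and t:
                tp += 1          # text pixel correctly detected
            elif p and not t:
                fp += 1          # background marked as text
            elif t and not p:
                fn += 1          # text pixel missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


def mask_with_high_recall(other, high_recall):
    """Keep a text pixel of `other` only where the high-recall map also has text.

    ASSUMPTION: a plain logical AND, chosen for illustration only.
    """
    return [[o & h for o, h in zip(row_o, row_h)]
            for row_o, row_h in zip(other, high_recall)]


# Hypothetical 3x4 toy maps (1 = text, 0 = background).
truth = [[0, 1, 1, 0],
         [0, 1, 0, 0],
         [0, 0, 0, 0]]
high_recall = [[0, 1, 1, 1],   # covers every text pixel, plus extras
               [0, 1, 1, 0],
               [0, 0, 0, 0]]
other = [[0, 1, 0, 0],         # misses one text pixel, adds two noise pixels
         [0, 1, 0, 0],
         [1, 0, 0, 1]]

cleaned = mask_with_high_recall(other, high_recall)
before = precision_recall_f(other, truth)
after = precision_recall_f(cleaned, truth)
```

In this toy case the high-recall map contains all true text pixels, so masking removes the two isolated noise pixels from the other output without discarding any text it had already found: precision rises while recall stays the same.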
I. INTRODUCTION
LIBRARIES and archives around the world store an
abundance of old and historically important documents
and manuscripts. These documents accumulate a significant
amount of human heritage over time. However, many environmental
factors, improper handling, and the poor quality of
the materials used in their creation cause them to suffer a high
degree of degradation of various types. Today, there is a strong
move toward digitization of these manuscripts to preserve their
content for future generations. The huge amount of digital
data produced requires automatic processing, enhancement,
and recognition. A key step in all document image processing
workflows is binarization, but this is not a very sophisticated
process, which is unfortunate, as its performance has
a significant influence on the quality of OCR results. Many
research studies have been carried out to solve the problems
that arise in the binarization of old document images characterized
by many types of degradation [1]–[19], including
faded ink, bleed-through, show-through, uneven illumination,
variations in image contrast, and deterioration of the cellulose
structure [1], [20]. There are also differences in patterns of
hand-written and machine-printed documents, which add to the
difficulties associated with the binarization of old document
images.
Manuscript received August 8, 2013; revised January 29, 2014 and April 22,
2014; accepted April 26, 2014. Date of publication May 7, 2014; date of
current version May 27, 2014. This work was supported by the Natural
Sciences and Engineering Research Council of Canada. The associate editor
coordinating the review of this manuscript and approving it for publication
was Dr. Debargha Mukherjee.
The authors are with the Synchromedia Laboratory for Multimedia
Communication in Telepresence, École de Technologie Supérieure,
Montreal, QC H3C 1K3, Canada (e-mail: hossein.zi@synchromedia.ca;
rfarrahi@synchromedia.ca; mohamed.cheriet@etsmtl.ca).
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TIP.2014.2322451
Fig. 1. Sample document images selected from the DIBCO’09 [21],
H-DIBCO’10 [22], and DIBCO’11 datasets [23].
To the best of our knowledge, none of the proposed methods
can deal with all types of documents and degradation. For
more details, see the Related Work section. Fig. 1 shows some
of the degraded document images used in this paper.
In this paper, a robust phase-based binarization method is
proposed for the binarization and enhancement of historical
documents and manuscripts. The three main steps in the
proposed method are: preprocessing, main binarization, and
postprocessing. The preprocessing step mainly involves image
denoising with phase preservation [24], followed by some
morphological operations. We incorporate the Canny edge
detector [25] and a denoised image to obtain a binarized image
in rough form.
Then, we use the phase congruency features [18], [19],
[26] for the main binarization step. Phase congruency is
widely used in the machine vision and image processing
1057-7149 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
