SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Demystifying Advanced
Technologies to Find Solutions that
Work
Friday, Oct. 11 | 9:45 – 10:45
Presented by
Peter Oesterling
Assistant General Counsel |
Nationwide
Alex Ponce de Leon
Discovery Counsel | Intel
J. William Speros
Evidence Consulting Attorney |
Speros & Associates
“Technology-Assisted Review,” called by its nickname
“Predictive Coding,” describes a process whereby
computers are programmed to search a large amount of
data to find quickly and efficiently the data that meet a
particular requirement. Computer science and the
sciences of statistics and psychology inform its use. While
it bruises the human ego, scientists…determined that
…[i]t is now indubitable that technology-assisted review
is an appreciably better and more accurate means of
searching a set of data.”
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Foreword by John M. Facciola, U.S. Magistrate Judge
“Technology-Assisted Review,” called by its nickname
“Predictive Coding,” describes a process whereby
computers are programmed to search a large amount of
data to find quickly and efficiently the data that meet a
particular requirement. Computer science and the
sciences of statistics and psychology inform its use. While
it bruises the human ego, scientists…determined that
…[i]t is now indubitable that technology-assisted review
is an appreciably better and more accurate means of
searching a set of data.”
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Foreword by John M. Facciola, U.S. Magistrate Judge
Process: a series of
actions that produce
something or that lead
to a particular result
“Now, the methodology of the use of technology-
assisted review may itself be in dispute, with the
parties controverted to each other’s use of a
particular method or tool. Those controversies have
already lead to judicial decisions that have to
grapple with a wholly new way of searching and with
scientific principles derived from the science of
statistics or other disciplines.”
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Foreword by John M. Facciola, U.S. Magistrate Judge
“Now, the methodology of the use of technology-
assisted review may itself be in dispute, with the
parties controverted to each other’s use of a
particular method or tool. Those controversies have
already lead to judicial decisions that have to
grapple with a wholly new way of searching and with
scientific principles derived from the science of
statistics or other disciplines.”
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Foreword by John M. Facciola, U.S. Magistrate Judge
Methodology: a set of methods,
rules, or ideas that are important in a
science or art : a particular procedure
or set of procedures
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Predictive Coding:
An industry-specific term generally used to describe a
Technology-Assisted Review process involving the
use of a Machine Learning Algorithm to distinguish
Relevant from Non-Relevant Documents, based on
Subject Matter Expert(s)’ Coding of a Training Set of
Documents.
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Predictive Coding:
An industry-specific term generally used to describe a
Technology-Assisted Review process involving the
use of a Machine Learning Algorithm to distinguish
Relevant from Non-Relevant Documents, based on
Subject Matter Expert(s)’ Coding of a Training Set of
Documents.
“A word is not a crystal, transparent and unchanged, it
is the skin of a living thought and may vary greatly in
color and content according to the circumstances
and the time in which it is used.”
Justice Oliver Wendell Holmes Jr.,
Towne v. Eisner, 245 U.S. 418, 425 (1918)
THE GROSSMAN-CORMACK GLOSSARY OF
TECHNOLOGY-ASSISTED REVIEW
FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013)
Foreword by John M. Facciola, U.S. Magistrate Judge
“I think you should be more
explicit here in step two.”
Published as guest contributor to Ralph
Losey’s E-Discovery Team Blog Site:
http://e-discoveryteam.com/2013/04/28/predictive-codings-erroneous-zones-are-
emerging-junk-science/?shareadraft=517d80048f827
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
• “PBS’ Frontline’s Forensic Tools: What’s Reliable and
What’s Not-So-Scientific dispelled the infallibility, and
in some instances, the validity, of analytical
techniques long relied upon by our legal profession.”
• “Even if those techniques were not botched or
biased, their validity ranges from bought-and-paid-
for infomercials to, at best, an approximation.”
• “Back then attorneys and judges (and experts and
vendors) did with those junk sciences just what we
are doing now with respect to predictive coding:
allowing claims, however unjustified and
erroneous, to form the basis of our practices, to
influence our precedent and to accrue authority.”
“[T]hose of us who trust the scientific and
adversarial process recognize that erroneous
claims don’t naturally defeat truth. They
suppress truth, distract from truth and
sometimes persist so long that we forget to
inquire into the truth. Oftentimes, weak
interests seek to dispel erroneous claims
which are promoted by strong commercial
interests. With respect to predictive coding
my sense is that we are neither deluded nor
deceptive — well, not too much anyway —
but we just have not yet thought it through.”
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
“[T]hose of us who trust the scientific and
adversarial process recognize that erroneous
claims don’t naturally defeat truth. They
suppress truth, distract from truth and
sometimes persist so long that we forget to
inquire into the truth. Oftentimes, weak
interests seek to dispel erroneous claims
which are promoted by strong commercial
interests. With respect to predictive coding
my sense is that we are neither deluded nor
deceptive — well, not too much anyway —
but we just have not yet thought it through.”
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#1
Using a full-text search to identify
prospectively responsive documents and then
employing predictive coding to eliminate those
that are not responsive.
Erroneous Practice
#2
Pulling a random sample of documents to train
the initial seed set.
Erroneous Practice
#3
Identifying “magic numbers” of minimum:
• “Iterations”
• Responsive documents within a
randomly accumulated set
Erroneous Practice
#4
Asserting that Predictive Coding software is
the “gold standard” for document retrieval in
complex matters.
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#4
Asserting that Predictive Coding software is
the “gold standard” for document retrieval in
complex matters.
Is Erroneous
Because
It asserts that predictive coding is a standard:
• Share some commonly understood
characteristics but no precise attributes
• Involves some general methodologies but no
clear rules
• Are associated with general aspirations but
no comprehensively defined operations.
Example All advertisements or orders for “predictive
coding”
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#4
Asserting that Predictive Coding software is
the “gold standard” for document retrieval in
complex matters.
Is Erroneous
Because
It asserts that predictive coding is a standard:
• Share some commonly understood
characteristics but no precise attributes
• Involves some general methodologies but no
clear rules
• Are associated with general aspirations but
no comprehensively defined operations.
Example All advertisements or orders for “predictive
coding”
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Gold
Standard vs “Standard”
Erroneous Practice
#2
Pulling a random sample of documents to train
the initial seed set.
Is Erroneous
Because
A. Looks for relevance in all the wrong places:
Thoughtful researchers don’t try learn about
relevant docs by examining irrelevant ones.
B. It turns a blind eye to what is staring you in
the eye: denies that attorneys know what
they are paid to know: where to look and
what to find.
C. Measures the wrong stuff:
• Constrained and circular “like” definition
• Prevalence vs Relevance vs Probativeness
Example Global Aerospace v. Landow Aviation (settled
without court ruling re strategy)
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#2
Pulling a random sample of documents to train
the initial seed set.
Is Erroneous
Because
A. Looks for relevance in all the wrong places:
Thoughtful researchers don’t try learn about
relevant docs by examining irrelevant ones.
B. It turns a blind eye to what is staring you in
the eye: denies that attorneys know what
they are paid to know: where to look and
what to find.
C. Measures the wrong stuff:
• Constrained and circular “like” definition
• Prevalence vs Relevance vs Probativeness
Example Global Aerospace v. Landow Aviation (settled
without court ruling re strategy)
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#1
Using a full-text search to identify
prospectively responsive documents and then
employing predictive coding to eliminate those
that are not responsive.
Is Erroneous
Because
A.Over-relies and under-delivers: presumed
arrogance or clairvoyance
B.It arbitrarily places documents out-of-sight
and, therefore, out-of-mind: likelihood that
responsive documents will ever be produced
but dumbing-down the predictive coding
intelligence
Example In re: Biomet M2a Magnum Hip Implant Prods.
Liab. Litig. (endorsed by court)
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice
#1
Using a full-text search to identify
prospectively responsive documents and then
employing predictive coding to eliminate those
that are not responsive.
Is Erroneous
Because
A.Over-relies and under-delivers: presumed
arrogance or clairvoyance
B.It arbitrarily places documents out-of-sight
and, therefore, out-of-mind: likelihood that
responsive documents will ever be produced
but dumbing-down the predictive coding
intelligence
Example In re: Biomet M2a Magnum Hip Implant Prods.
Liab. Litig. (endorsed by court)
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Single Pass Full-Text Filtering…
Erroneous Practice #3 Identifying “magic numbers” of minimum:
• “Iterations”
• Responsive documents within a randomly
accumulated set
Is Erroneous Because A.You may not be able to get there from here:
Don’t know starting point or ending point
B.You don’t know what isn’t yet known: Cannot
predict alternative paths
C. Consider low frequency, high probativeness
D.Who’s the witness?
Example • “This [iteration] process shall be repeated for a total
of seven iterations… [Requesting party pays] costs and
fees… [for] more 40,000 documents.” (DaSilva Moore)
• Vendors’ affidavits in various matters
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Erroneous Practice #3 Identifying “magic numbers” of minimum:
• “Iterations”
• Responsive documents within a randomly
accumulated set
Is Erroneous Because A.You may not be able to get there from here:
Don’t know starting point or ending point
B.You don’t know what isn’t yet known: Cannot
predict alternative paths
C. Consider low frequency, high probativeness
D.Who’s the witness?
Example • “This [iteration] process shall be repeated for a total
of seven iterations… [Requesting party pays] costs and
fees… [for] more 40,000 documents.” (DaSilva Moore)
• Vendors’ affidavits in various matters
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
May not be able to get there even with a
“Magic” number of steps…
Erroneous Practice
#1
Using a full-text search to identify
prospectively responsive documents and then
employing predictive coding to eliminate those
that are not responsive.
Erroneous Practice
#2
Pulling a random sample of documents to train
the initial seed set.
Erroneous Practice
#3
Identifying “magic numbers” of minimum:
• “Iterations”
• Responsive documents within a
randomly accumulated set
Erroneous Practice
#4
Asserting that Predictive Coding software is
the “gold standard” for document retrieval in
complex matters.
“Predictive Coding’s Erroneous Zones
Are Emerging Junk Science”
Search Mechanisms’ InferencesInferences(risk)rerecall
Search Mechanism
Databases
Files, Folders
(in place)
End-user
tags
Files, Folders
(per user)
Duplicates
“Technology
Assisted Review”
via Machine
Learning
Key words
Random
Sampling
Similarity/
Clusters
Sorting
Similarity
Clustering
Your Notes
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
Your Notes
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
Your Notes
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________
_______________________________________

Weitere ähnliche Inhalte

Andere mochten auch

8_Gradnja in energija_Tadej Macura
8_Gradnja in energija_Tadej Macura8_Gradnja in energija_Tadej Macura
8_Gradnja in energija_Tadej Macura
Tadej Macura
 
PROBABILIDADES, COMBINACIONES Y PERMUTACIONES
PROBABILIDADES, COMBINACIONES Y PERMUTACIONESPROBABILIDADES, COMBINACIONES Y PERMUTACIONES
PROBABILIDADES, COMBINACIONES Y PERMUTACIONES
Santia06
 
ABC debate sustitución VTA
ABC debate sustitución VTAABC debate sustitución VTA
ABC debate sustitución VTA
Michel Maya
 
Hilton los movimientos campesinos en inglaterra antes de 1381
Hilton   los movimientos campesinos en inglaterra antes de 1381Hilton   los movimientos campesinos en inglaterra antes de 1381
Hilton los movimientos campesinos en inglaterra antes de 1381
pinoemilianomeincke
 
Madison Conroy- The University of Dayton- Strategic Analysis Project-3
Madison Conroy- The University of Dayton- Strategic Analysis Project-3Madison Conroy- The University of Dayton- Strategic Analysis Project-3
Madison Conroy- The University of Dayton- Strategic Analysis Project-3
Mitch Coughlin
 
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
Pino Ciampolillo
 
que es informatica juridica y sus diferentes enfoques
que es informatica juridica y sus diferentes enfoquesque es informatica juridica y sus diferentes enfoques
que es informatica juridica y sus diferentes enfoques
gavilucho
 
A f google doctors 20130831
A f google doctors 20130831A f google doctors 20130831
A f google doctors 20130831
Janet Hall
 

Andere mochten auch (15)

Corporations MUST hire a law firm!! Florida wins Justice loses
Corporations MUST hire a law firm!! Florida wins Justice losesCorporations MUST hire a law firm!! Florida wins Justice loses
Corporations MUST hire a law firm!! Florida wins Justice loses
 
Reporte Punto Jalisco Abierto Marzo 2015
Reporte Punto Jalisco Abierto Marzo 2015Reporte Punto Jalisco Abierto Marzo 2015
Reporte Punto Jalisco Abierto Marzo 2015
 
Certificate(29)
Certificate(29)Certificate(29)
Certificate(29)
 
8_Gradnja in energija_Tadej Macura
8_Gradnja in energija_Tadej Macura8_Gradnja in energija_Tadej Macura
8_Gradnja in energija_Tadej Macura
 
Igniting Information Governance by Uniting Departments, Disciplines and Data ...
Igniting Information Governance by Uniting Departments, Disciplines and Data ...Igniting Information Governance by Uniting Departments, Disciplines and Data ...
Igniting Information Governance by Uniting Departments, Disciplines and Data ...
 
El Quijote
El Quijote El Quijote
El Quijote
 
PROBABILIDADES, COMBINACIONES Y PERMUTACIONES
PROBABILIDADES, COMBINACIONES Y PERMUTACIONESPROBABILIDADES, COMBINACIONES Y PERMUTACIONES
PROBABILIDADES, COMBINACIONES Y PERMUTACIONES
 
ABC debate sustitución VTA
ABC debate sustitución VTAABC debate sustitución VTA
ABC debate sustitución VTA
 
Hilton los movimientos campesinos en inglaterra antes de 1381
Hilton   los movimientos campesinos en inglaterra antes de 1381Hilton   los movimientos campesinos en inglaterra antes de 1381
Hilton los movimientos campesinos en inglaterra antes de 1381
 
Madison Conroy- The University of Dayton- Strategic Analysis Project-3
Madison Conroy- The University of Dayton- Strategic Analysis Project-3Madison Conroy- The University of Dayton- Strategic Analysis Project-3
Madison Conroy- The University of Dayton- Strategic Analysis Project-3
 
The erp speaker series
The erp speaker seriesThe erp speaker series
The erp speaker series
 
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
Italcementi monitoraggio febbraio 2012 febbraio 2013 protocollo n 53156 del 0...
 
Direitos autorais
Direitos autoraisDireitos autorais
Direitos autorais
 
que es informatica juridica y sus diferentes enfoques
que es informatica juridica y sus diferentes enfoquesque es informatica juridica y sus diferentes enfoques
que es informatica juridica y sus diferentes enfoques
 
A f google doctors 20130831
A f google doctors 20130831A f google doctors 20130831
A f google doctors 20130831
 

Ähnlich wie Demystifying Advanced Technologies to Find Solutions that Work

Four article assignment
Four article assignmentFour article assignment
Four article assignment
Maria Donohue
 
The Changing Role Of The Experts
The Changing Role Of The ExpertsThe Changing Role Of The Experts
The Changing Role Of The Experts
alisonegypt
 
The changing role of the experts
The changing role of the expertsThe changing role of the experts
The changing role of the experts
Alison Stevens
 
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
ijtsrd
 
Forensic science intro
Forensic science introForensic science intro
Forensic science intro
Maria Donohue
 
Forensic Expert Cross Examination
Forensic Expert Cross ExaminationForensic Expert Cross Examination
Forensic Expert Cross Examination
ivneetsingh
 
Law day presentation
Law day presentationLaw day presentation
Law day presentation
David Ricker
 
Inscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed downInscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed down
LaticiaGrissomzz
 
Inscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed downInscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed down
TatianaMajor22
 
Process of legal_research_complet_show2
Process of legal_research_complet_show2Process of legal_research_complet_show2
Process of legal_research_complet_show2
jzahrndt
 
Forensic Science Intro
Forensic Science IntroForensic Science Intro
Forensic Science Intro
Maria Donohue
 

Ähnlich wie Demystifying Advanced Technologies to Find Solutions that Work (20)

Junk Science On Trial
Junk Science On TrialJunk Science On Trial
Junk Science On Trial
 
Four article assignment
Four article assignmentFour article assignment
Four article assignment
 
DOJ
DOJDOJ
DOJ
 
The Changing Role Of The Experts
The Changing Role Of The ExpertsThe Changing Role Of The Experts
The Changing Role Of The Experts
 
The changing role of the experts
The changing role of the expertsThe changing role of the experts
The changing role of the experts
 
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
Critical Analysis of Admissibility of Forensic Evidence in the Indian Crimina...
 
Forensic science intro
Forensic science introForensic science intro
Forensic science intro
 
Sarcasm detection literature survey
Sarcasm detection literature surveySarcasm detection literature survey
Sarcasm detection literature survey
 
Forensic Expert Cross Examination
Forensic Expert Cross ExaminationForensic Expert Cross Examination
Forensic Expert Cross Examination
 
Litigating Flawed Forensic Science at Every Stage
Litigating Flawed Forensic Science at Every StageLitigating Flawed Forensic Science at Every Stage
Litigating Flawed Forensic Science at Every Stage
 
Law day presentation
Law day presentationLaw day presentation
Law day presentation
 
Inscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed downInscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed down
 
Inscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed downInscribed on the Temple of Apollo at Delphi, and echoed down
Inscribed on the Temple of Apollo at Delphi, and echoed down
 
Introduction to forensic science
Introduction to forensic scienceIntroduction to forensic science
Introduction to forensic science
 
Research Skills for International Law Moot Court Competition 'Phillip C. Jess...
Research Skills for International Law Moot Court Competition 'Phillip C. Jess...Research Skills for International Law Moot Court Competition 'Phillip C. Jess...
Research Skills for International Law Moot Court Competition 'Phillip C. Jess...
 
Process of legal_research_complet_show2
Process of legal_research_complet_show2Process of legal_research_complet_show2
Process of legal_research_complet_show2
 
Westlaw Research Strategies
Westlaw Research StrategiesWestlaw Research Strategies
Westlaw Research Strategies
 
Forensic Science Intro
Forensic Science IntroForensic Science Intro
Forensic Science Intro
 
Unit1 lab
Unit1 labUnit1 lab
Unit1 lab
 
Waarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeftWaarom LegalTech de toekomst heeft
Waarom LegalTech de toekomst heeft
 

Mehr von Emily Mermell

Mesas Redondas I - Herramientas Analiticas
Mesas Redondas I - Herramientas AnaliticasMesas Redondas I - Herramientas Analiticas
Mesas Redondas I - Herramientas Analiticas
Emily Mermell
 
Mesas Redondas H - Beneficiario Final
Mesas Redondas H - Beneficiario FinalMesas Redondas H - Beneficiario Final
Mesas Redondas H - Beneficiario Final
Emily Mermell
 
Mesas Redondas G - Nuevas Tendencias Lat Am Sucre
Mesas Redondas G - Nuevas Tendencias Lat Am SucreMesas Redondas G - Nuevas Tendencias Lat Am Sucre
Mesas Redondas G - Nuevas Tendencias Lat Am Sucre
Emily Mermell
 
Mesas Redondas F - Fraude
Mesas Redondas F - FraudeMesas Redondas F - Fraude
Mesas Redondas F - Fraude
Emily Mermell
 
Mesas Redondas E - Auditoria Forense
Mesas Redondas E - Auditoria ForenseMesas Redondas E - Auditoria Forense
Mesas Redondas E - Auditoria Forense
Emily Mermell
 
Mesas Redondas D - Gafi Evasion-Fiscal
Mesas Redondas D - Gafi Evasion-FiscalMesas Redondas D - Gafi Evasion-Fiscal
Mesas Redondas D - Gafi Evasion-Fiscal
Emily Mermell
 
Mesas Redondas C - Fideicomisos
Mesas Redondas C - FideicomisosMesas Redondas C - Fideicomisos
Mesas Redondas C - Fideicomisos
Emily Mermell
 
Mesas Redondas B - Clientes Alto-Riesgo
Mesas Redondas B - Clientes Alto-RiesgoMesas Redondas B - Clientes Alto-Riesgo
Mesas Redondas B - Clientes Alto-Riesgo
Emily Mermell
 
Mesas Redondas A - Delitos Ciberneticos
Mesas Redondas A - Delitos CiberneticosMesas Redondas A - Delitos Ciberneticos
Mesas Redondas A - Delitos Ciberneticos
Emily Mermell
 
Panel General 5 - Corrupcion
Panel General 5 - CorrupcionPanel General 5 - Corrupcion
Panel General 5 - Corrupcion
Emily Mermell
 
Panel General 4 - Fuentes Abiertas
Panel General 4 - Fuentes AbiertasPanel General 4 - Fuentes Abiertas
Panel General 4 - Fuentes Abiertas
Emily Mermell
 
Panel General 3 - FATCA
Panel General 3 - FATCAPanel General 3 - FATCA
Panel General 3 - FATCA
Emily Mermell
 
Panel General 2 - Convergencia
Panel General 2 - ConvergenciaPanel General 2 - Convergencia
Panel General 2 - Convergencia
Emily Mermell
 
Panel General 1 - FATCA
Panel General 1 - FATCAPanel General 1 - FATCA
Panel General 1 - FATCA
Emily Mermell
 

Mehr von Emily Mermell (17)

Mesas Redondas I - Herramientas Analiticas
Mesas Redondas I - Herramientas AnaliticasMesas Redondas I - Herramientas Analiticas
Mesas Redondas I - Herramientas Analiticas
 
Mesas Redondas H - Beneficiario Final
Mesas Redondas H - Beneficiario FinalMesas Redondas H - Beneficiario Final
Mesas Redondas H - Beneficiario Final
 
Mesas Redondas G - Nuevas Tendencias Lat Am Sucre
Mesas Redondas G - Nuevas Tendencias Lat Am SucreMesas Redondas G - Nuevas Tendencias Lat Am Sucre
Mesas Redondas G - Nuevas Tendencias Lat Am Sucre
 
Mesas Redondas F - Fraude
Mesas Redondas F - FraudeMesas Redondas F - Fraude
Mesas Redondas F - Fraude
 
Mesas Redondas E - Auditoria Forense
Mesas Redondas E - Auditoria ForenseMesas Redondas E - Auditoria Forense
Mesas Redondas E - Auditoria Forense
 
Mesas Redondas D - Gafi Evasion-Fiscal
Mesas Redondas D - Gafi Evasion-FiscalMesas Redondas D - Gafi Evasion-Fiscal
Mesas Redondas D - Gafi Evasion-Fiscal
 
Mesas Redondas C - Fideicomisos
Mesas Redondas C - FideicomisosMesas Redondas C - Fideicomisos
Mesas Redondas C - Fideicomisos
 
Mesas Redondas B - Clientes Alto-Riesgo
Mesas Redondas B - Clientes Alto-RiesgoMesas Redondas B - Clientes Alto-Riesgo
Mesas Redondas B - Clientes Alto-Riesgo
 
Mesas Redondas A - Delitos Ciberneticos
Mesas Redondas A - Delitos CiberneticosMesas Redondas A - Delitos Ciberneticos
Mesas Redondas A - Delitos Ciberneticos
 
Panel General 5 - Corrupcion
Panel General 5 - CorrupcionPanel General 5 - Corrupcion
Panel General 5 - Corrupcion
 
Panel General 4 - Fuentes Abiertas
Panel General 4 - Fuentes AbiertasPanel General 4 - Fuentes Abiertas
Panel General 4 - Fuentes Abiertas
 
Panel General 3 - FATCA
Panel General 3 - FATCAPanel General 3 - FATCA
Panel General 3 - FATCA
 
Panel General 2 - Convergencia
Panel General 2 - ConvergenciaPanel General 2 - Convergencia
Panel General 2 - Convergencia
 
Panel General 1 - FATCA
Panel General 1 - FATCAPanel General 1 - FATCA
Panel General 1 - FATCA
 
Mock Hearing: The Great E-Discovery Cost-Shifting Debate
Mock Hearing: The Great E-Discovery Cost-Shifting DebateMock Hearing: The Great E-Discovery Cost-Shifting Debate
Mock Hearing: The Great E-Discovery Cost-Shifting Debate
 
Helping Hoarders: Practical, Cost-Effective Strategies for Defensibly Deletin...
Helping Hoarders: Practical, Cost-Effective Strategies for Defensibly Deletin...Helping Hoarders: Practical, Cost-Effective Strategies for Defensibly Deletin...
Helping Hoarders: Practical, Cost-Effective Strategies for Defensibly Deletin...
 
Getting Past Guesswork: Achieving Cost Predictability through Process and Rep...
Getting Past Guesswork: Achieving Cost Predictability through Process and Rep...Getting Past Guesswork: Achieving Cost Predictability through Process and Rep...
Getting Past Guesswork: Achieving Cost Predictability through Process and Rep...
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Kürzlich hochgeladen (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 

Demystifying Advanced Technologies to Find Solutions that Work

  • 1. Demystifying Advanced Technologies to Find Solutions that Work Friday, Oct. 11 | 9:45 – 10:45 Presented by
  • 2. Peter Oesterling Assistant General Counsel | Nationwide
  • 3. Alex Ponce de Leon Discovery Counsel | Intel
  • 4. J. William Speros Evidence Consulting Attorney | Speros & Associates
  • 5. “Technology-Assisted Review,” called by its nickname “Predictive Coding,” describes a process whereby computers are programmed to search a large amount of data to find quickly and efficiently the data that meet a particular requirement. Computer science and the sciences of statistics and psychology inform its use. While it bruises the human ego, scientists…determined that …[i]t is now indubitable that technology-assisted review is an appreciably better and more accurate means of searching a set of data.” THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Foreword by John M. Facciola, U.S. Magistrate Judge
  • 6. “Technology-Assisted Review,” called by its nickname “Predictive Coding,” describes a process whereby computers are programmed to search a large amount of data to find quickly and efficiently the data that meet a particular requirement. Computer science and the sciences of statistics and psychology inform its use. While it bruises the human ego, scientists…determined that …[i]t is now indubitable that technology-assisted review is an appreciably better and more accurate means of searching a set of data.” THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Foreword by John M. Facciola, U.S. Magistrate Judge Process: a series of actions that produce something or that lead to a particular result
  • 7. “Now, the methodology of the use of technology- assisted review may itself be in dispute, with the parties controverted to each other’s use of a particular method or tool. Those controversies have already lead to judicial decisions that have to grapple with a wholly new way of searching and with scientific principles derived from the science of statistics or other disciplines.” THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Foreword by John M. Facciola, U.S. Magistrate Judge
  • 8. “Now, the methodology of the use of technology- assisted review may itself be in dispute, with the parties controverted to each other’s use of a particular method or tool. Those controversies have already lead to judicial decisions that have to grapple with a wholly new way of searching and with scientific principles derived from the science of statistics or other disciplines.” THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Foreword by John M. Facciola, U.S. Magistrate Judge Methodology: a set of methods, rules, or ideas that are important in a science or art : a particular procedure or set of procedures
  • 9. THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Predictive Coding: An industry-specific term generally used to describe a Technology-Assisted Review process involving the use of a Machine Learning Algorithm to distinguish Relevant from Non-Relevant Documents, based on Subject Matter Expert(s)’ Coding of a Training Set of Documents.
  • 10. THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Predictive Coding: An industry-specific term generally used to describe a Technology-Assisted Review process involving the use of a Machine Learning Algorithm to distinguish Relevant from Non-Relevant Documents, based on Subject Matter Expert(s)’ Coding of a Training Set of Documents.
  • 11. “A word is not a crystal, transparent and unchanged, it is the skin of a living thought and may vary greatly in color and content according to the circumstances and the time in which it is used.” Justice Oliver Wendell Holmes Jr., Towne v. Eisner, 245 U.S. 418, 425 (1918) THE GROSSMAN-CORMACK GLOSSARY OF TECHNOLOGY-ASSISTED REVIEW FEDERAL COURTS LAW REVIEW Volume 7, Issue 1 (2013) Foreword by John M. Facciola, U.S. Magistrate Judge
  • 12.
  • 13.
  • 14.
  • 15. “I think you should be more explicit here in step two.”
  • 16. Published as guest contributor to Ralph Losey’s E-Discovery Team Blog Site: http://e-discoveryteam.com/2013/04/28/predictive-codings-erroneous-zones-are- emerging-junk-science/?shareadraft=517d80048f827 “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 17. “Predictive Coding’s Erroneous Zones Are Emerging Junk Science” • “PBS’ Frontline’s Forensic Tools: What’s Reliable and What’s Not-So-Scientific dispelled the infallibility, and in some instances, the validity, of analytical techniques long relied upon by our legal profession.” • “Even if those techniques were not botched or biased, their validity ranges from bought-and-paid- for infomercials to, at best, an approximation.” • “Back then attorneys and judges (and experts and vendors) did with those junk sciences just what we are doing now with respect to predictive coding: allowing claims, however unjustified and erroneous, to form the basis of our practices, to influence our precedent and to accrue authority.”
  • 18. “[T]hose of us who trust the scientific and adversarial process recognize that erroneous claims don’t naturally defeat truth. They suppress truth, distract from truth and sometimes persist so long that we forget to inquire into the truth. Oftentimes, weak interests seek to dispel erroneous claims which are promoted by strong commercial interests. With respect to predictive coding my sense is that we are neither deluded nor deceptive — well, not too much anyway — but we just have not yet thought it through.” “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 19. “[T]hose of us who trust the scientific and adversarial process recognize that erroneous claims don’t naturally defeat truth. They suppress truth, distract from truth and sometimes persist so long that we forget to inquire into the truth. Oftentimes, weak interests seek to dispel erroneous claims which are promoted by strong commercial interests. With respect to predictive coding my sense is that we are neither deluded nor deceptive — well, not too much anyway — but we just have not yet thought it through.” “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 20. Erroneous Practice #1 Using a full-text search to identify prospectively responsive documents and then employing predictive coding to eliminate those that are not responsive. Erroneous Practice #2 Pulling a random sample of documents to train the initial seed set. Erroneous Practice #3 Identifying “magic numbers” of minimum: • “Iterations” • Responsive documents within a randomly accumulated set Erroneous Practice #4 Asserting that Predictive Coding software is the “gold standard” for document retrieval in complex matters. “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 21. Erroneous Practice #4 Asserting that Predictive Coding software is the “gold standard” for document retrieval in complex matters. Is Erroneous Because It asserts that predictive coding is a standard: • Share some commonly understood characteristics but no precise attributes • Involves some general methodologies but no clear rules • Are associated with general aspirations but no comprehensively defined operations. Example All advertisements or orders for “predictive coding” “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 22. Erroneous Practice #4 Asserting that Predictive Coding software is the “gold standard” for document retrieval in complex matters. Is Erroneous Because It asserts that predictive coding is a standard: • Share some commonly understood characteristics but no precise attributes • Involves some general methodologies but no clear rules • Are associated with general aspirations but no comprehensively defined operations. Example All advertisements or orders for “predictive coding” “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 24. Erroneous Practice #2 Pulling a random sample of documents to train the initial seed set. Is Erroneous Because A. Looks for relevance in all the wrong places: Thoughtful researchers don’t try learn about relevant docs by examining irrelevant ones. B. It turns a blind eye to what is staring you in the eye: denies that attorneys know what they are paid to know: where to look and what to find. C. Measures the wrong stuff: • Constrained and circular “like” definition • Prevalence vs Relevance vs Probativeness Example Global Aerospace v. Landow Aviation (settled without court ruling re strategy) “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 25. Erroneous Practice #2 Pulling a random sample of documents to train the initial seed set. Is Erroneous Because A. Looks for relevance in all the wrong places: Thoughtful researchers don’t try learn about relevant docs by examining irrelevant ones. B. It turns a blind eye to what is staring you in the eye: denies that attorneys know what they are paid to know: where to look and what to find. C. Measures the wrong stuff: • Constrained and circular “like” definition • Prevalence vs Relevance vs Probativeness Example Global Aerospace v. Landow Aviation (settled without court ruling re strategy) “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 26.
  • 27. Erroneous Practice #1 Using a full-text search to identify prospectively responsive documents and then employing predictive coding to eliminate those that are not responsive. Is Erroneous Because A.Over-relies and under-delivers: presumed arrogance or clairvoyance B.It arbitrarily places documents out-of-sight and, therefore, out-of-mind: likelihood that responsive documents will ever be produced but dumbing-down the predictive coding intelligence Example In re: Biomet M2a Magnum Hip Implant Prods. Liab. Litig. (endorsed by court) “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 28. Erroneous Practice #1 Using a full-text search to identify prospectively responsive documents and then employing predictive coding to eliminate those that are not responsive. Is Erroneous Because A.Over-relies and under-delivers: presumed arrogance or clairvoyance B.It arbitrarily places documents out-of-sight and, therefore, out-of-mind: likelihood that responsive documents will ever be produced but dumbing-down the predictive coding intelligence Example In re: Biomet M2a Magnum Hip Implant Prods. Liab. Litig. (endorsed by court) “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 29. Single Pass Full-Text Filtering…
  • 30. Erroneous Practice #3 Identifying “magic numbers” of minimum: • “Iterations” • Responsive documents within a randomly accumulated set Is Erroneous Because A.You may not be able to get there from here: Don’t know starting point or ending point B.You don’t know what isn’t yet known: Cannot predict alternative paths C. Consider low frequency, high probativeness D.Who’s the witness? Example • “This [iteration] process shall be repeated for a total of seven iterations… [Requesting party pays] costs and fees… [for] more 40,000 documents.” (DaSilva Moore) • Vendors’ affidavits in various matters “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 31. Erroneous Practice #3 Identifying “magic numbers” of minimum: • “Iterations” • Responsive documents within a randomly accumulated set Is Erroneous Because A.You may not be able to get there from here: Don’t know starting point or ending point B.You don’t know what isn’t yet known: Cannot predict alternative paths C. Consider low frequency, high probativeness D.Who’s the witness? Example • “This [iteration] process shall be repeated for a total of seven iterations… [Requesting party pays] costs and fees… [for] more 40,000 documents.” (DaSilva Moore) • Vendors’ affidavits in various matters “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 32. May not be able to get there even with a “Magic” number of steps…
  • 33. Erroneous Practice #1 Using a full-text search to identify prospectively responsive documents and then employing predictive coding to eliminate those that are not responsive. Erroneous Practice #2 Pulling a random sample of documents to train the initial seed set. Erroneous Practice #3 Identifying “magic numbers” of minimum: • “Iterations” • Responsive documents within a randomly accumulated set Erroneous Practice #4 Asserting that Predictive Coding software is the “gold standard” for document retrieval in complex matters. “Predictive Coding’s Erroneous Zones Are Emerging Junk Science”
  • 34. Search Mechanisms’ InferencesInferences(risk)rerecall Search Mechanism Databases Files, Folders (in place) End-user tags Files, Folders (per user) Duplicates “Technology Assisted Review” via Machine Learning Key words Random Sampling Similarity/ Clusters Sorting Similarity Clustering