SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
EUROPEAN LEGISLATIVE
RESPONSES TO
INTERNATIONAL TERRORISM
A Database of Laws in German Plenary Protocols
Outline
1. Introduction
2. Xtract: a software for extraction
3. Expected results
4. Discussion
Introduction1
Linking Laws and Plenary Protocols
 Extract agenda items and participants‘ information
from plenary protocols from terms 12 – 16
 Use GESTA as an index of laws
 Link laws to plenary speeches and vice versa
1 introduction
We have ...
 Plenary protocol PDFs from electoral terms 12 – 16
 1990-12-10 – present
 120.655 pages in 1162 documents
 GESTA database of laws, terms 8 – 16
1 introduction
We have ...
 Plenary protocol PDFs from electoral terms 12 – 16
 1990-12-10 – present
 120.655 pages in 1162 documents
 GESTA database of laws, terms 8 – 16
 : ) and ambition to deliver excellent results
1 introduction
We want to ...
 Extract from 1990 up to the present time
 For each plenary session
 Session number, date, ...
 For each item on the agenda
 Descriptions
 list of participants
 printed matter references
 speech texts
 tables
 Link the results with our database of laws
1 introduction
Challanges
 Older electoral terms are not digitalized
 Each electoral term requires different pattern matching
strategies
 GESTA tables generated for the project
 No consistent, direct links to plenary protocols
 Course of legislation undetailed
 Quality difference between older and newer terms
 OCR errors
 GESTA Database – no improvements possible for older terms
1 introduction
Xtract2
Xtract – software for data mining
 a set of modern tools to annotate plenary protocols
with relevant pieces of information
 preserves document layout
 uses multiple strategies to mark important text blocks
 location, shape and internal structure of blocks
 pattern matching
 Euclidean distances
 statistics
 comes with its own document viewer
2 software
Xtract – implementation details
 PDF access
 pdftohtml (custom builds)
 Acrobat Professional 9 Extended (older terms)
 Data manipulation
 C# 4.0: LINQ to XML
 Visualization
 C# 4.0: WPF (Windows Presentation Foundation)
 Statistics
 CORSIS: my personal open-source project for corpus analysis
2 software
Xtract – why XML?
 Simple and highly-`liquid´ file format
 based on simple international standards
 excellent APIs in many programming languages
 converts easily into other formats
 used in Microsoft Office, OpenOffice.org
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 elements
 attributes
 hierarchical relations
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 elements: event, speaker, name, is
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 attributes: id
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 children: event → speaker
 parents: event ← speaker
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 descendants: event → speaker, name, is
2 software
Xtract – XML crash course
 <event>
<speaker id=„12“>
<name>Franz Müntefering</name>
<is>Bundesminister für Arbeit und Soziales</is>
</speaker>
</event>
 siblings: name ↔ is
2 software
Xtract – how does it function?
 extracts texts from PDF files along with layout
information
2 software
Xtract – how does it function?
 merges texts into proximity blocks
2 software
Xtract – how does it function?
 marks ambient constructs
2 software
Xtract – how does it function?
 marks agenda items
2 software
Xtract – how does it function?
 annotates blocks with sections they belong to
2 software
Expected Results3
DIGESTA
 Based on `GESTA Gesamtausgaben´: terms 14 – 16
 Always up-to-date
 Detailed course of legislation information
 Direct links to plenary protocols
 Can be complemented with keywords from MZES
 http://corsis.sf.net/ipw/digesta/
3 results
Done!!
PLEDA – Plenary Protocols Database
 Based on plenary protocols
 Links agenda items multidirectionally with
participants
 Interesting for different linguistic/political research
purposes
3 results
PLEDA – Project Status
12 13 14 15 16
OCR
Run X X - - -
Correction - - -
XML Conversion * * X X X
Division C./S. X X X
Block Merging * * X X X
Ambient Constructs X X X
Page Sections X X X
Interjections * * X X X
Contents * * X
Speeches * * X
Contents-speech links * * X
3 results
GLIT – German Legislative Resp ...
Laws
• .law files
• from GESTA
Protocols
• .pro files
• from BTP
GLIT
• German part of
ELIT
3 results
Discussion4
Open questions
 Project hosting
 Where can we host the results?
 Initial GLIT interface
 Web service?
 Rich client-side app?
 Any questions from your side?
4 discussion

Weitere ähnliche Inhalte

Was ist angesagt?

[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...Ontotext
 
Web Archive Research Skills and Tools Survey (WARST)
 Web Archive Research Skills and Tools Survey (WARST) Web Archive Research Skills and Tools Survey (WARST)
Web Archive Research Skills and Tools Survey (WARST)WARCnet
 
Automatic creation of mappings between classification systems
Automatic creation of mappings between classification systemsAutomatic creation of mappings between classification systems
Automatic creation of mappings between classification systemsMagnus Pfeffer
 
Automatic creation of mappings between classification systems for bibliograph...
Automatic creation of mappings between classification systems for bibliograph...Automatic creation of mappings between classification systems for bibliograph...
Automatic creation of mappings between classification systems for bibliograph...Magnus Pfeffer
 
ArchAIDE Kick-Off Meeting - WP5
ArchAIDE Kick-Off Meeting - WP5ArchAIDE Kick-Off Meeting - WP5
ArchAIDE Kick-Off Meeting - WP5ArchAIDE Project
 
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...Ivan Ermilov
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Sergio Fernández
 
A researcher driven data description for the archived web: Why and how?
A researcher driven data description for the archived web: Why and how?A researcher driven data description for the archived web: Why and how?
A researcher driven data description for the archived web: Why and how?WARCnet
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016Sergio Fernández
 

Was ist angesagt? (10)

[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
 
Web Archive Research Skills and Tools Survey (WARST)
 Web Archive Research Skills and Tools Survey (WARST) Web Archive Research Skills and Tools Survey (WARST)
Web Archive Research Skills and Tools Survey (WARST)
 
Automatic creation of mappings between classification systems
Automatic creation of mappings between classification systemsAutomatic creation of mappings between classification systems
Automatic creation of mappings between classification systems
 
Ice dec04-04-sammy
Ice dec04-04-sammyIce dec04-04-sammy
Ice dec04-04-sammy
 
Automatic creation of mappings between classification systems for bibliograph...
Automatic creation of mappings between classification systems for bibliograph...Automatic creation of mappings between classification systems for bibliograph...
Automatic creation of mappings between classification systems for bibliograph...
 
ArchAIDE Kick-Off Meeting - WP5
ArchAIDE Kick-Off Meeting - WP5ArchAIDE Kick-Off Meeting - WP5
ArchAIDE Kick-Off Meeting - WP5
 
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...
Data Licensing on the Cloud - Empirical Insights and Implications for Linked ...
 
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
 
A researcher driven data description for the archived web: Why and how?
A researcher driven data description for the archived web: Why and how?A researcher driven data description for the archived web: Why and how?
A researcher driven data description for the archived web: Why and how?
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 

Andere mochten auch

Peace Powerpoint by Natasha, Kyle, Joey and Caitlin
Peace Powerpoint by Natasha, Kyle, Joey and CaitlinPeace Powerpoint by Natasha, Kyle, Joey and Caitlin
Peace Powerpoint by Natasha, Kyle, Joey and Caitlinsmuench
 
terrorism by aamish garg
terrorism by aamish gargterrorism by aamish garg
terrorism by aamish gargaamish garg
 
Terrorism-Causes and Types
Terrorism-Causes and TypesTerrorism-Causes and Types
Terrorism-Causes and TypesShaan Yaduvanshi
 

Andere mochten auch (6)

Peace Powerpoint by Natasha, Kyle, Joey and Caitlin
Peace Powerpoint by Natasha, Kyle, Joey and CaitlinPeace Powerpoint by Natasha, Kyle, Joey and Caitlin
Peace Powerpoint by Natasha, Kyle, Joey and Caitlin
 
Terrorism
TerrorismTerrorism
Terrorism
 
Cyber laws
Cyber lawsCyber laws
Cyber laws
 
terrorism by aamish garg
terrorism by aamish gargterrorism by aamish garg
terrorism by aamish garg
 
Terrorism
TerrorismTerrorism
Terrorism
 
Terrorism-Causes and Types
Terrorism-Causes and TypesTerrorism-Causes and Types
Terrorism-Causes and Types
 

Ähnlich wie Ipw slides

Lynx project presentation at ENDORSE 2021 Conference
Lynx project presentation at ENDORSE 2021 ConferenceLynx project presentation at ENDORSE 2021 Conference
Lynx project presentation at ENDORSE 2021 ConferenceLynx Project
 
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWARE
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWAREA MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWARE
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWAREvrt-medialab
 
Semantic web-and-public-data - en
Semantic web-and-public-data - enSemantic web-and-public-data - en
Semantic web-and-public-data - enTenforce
 
Audio MD Metadata Scheme
Audio MD Metadata SchemeAudio MD Metadata Scheme
Audio MD Metadata SchemeAriel Hess
 
CORE final workshop introduction
CORE final workshop introductionCORE final workshop introduction
CORE final workshop introductionCarlo Vaccari
 
IP Messenger And File Transfer over Ethernet LAN
IP Messenger And File Transfer over Ethernet LANIP Messenger And File Transfer over Ethernet LAN
IP Messenger And File Transfer over Ethernet LANdbpublications
 
Decoder Open Research Webinar
Decoder Open Research WebinarDecoder Open Research Webinar
Decoder Open Research WebinarDecoder Project
 
Electronic Common Technical Document (eCTD)
Electronic Common Technical Document (eCTD)Electronic Common Technical Document (eCTD)
Electronic Common Technical Document (eCTD)Md. Zakaria Faruki
 
The need of Interoperability in Office and GIS formats
The need of Interoperability in Office and GIS formatsThe need of Interoperability in Office and GIS formats
The need of Interoperability in Office and GIS formatsMarkus Neteler
 
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT
 
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...Samos2019Summit
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXStuart Chalk
 
Core kick off vaccari
Core kick off vaccariCore kick off vaccari
Core kick off vaccariCarlo Vaccari
 
Evaluation of Research Tools
Evaluation of Research ToolsEvaluation of Research Tools
Evaluation of Research ToolsHATS
 

Ähnlich wie Ipw slides (20)

Lynx project presentation at ENDORSE 2021 Conference
Lynx project presentation at ENDORSE 2021 ConferenceLynx project presentation at ENDORSE 2021 Conference
Lynx project presentation at ENDORSE 2021 Conference
 
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWARE
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWAREA MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWARE
A MEDIA SHARING PLATFORM BUILT WITH OPEN SOURCE SOFTWARE
 
Semantic web-and-public-data - en
Semantic web-and-public-data - enSemantic web-and-public-data - en
Semantic web-and-public-data - en
 
Audio MD Metadata Scheme
Audio MD Metadata SchemeAudio MD Metadata Scheme
Audio MD Metadata Scheme
 
Airline Data Analysis
Airline Data AnalysisAirline Data Analysis
Airline Data Analysis
 
Jon resume4
Jon resume4Jon resume4
Jon resume4
 
Tape Access Optimization With TReqS
Tape Access Optimization With TReqSTape Access Optimization With TReqS
Tape Access Optimization With TReqS
 
CORE final workshop introduction
CORE final workshop introductionCORE final workshop introduction
CORE final workshop introduction
 
IP Messenger And File Transfer over Ethernet LAN
IP Messenger And File Transfer over Ethernet LANIP Messenger And File Transfer over Ethernet LAN
IP Messenger And File Transfer over Ethernet LAN
 
Workshop on "Legislative XML
Workshop on "Legislative XMLWorkshop on "Legislative XML
Workshop on "Legislative XML
 
Decoder Open Research Webinar
Decoder Open Research WebinarDecoder Open Research Webinar
Decoder Open Research Webinar
 
[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...
[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...
[IJET V2I3P7] Authors: Muthe Sandhya, Shitole Sarika, Sinha Anukriti, Aghav S...
 
Electronic Common Technical Document (eCTD)
Electronic Common Technical Document (eCTD)Electronic Common Technical Document (eCTD)
Electronic Common Technical Document (eCTD)
 
The need of Interoperability in Office and GIS formats
The need of Interoperability in Office and GIS formatsThe need of Interoperability in Office and GIS formats
The need of Interoperability in Office and GIS formats
 
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
 
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...
3rd Session: Workshop I on Legal Informatics Services, Challenges and Ideas, ...
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSX
 
Core kick off vaccari
Core kick off vaccariCore kick off vaccari
Core kick off vaccari
 
Evaluation of Research Tools
Evaluation of Research ToolsEvaluation of Research Tools
Evaluation of Research Tools
 
cv_filustek_en_08
cv_filustek_en_08cv_filustek_en_08
cv_filustek_en_08
 

Kürzlich hochgeladen

Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )Pooja Nehwal
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubssamaasim06
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024eCommerce Institute
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...Sheetaleventcompany
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMoumonDas2
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxraffaeleoman
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Vipesco
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Kayode Fayemi
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardsticksaastr
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 

Kürzlich hochgeladen (20)

Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
 
Presentation on Engagement in Book Clubs
Presentation on Engagement in Book ClubsPresentation on Engagement in Book Clubs
Presentation on Engagement in Book Clubs
 
George Lever - eCommerce Day Chile 2024
George Lever -  eCommerce Day Chile 2024George Lever -  eCommerce Day Chile 2024
George Lever - eCommerce Day Chile 2024
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptx
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
Governance and Nation-Building in Nigeria: Some Reflections on Options for Po...
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 

Ipw slides

  • 1. EUROPEAN LEGISLATIVE RESPONSES TO INTERNATIONAL TERRORISM A Database of Laws in German Plenary Protocols
  • 2. Outline 1. Introduction 2. Xtract: a software for extraction 3. Expected results 4. Discussion
  • 4. Linking Laws and Plenary Protocols  Extract agenda items and participants‘ information from plenary protocols from terms 12 – 16  Use GESTA as an index of laws  Link laws to plenary speeches and vice versa 1 introduction
  • 5. We have ...  Plenary protocol PDFs from electoral terms 12 – 16  1990-12-10 – present  120.655 pages in 1162 documents  GESTA database of laws, terms 8 – 16 1 introduction
  • 6. We have ...  Plenary protocol PDFs from electoral terms 12 – 16  1990-12-10 – present  120.655 pages in 1162 documents  GESTA database of laws, terms 8 – 16  : ) and ambition to deliver excellent results 1 introduction
  • 7. We want to ...  Extract from 1990 up to the present time  For each plenary session  Session number, date, ...  For each item on the agenda  Descriptions  list of participants  printed matter references  speech texts  tables  Link the results with our database of laws 1 introduction
  • 8. Challanges  Older electoral terms are not digitalized  Each electoral term requires different pattern matching strategies  GESTA tables generated for the project  No consistent, direct links to plenary protocols  Course of legislation undetailed  Quality difference between older and newer terms  OCR errors  GESTA Database – no improvements possible for older terms 1 introduction
  • 10. Xtract – software for data mining  a set of modern tools to annotate plenary protocols with relevant pieces of information  preserves document layout  uses multiple strategies to mark important text blocks  location, shape and internal structure of blocks  pattern matching  Euclidean distances  statistics  comes with its own document viewer 2 software
  • 11. Xtract – implementation details  PDF access  pdftohtml (custom builds)  Acrobat Professional 9 Extended (older terms)  Data manipulation  C# 4.0: LINQ to XML  Visualization  C# 4.0: WPF (Windows Presentation Foundation)  Statistics  CORSIS: my personal open-source project for corpus analysis 2 software
  • 12. Xtract – why XML?  Simple and highly-`liquid´ file format  based on simple international standards  excellent APIs in many programming languages  converts easily into other formats  used in Microsoft Office, OpenOffice.org 2 software
  • 13. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  elements  attributes  hierarchical relations 2 software
  • 14. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  elements: event, speaker, name, is 2 software
  • 15. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  attributes: id 2 software
  • 16. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  children: event → speaker  parents: event ← speaker 2 software
  • 17. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  descendants: event → speaker, name, is 2 software
  • 18. Xtract – XML crash course  <event> <speaker id=„12“> <name>Franz Müntefering</name> <is>Bundesminister für Arbeit und Soziales</is> </speaker> </event>  siblings: name ↔ is 2 software
  • 19. Xtract – how does it function?  extracts texts from PDF files along with layout information 2 software
  • 20. Xtract – how does it function?  merges texts into proximity blocks 2 software
  • 21. Xtract – how does it function?  marks ambient constructs 2 software
  • 22. Xtract – how does it function?  marks agenda items 2 software
  • 23. Xtract – how does it function?  annotates blocks with sections they belong to 2 software
  • 25. DIGESTA  Based on `GESTA Gesamtausgaben´: terms 14 – 16  Always up-to-date  Detailed course of legislation information  Direct links to plenary protocols  Can be complemented with keywords from MZES  http://corsis.sf.net/ipw/digesta/ 3 results Done!!
  • 26. PLEDA – Plenary Protocols Database  Based on plenary protocols  Links agenda items multidirectionally with participants  Interesting for different linguistic/political research purposes 3 results
  • 27. PLEDA – Project Status 12 13 14 15 16 OCR Run X X - - - Correction - - - XML Conversion * * X X X Division C./S. X X X Block Merging * * X X X Ambient Constructs X X X Page Sections X X X Interjections * * X X X Contents * * X Speeches * * X Contents-speech links * * X 3 results
  • 28. GLIT – German Legislative Resp ... Laws • .law files • from GESTA Protocols • .pro files • from BTP GLIT • German part of ELIT 3 results
  • 30. Open questions  Project hosting  Where can we host the results?  Initial GLIT interface  Web service?  Rich client-side app?  Any questions from your side? 4 discussion