MediaEval 2016 - BUT Zero-Cost Speech Recognition

•

1 gefällt mir•173 views

Presenter: Miroslav Skácel BUT Zero-Cost Speech Recognition 2016 System Description In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Miroslav Skácel, Martin Karafiát, Lucas Ondel, Albert Uchytil, Igor Szöke Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_48.pdf Video: https://youtu.be/0pNiLLVTa28 Abstract: This paper describes our work on developing speech recognizers for Vietnamese. It focuses on procedures to prepare provided data precisely. We aim on analysis of the textual transcriptions in particular. Methods to filter out defective data to improve performance of final system are proposed and described in detail. We also propose cleaning of other textual data used for language modeling. Several architectures are investigated to reach both sub-tasks goals. The achieved results are discussed.

Wissenschaft

ﬁt
speech fit
BUT Zero-Cost 2016 Speech Recognition
Miroslav Skácel, Martin Karaﬁát, Lucas Ondel, Albert Uchytil, Igor Szöke
BUT Speech@FIT
Brno University of Technology
Czech Republic
MediaEval 2016 Workshop, October 20-21, 2016, Hilversum, Netherlands

Overview
Our ideas for this task were:
• to use previous knowledge of low-resource languages
• to use existing systems from Babel1
• to adapt that on Vietnamese
• to follow zero-cost requirements
We ended up with:
• lots of data processing and cleaning
• 2x LVCSR sub-task systems
• 1x subword sub-task systems
• suggestions for further improvements
1https://www.iarpa.gov/index.php/research-programs/babel
1

Data preparation
• audio downsampled from 16kHz to 8kHz
• got Vietnamese alphabet from wordlist
• cleaned other characters (punctuation marks, brackets, etc.)
• numerals expanded to textual form
2

Data segmentation
• audio longer than 1 minute caused troubles
• alignment from ﬁrst training stages used
• data split when:
• silence longer than 0.5s occurs
• segment would be longer than 15s
3

Language models
3 LMs were created:
1) cleaned transcriptions from train set
2) cleaned subtitles
3) cleaned text from websites
• 3 or more words
• we combined them together
4

LVCSR systems - GMM/DNN2
2Martin Karaﬁát et al. BUT Neural Network Features for Spontaneous Vietnamese in
BABEL. In Proceedings of ICASSP 2014, pages 5659–5663. IEEE Signal Processing Society,
2014.
5

LVCSR systems - BLSTM3
• 16kHz audio, original transcriptions
• Kaldi/TNet toolkits
3Martin Karaﬁát et al. Multilingual BLSTM and Speaker-Speciﬁc Vector Adaptation in
2016 BUT Babel System. Accepted at SLT 2016, 2016.
6

Subword system
• Acoustic Unit Discovery4
(AUD) model
• unlabeled data - no transcription needed
• like phone-loop model, fully Bayesian, Dirichlet distribution
• no ﬁxed number of components like in HMM
• can learn complexity of model
4Lucas Ondel et al. Variational Inference for Acoustic Unit Discovery. In Procedia
Computer Science, volume 2016, pages 80–86. Elsevier Science, 2016.
7

Results
System
Devel [WER] Test [WER]
all (ELSA / Forvo / RhinoSpike) all (ELSA / Forvo / RhinoSpike / YouTube)
Kaldi Baseline 60.4 (45.3 / 99.8 / 73.5) 75.5 (46.7 / 98.1 / 76.2 / 97.1)
P-BUT - Babel Kaldi BLSTM 16kHz 17.9 (6.4 / 58.1 / 15.8) 48.0 (4.9 / 55.7 / 35.4 / 87.2)
L-BUT - Babel Kaldi BLSTM 16kHz - LM tune 17.6 (6.2 / 56.4 / 16.9) 46.3 (4.6 / 52.6 / 32.2 / 84.7)
L-BUT - Babel GMM/DNN 8kHz 36.1 (29.7 / 68.5 / 23.4) 55.7 (28.0 / 59.3 / 44.9 / 81.4)
System
Devel [NMI] Test [NMI]
all (ELSA / Forvo / RhinoSpike) all (ELSA / Forvo / RhinoSpike / YouTube)
Kaldi Baseline 5.48 (7.88 / 10.44 / 13.8) 4.29 (7.49 / 11.25 / 15.99 / 6.35)
P-BUT AUD phone-loop 5.08 (6.45 / 8.76 / 14.19) 4.56 (5.52 / 9.59 / 18.49 / 7.59)
• BLSTM overperformed GMM/DNN system
• exception is unseen data (YouTube test set)
8

Further work
Ideas for future experiments:
• ﬁnd better way to clean original transcriptions
• methods to detect inappropriate audio/transcriptions
• adding noise to audio for more robust system
• audio reverberation using RIR
9

Weitere ähnliche Inhalte

Andere mochten auch

This slideset presents an approach to automatically detecting breaking news events from social media streams, using event detection to collecting near real time relevant video documents from social networks regarding that breaking news. A visual analytics dashboard provides access to the results of the content processing pipeline, providing a rich interactive interface to explore emerging stories and select video material around those stories for verification.

Video Retrieval for Multimedia Verification of Breaking News on Social Networks

InVID Project

MediaEval 2016 - Emotion in Music Task: Lessons Learned

multimediaeval

The event synchronisation task addresses the problem of aligning media (i.e., photo and video) streams (“galleries”) from different users temporally and identifying coherent events in the streams. Our approach uses the visual similarity of image/key frame pairs based on full matching of SIFT descriptors with geometric verification. Based on the visual similarity and the given time information, a probabilistic algorithm is employed, where in each run a hypothesis is calculated for the set of time offsets with respect to the reference gallery. From the gathered hypotheses, the final set of time offsets is calculated as the medoid of all hypotheses. http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

MediaEval 2015 - JRS at Synchronization of Multi-user Event Media Task

multimediaeval

This paper describes the results of our participation to the Synchronization of Multi-User Event Media Task at the MediaEval 2015 challenge. Using multiple similarity measures, we identify pairs of similar media from different galleries. We use a graph-based approach to temporally synchronize user galleries; subsequently we use time information, geolocation information and visual concept detection results to cluster all photos into different sub-events. Our method achieves good accuracy on considerably diverse datasets. http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

MediaEval 2015 - CERTH at MediaEval 2015 Synchronization of Multi-User Event ...

multimediaeval

In this paper, we present the systems developed by GTMUVigo team for the Multimedia Person Discovery in Broadcast TV task at MediaEval 2015. The systems propose two different strategies for person discovery in audio through speaker diarization (one based on an online clustering strategy with error correction using OCR information and the other based on agglomerative hierarchical clustering) as well as intrashot and intershot trategies for face clustering. http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

MediaEval 2015 - GTM-UVigo Systems for Person Discovery Task at MediaEval 2015

multimediaeval

Presenter: Cynthia Liem TUD-MMC at MediaEval 2016: Predicting Media Interestingness Task In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Cynthia Liem Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_30.pdf Video: https://youtu.be/NQan10E_-kE Abstract: This working notes paper describes the TUD-MMC entry to the MediaEval 2016 Predicting Media Interestingness Task. Noting that the nature of movie trailer shots is different from that of preceding tasks on image and video interestingness, we propose two baseline heuristic approaches based on the clear occurrence of people. MAP scores obtained on the development set and test set suggest that our approaches cover a limited but non-marginal subset of the interestingness spectrum. Most strikingly, our obtained scores on the Image and Video Subtasks are comparable or better than those obtained when evaluating the ground truth annotations of the Image Subtask against the Video Subtask and vice versa

MediaEval 2016 - TUD-MMC Predicting media Interestingness Task

multimediaeval

This paper provides an overview of the Verifying Multimedia Use task that takes places as part of the 2015 MediaEval Benchmark. The task deals with the automatic detection of manipulation and misuse of Web multimedia content. Its aim is to lay the basis for a future generation of tools that could assist media professionals in the process of verification. Examples of manipulation include maliciously tampering with images and videos, e.g., splicing, removal/addition of elements, while other kinds of misuse include the reposting of previously captured multimedia content in a different context (e.g., a new event) claiming that it was captured there. For the 2015 edition of the task, we have generated and made available a large corpus of real-world cases of images that were distributed through tweets, along with manually assigned labels regarding their use, i.e. misleading (fake) versus appropriate (real). http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015

multimediaeval

Presenter: Giorgos Kordopatis-Zilos Placing Images with Refined Language Models and Similarity Search with PCA-reduced VGG Features In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Giorgos Kordopatis-Zilos, Adrian Popescu, Symeon Papadopoulos, Yiannis Kompatsiaris Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_13.pdf Video: https://youtu.be/WR4I3CWjcR4 Abstract: We describe the participation of the CERTH/CEA-LIST team in the MediaEval 2016 Placing Task. We submitted five runs to the estimation-based sub-task: one based only on text by employing a Language Model-based approach with several refinements, one based on visual content, using geospatial clustering over the most visually similar images, and three based on a hybrid scheme exploiting both visual and textual cues from the multimedia items, trained on datasets of different size and origin. The best results were obtained by a hybrid approach trained with external training data and using two publicly available gazetteers.

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...

multimediaeval

Presenter: Maigrot Cédric MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Cédric Maigrot, Vincent Claveau, Ewa Kijak, Ronan Sicre Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_45.pdf Video: https://youtu.be/ay1zWydnijY Abstract: This paper presents a multi-modal hoax detection system composed of text, source, and image analysis. As hoax can be very diverse, we want to analyze several modalities to better detect them. This system is applied in the context of the Verifying Multimedia Use task of MediaEval 2016. Experiments show the performance of each separated modality as well as their combination.

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

multimediaeval

Presenter: Bogdan Boteanu, LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-Relevance Feedback Diversification Perspective In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Bogdan Boteanu, Mihai G. Constantin, Bogdan Ionescu Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_20.pdf Video: https://youtu.be/mDI8Z31p7TY Abstract: In this paper we present the results achieved during the 2016 MediaEval Retrieving Diverse Social Images Task, using an approach based on pseudo-relevance feedback, in which human feedback is replaced by an automatic selection of images. The proposed approach is designed to have in priority the diversification of the results, in contrast to most of the existing techniques that address only the relevance. Diversification is achieved by exploiting a hierarchical clustering scheme followed by a diversification strategy. Methods are tested on the benchmarking data and results are analyzed. Insights for future work conclude the paper.

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...

multimediaeval

The objective of this paper is to provide an overview of the Synchronization of Multi-User Event Media (SEM) Task, which is part of the MediaEval Benchmark for Multimedia Evaluation. The SEM task was initially presented at MediaEval in 2014, with the goal of proposing a challenge in aligning multiple users’ photo galleries related to the same event but with unreliable timestamps. Besides aligning the pictures on a common timeline, participants were also required to detect the sub-events and cluster the pictures accordingly. For 2015 we have decided to extend the task also to other types of media, thus including audio and video information for a more complete and diversified representation of the analyzed event. http://ceur-ws.org/Vol-1436/ http://www.multimediaeval.org

MediaEval 2015 - Synchronization of Multi-User Event Media at MediaEval 2015:...

multimediaeval

Presenter: Konstantin Pogorelov Simula @ MediaEval 2016 Context of Experience Task In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Konstantin Pogorelov, Michael Riegler, Pål Halvorsen, Carsten Griwodz Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_53.pdf Video: https://youtu.be/FTIeGpHhURU Abstract: This paper presents our approach for the Context of Multimedia Experience Task of the MediaEval 2016 Benchmark. We present different analyses of the given data using different subsets of data sources and combinations of it. Our approach gives a baseline evaluation indicating that metadata approaches work well but that also visual features can provide useful information for the given problem to solve.

MediaEval 2016 - Simula Team @ Context of Experience Task

multimediaeval

Media REVEALr: A social multimedia monitoring and intelligence system for Web...

Symeon Papadopoulos

The InVID Plug-in: Web Video Verification on the Browser

InVID Project

Presenter: Mihai Gabriel Constantin LAPI at MediaEval 2016 Predicting Media Interestingness Task In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Mihai G. Constantin, Bogdan Boteanu, Bogdan Ionescu Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_23.pdf Video: https://youtu.be/4VKeMMeroG0 Abstract: This paper will present our results for the MediaEval 2016 Predicting Media Interestingness task. We proposed an approach based on video descriptors and studied several machine learning models, in order to detect the optimal configuration and combination for the descriptors and algorithms that compose our system.

MediaEval 2016: LAPI at Predicting Media Interestingness Task

multimediaeval

Presenters: Stuart E. Middleton and Christina Boididou Verifying Multimedia Use at MediaEval 2016 In Working Notes Proceedings of the MediaEval 2016 Workshop, Hilversum, Netherlands, October 20-21, CEUR-WS.org (2016) by Christina Boididou, Symeon Papadopoulos, Duc-Tien Dang-Nguyen, Giulia Boato, Michael Riegler, Stuart E. Middleton, Andreas Petlund, and Yiannis Kompatsiaris Paper: http://ceur-ws.org/Vol-1739/MediaEval_2016_paper_3.pdf Video: https://youtu.be/2Jx6OliFR-0 Abstract: This paper provides an overview of the Verifying Multimedia Use task that takes places as part of the 2016 MediaEval Benchmark. The task motivates the development of automated techniques for detecting manipulated and misleading use of web multimedia content. Splicing, tampering and reposting videos and images are examples of manipulation that are part of the task definition. For the 2016 edition of the task, a corpus of images/videos and their associated posts is made available, together with labels indicating the appearance of misuse (fake) or not (real) in each case as well as some useful post metadata.

MediaEval 2016 - Verifying Multimedia Use Task Overview

multimediaeval

Andere mochten auch (16)

Video Retrieval for Multimedia Verification of Breaking News on Social Networks

MediaEval 2016 - Emotion in Music Task: Lessons Learned

MediaEval 2015 - JRS at Synchronization of Multi-user Event Media Task

MediaEval 2015 - CERTH at MediaEval 2015 Synchronization of Multi-User Event ...

MediaEval 2015 - GTM-UVigo Systems for Person Discovery Task at MediaEval 2015

MediaEval 2016 - TUD-MMC Predicting media Interestingness Task

MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015

MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...

MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task

MediaEval 2016 - LAPI @ 2016 Retrieving Diverse Social Images Task: A Pseudo-...

MediaEval 2015 - Synchronization of Multi-User Event Media at MediaEval 2015:...

MediaEval 2016 - Simula Team @ Context of Experience Task

Media REVEALr: A social multimedia monitoring and intelligence system for Web...

The InVID Plug-in: Web Video Verification on the Browser

MediaEval 2016: LAPI at Predicting Media Interestingness Task

MediaEval 2016 - Verifying Multimedia Use Task Overview

Ähnlich wie MediaEval 2016 - BUT Zero-Cost Speech Recognition

Mediaeval 2013 Spoken Web Search results slides

Xavier Anguera

Wreck a nice beach: adventures in speech recognition

Stephen Marquard

Many syntactic treebanks and parser toolkits are developed in the past twenty years, including dependency structure parsers and phrase structure parsers. For the phrase structure parsers, they usually utilize different phrase tagsets for different languages, which results in an inconvenience when conducting the multilingual research. This paper designs a refined universal phrase tagset that contains 9 commonly used phrase categories. Furthermore, the mapping covers 25 constituent treebanks and 21 languages. The experiments show that the universal phrase tagset can generally reduce the costs in the parsing models and even improve the parsing accuracy.

PPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks

Lifeng (Aaron) Han

Preliminary study on using vector quantization latent spaces for TTS/VC syste...

Yamagishi Laboratory, National Institute of Informatics, Japan

Lenar Gabdrakhmanov (Provectus): Speech synthesis

Provectus

Presenter: Dr. Xiaoxiao Miao, NII Paper: https://arxiv.org/abs/2202.13097 Speaker anonymization aims to protect the privacy of speakers while preserving spoken linguistic information from speech. Current mainstream neural network speaker anonymization systems are complicated, containing an F0 extractor, speaker encoder, automatic speech recognition acoustic model (ASR AM), speech synthesis acoustic model and speech waveform generation model. Moreover, as an ASR AM is language-dependent, trained on English data, it is hard to adapt it into another language. In this paper, we propose a simpler self-supervised learning (SSL)-based method for language-independent speaker anonymization without any explicit language-dependent model, which can be easily used for other languages. Extensive experiments were conducted on the VoicePrivacy Challenge 2020 datasets in English and AISHELL-3 datasets in Mandarin to demonstrate the effectiveness of our proposed SSL-based language-independent speaker anonymization method.

Odyssey 2022: Language-Independent Speaker Anonymization Approach using Self-...

Yamagishi Laboratory, National Institute of Informatics, Japan

Neural Network Language Models for Candidate Scoring in Multi-System Machine...

Matīss ‎‎‎‎‎‎‎

Asr

alexisronquillo

Corpus Linguistics :Analytical Tools

Jitendra Patil

A Recorded Debating Dataset

Scott Faria

SiddhantSancheti_MediumShortStory.pptx

SiddhantSancheti1

https://telecombcn-dl.github.io/2017-dlsl/ Winter School on Deep Learning for Speech and Language. UPC BarcelonaTech ETSETB TelecomBCN. The aim of this course is to train students in methods of deep learning for speech and language. Recurrent Neural Networks (RNN) will be presented and analyzed in detail to understand the potential of these state of the art tools for time series processing. Engineering tips and scalability issues will be addressed to solve tasks such as machine translation, speech recognition, speech synthesis or question answering. Hands-on sessions will provide development skills so that attendees can become competent in contemporary data analytics tools.

End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...

Universitat Politècnica de Catalunya

Subjective comparison of_speech_enhancement_algori (1)

Priyanka Reddy

Acceptance Testing Of A Spoken Language Translation System

Michele Thomas

Speaker Dependent WaveNet Vocoder

Akira Tamamori

Doing Something We Never Could with Spoken Language Technologies_109-10-29_In...

linshanleearchive

NMR Automation

cknoxrun

Searching for the Best Machine Translation Combination

Matīss ‎‎‎‎‎‎‎

Introduction to text to speech

Bilgin Aksoy

DK Panda from Ohio State University presented this deck at the Switzerland HPC Conference. "This talk will focus on challenges in designing programming models and runtime environments for Exascale systems with millions of processors and accelerators to support various programming models. We will focus on MPI+X (PGAS - OpenSHMEM/UPC/CAF/UPC++, OpenMP, and CUDA) programming models by taking into account support for multi-core systems (KNL and OpenPower), high-performance networks, GPGPUs (including GPUDirect RDMA), and energy-awareness. Features and sample performance numbers from the MVAPICH2 libraries, will be presented." Watch the video: http://wp.me/p3RLHQ-gCb Learn more: http://hpcadvisorycouncil.com/events/2017/swiss-workshop/agenda.php Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter

High-Performance and Scalable Designs of Programming Models for Exascale Systems

inside-BigData.com

Ähnlich wie MediaEval 2016 - BUT Zero-Cost Speech Recognition (20)

Mediaeval 2013 Spoken Web Search results slides

Wreck a nice beach: adventures in speech recognition

PPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks

Preliminary study on using vector quantization latent spaces for TTS/VC syste...

Lenar Gabdrakhmanov (Provectus): Speech synthesis

Odyssey 2022: Language-Independent Speaker Anonymization Approach using Self-...

Neural Network Language Models for Candidate Scoring in Multi-System Machine...

Asr

Corpus Linguistics :Analytical Tools

A Recorded Debating Dataset

SiddhantSancheti_MediumShortStory.pptx

End-to-end Speech Recognition with Recurrent Neural Networks (D3L6 Deep Learn...

Subjective comparison of_speech_enhancement_algori (1)

Acceptance Testing Of A Spoken Language Translation System

Speaker Dependent WaveNet Vocoder

Doing Something We Never Could with Spoken Language Technologies_109-10-29_In...

NMR Automation

Searching for the Best Machine Translation Combination

Introduction to text to speech

High-Performance and Scalable Designs of Programming Models for Exascale Systems

Mehr von multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper62.pdf YouTube: https://youtu.be/gV-rvV3iFDA Pierre-Etienne Martin, Jenny Benois-Pineau, Boris Mansencal, Renaud Péteri and Julien Morlier : Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal CNN for MediaEval 2020. Proc. of MediaEval 2020, 14-15 December 2020, Online. This work presents a method for classifying table tennis strokes using spatio-temporal convolutional neural networks. The fine-grained classification is performed on trimmed video segments recorded at 120 fps with different players performing in natural conditions. From those segments, the frames are extracted, their optical flow is computed and the pose of the player is estimated. From the optical flow amplitude, a region of interest is inferred. A three stream spatio-temporal convolutional neural network using combination of those modalities and 3D attention mechanisms is presented in order to perform classification. Presented by: Pierre-Etienne Martin

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper50.pdf Hai Nguyen-Truong, San Cao, N. A. Khoa Nguyen, Bang-Dang Pham, Hieu Dao, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table Tennis Strokes Classification Task. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Sports Video Classification Tasks in the Multimedia Evaluation 2020 Challenge focuses on classifying different types of table tennis strokes in video segments. In this task, we - the HCMUS Team - perform multiple experiments, which includes a combination of models such as SlowFast, Optical Flow, DensePose, R2+1, Channel-Separated Convolutional Networks, to classify 21 types of table tennis strokes from video segments. In total, we submit eight runs corresponding to five different models with different sets of hyper-parameters in each of our models. In addition, we apply some pre-processing techniques on the dataset in order for our model to learn and classify more accurately. According to the evaluation results, one of our team's methods out-performs the other team's. In particular, our best run achieves 31.35\% global accuracy, and all of our methods show potential results in terms of local and global accuracy for action recognition tasks.

HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper2.pdf YouTube: https://youtu.be/-bRL868b8ys Pierre-Etienne Martin, Jenny Benois-Pineau, Boris Mansencal, Renaud Péteri, Laurent Mascarilla, Jordan Calandre and Julien Morlier : Sports Video Classification: Classification of Strokes in Table Tennis for MediaEval 2020. Proc. of MediaEval 2020, 14-15 December 2020, Online. Fine-grained action classification has raised new challenges compared to classical action classification problems. Sport video analysis is a very popular research topic, due to the variety of application areas, ranging from multimedia intelligent devices with user-tailored digests, up to analysis of athletes' performances. Running since 2019 as a part of MediaEval, we offer a task which consists in classifying table tennis strokes from videos recorded in natural conditions at the University of Bordeaux. The aim is to build tools for teachers, coaches and players to analyse table tennis games. Such tools could lead to an automatic profiling of the player and adaptation of his training for improving his/her sport skills more efficiently. Presented by: Pierre-Etienne Martin

Sports Video Classification: Classification of Strokes in Table Tennis for Me...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper61.pdf YouTube: https://youtu.be/brmI4g3jLS4 Ricardo Kleinlein, Cristina Luna-Jiménez, Fernando Fernández-Martínez and Zoraida Callejas : Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention and LSTM Models. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper reports on the GTH-UPM team experience in the Predicting Media Memorability task at MediaEval 2020. Teams were requested to predict memorability scores at both short-term and long-term, understanding such score as a measure of whether a video was perdurable in a viewer's memory or not. Our proposed system relies on a late fusion of the scores predicted by three sequential models, each trained over a different modality: video captions, aural embeddings and visual optical flow-based vectors. Whereas single-modality models show a low or zero Spearman correlation coefficient value, their combination considerably boosts performance over development data up to 0.2 in the short-term memorability prediction subtask and 0.19 in the long-term subtask. However, performance over test data drops to 0.016 and -0.041, respectively. Presented by: Ricardo Kleinlein

Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper52.pdf Janadhip Jacutprakart, Rukiye Savran Kiziltepe, John Q. Gan, Giorgos Papanastasiou and Alba G. Seco de Herrera : Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper, we present the methods of approach and the main results from the Essex NLIP Team’s participation in the MediEval 2020 Predicting Media Memorability task. The task requires participants to build systems that can predict short-term and long-term memorability scores on real-world video samples provided. The focus of our approach is on the use of colour-based visual features as well as the use of the video annotation meta-data. In addition, hyper-parameter tuning was explored. Besides the simplicity of the methodology, our approach achieves competitive results. We investigated the use of different visual features. We assessed the performance of memorability scores through various regression models where Random Forest regression is our final model, to predict the memorability of videos.

Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper6.pdf YouTube: https://youtu.be/ySGGu_4vaxs Alba García Seco De Herrera, Rukiye Savran Kiziltepe, Jon Chamberlain, Mihai Gabriel Constantin, Claire-Hélène Demarty, Faiyaz Doctor, Bogdan Ionescu and Alan F. Smeaton : Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a Video Memorable? Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper describes the MediaEval 2020 Predicting Media Memorability task. After first being proposed at MediaEval 2018, the Predicting Media Memorability task is in its 3rd edition this year, as the prediction of short-term and long-term video memorability (VM) remains a challenging task. In 2020, the format remained the same as in previous editions. This year the videos are a subset of the TRECVid 2019 Video to Text dataset, containing more action rich video content as compare with the 2019 task. In this paper a description of some aspects of this task is provided, including its main characteristics, a description of the collection, the ground truth dataset, evaluation metrics and the requirements for the run submission. Presented by: Rukiye Savran Kiziltepe

Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper45.pdf Benoit Bonnet, Teddy Furon and Patrick Bas : Fooling an Automatic Image Quality Estimator. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper we present our work on the 2020 MediaEval task: Pixel "Privacy: Quality Camouflage for Social Images". Blind Image Quality Assessment (BIQA) is a classifier that for any given image will return a quality score. Our task is to modify an image to decrease its BIQA score while maintaining a good perceived quality. Since BIQA is a deep neural network, we worked on an adversarial attack approach of the problem.

Fooling an Automatic Image Quality Estimator

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper16.pdf YouTube: https://youtu.be/ix_b9K7j72w Zhengyu Zhao : Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable Color Filter. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper presents the submission of our RU-DS team to the Pixel Privacy Task 2020. We propose to fool the blind image quality assessment model by transforming images based on optimizing a human-understandable color filter. In contrast to the common work that relies on small, $L_p$-bounded additive pixel perturbations, our approach yields large yet smooth perturbations. Experimental results demonstrate that in the specific context of this task, our approach is able to achieve strong adversarial effects, but has to sacrifice the image appeal. Presented by: Zhengyu Zhao

Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper77.pdf YouTube: https://youtu.be/8Rr4KknGSac Zhuoran Liu, Zhengyu Zhao, Martha Larson and Laurent Amsaleg : Pixel Privacy: Quality Camouflage for Social Images. Proc. of MediaEval 2020, 14-15 December 2020, Online. High-quality social images shared online can be misappropriated for unauthorized goals, where the quality filtering step is commonly carried out by automatic Blind Image Quality Assessment (BIQA) algorithms. Pixel Privacy benchmarks privacy-protective approaches that protect privacy-sensitive images against unethical computer vision algorithms. In the 2020 task, participants are encouraged to develop camouflage methods that can effectively decrease the BIQA quality score of high-quality images and maintain image appeal. The camouflaged images need to be either imperceptible to the human eye, or it can be a visible enhancement. Presented by: Zhuoran Liu

Pixel Privacy: Quality Camouflage for Social Images

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper73.pdf YouTube: https://youtu.be/TadJ6y7xZeA Thuc Nguyen-Quang, Tuan-Duy Nguyen, Thang-Long Nguyen-Ho, Anh-Kiet Duong, Xuan-Nhat Hoang, Vinh-Thuyen Nguyen-Truong, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching. Proc. of MediaEval 2020, 14-15 December 2020, Online. Matching text and images based on their semantics has an important role in cross-media retrieval. However, text and images in articles have a complex connection. In the context of MediaEval 2020 Challenge, we propose three multi-modal methods for mapping text and images of news articles to the shared space in order to perform efficient cross-retrieval. Our methods show systemic improvement and validate our hypotheses, while the best-performed method reaches a recall@100 score of 0.2064. Presented by: Thuc Nguyen-Quang

HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper72.pdf Sabarinathan D and Suganya Ramamoorthy : Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attention Unit. Proc. of MediaEval 2020, 14-15 December 2020, Online. Colorectal cancer is the third most common cause of cancer worldwide. In the era of medical Industry, identifying colorectal cancer in its early stages has been a challenging problem. Inspired by these issues, the main objective of this paper is to develop a Multi supervision net algorithm for segmenting polys on a comprehensive dataset. The risk of colorectal cancer could be reduced by early diagnosis of poly during a colonoscopy. The disease and their symptoms are highly varying and always a need for a continuous update of knowledge for the doctors and medical analyst. The diseases fall into different categories and a small variation of symptoms may lead to higher rate of risk. We have taken Medico polyp challenge dataset, which consists of 1000 segmented polyp images from gastrointestinal track. We proposed an efficient Net B4 as a pre-trained architecture in multi-supervision net. The model is trained with multiple output layers. We present quantitative results on colorectal dataset to evaluate the performance and achieved good results in all the performance metrics. The experimental results proved that the proposed model is robust and provides a good level of accuracy in segmenting polyps on a comprehensive dataset for different metrics such as Dice coefficient, Recall, Precision and F2.

Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper47.pdf YouTube: https://youtu.be/vMsM4zg2-JY Tien-Phat Nguyen, Tan-Cong Nguyen, Gia-Han Diep, Minh-Quan Le, Hoang-Phuc Nguyen-Dinh, Hai-Dang Nguyen and Minh-Triet Tran : HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ for Polyps Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Medico task, MediaEval 2020, explores the challenge of building accurate and high-performance algorithms to detect all types of polyps in endoscopic images. We proposed different approaches leveraging the advantages of either ResUnet++ or PraNet model to efficiently segment polyps in colonoscopy images, with modifications on the network structure, parameters, and training strategies to tackle various observed characteristics of the given dataset. Our methods outperform the other teams' methods, for both accuracy and efficiency. After the evaluation, we are at top 2 for task 1 (with Jaccard index of 0.777, best Precision and Accuracy scores) and top 1 for task 2 (with 67.52 FPS and Jaccard index of 0.658).

HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper31.pdf Syed Muhammad Faraz Ali, Muhammad Taha Khan, Syed Unaiz Haider, Talha Ahmed, Zeshan Khan and Muhammad Atif Tahir : Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Intestinal Tract. Proc. of MediaEval 2020, 14-15 December 2020, Online. Identification of polyps in endoscopic images is critical for the diagnosis of colon cancer. Finding the exact shape and size of polyps requires the segmentation of endoscopic images. This research explores the advantage of using depth-wise separable convolution in the atrous convolution of the ResUNet++ architecture. Deep atrous spatial pyramid pooling was also implemented on the ResUNet++ architecture. The results show that architecture with separable convolution has a smaller size and fewer GFLOPs without degrading the performance too much.

Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper22.pdf Debapriya Banik and Debotosh Bhattacharjee : Deep Conditional Adversarial learning for polyp Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. This approach has addressed the Medico automatic polyp segmentation challenge which is a part of Mediaeval 2020. We have proposed a deep conditional adversarial learning based network for the automatic polyp segmentation task. The network comprises of two interdependent models namely a generator and a discriminator. The generator network is a FCN employed for the prediction of the polyp mask while the discriminator enforces the segmentation to be as similar as the real segmented mask (ground truth). Our proposed model achieved a comparative result on the test dataset provided by the organizers of the challenge.

Deep Conditional Adversarial learning for polyp Segmentation

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper21.pdf Hwang Maxwell, Wu Cai, Hwang Kao-Shing, Xu Yong Si and Wu Chien-Hsing : A Temporal-Spatial Attention Model for Medical Image Detection. Proc. of MediaEval 2020, 14-15 December 2020, Online. A local region model with attentive temporal-spatial pathways is proposed for automatically learning various target structures. The attentive spatial pathway highlights the salient region to generate bounding boxes and ignores irrelevant regions in an input image. The proposed attention mechanism allows efficient object localization and the overall predictive performance is increased because there are fewer false positives for the object detection task for medical images with manual annotations. The experimental results show that proposed models consistently increase the base architectures' predictive performance for different datasets and training sizes without undue computational efficiency.

A Temporal-Spatial Attention Model for Medical Image Detection

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper20.pdf YouTube: https://youtu.be/CVelQl5Luf0 Quoc-Huy Trinh, Minh-Van Nguyen, Thiet-Gia Huynh and Minh-Triet Tran : HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Network and UNet for Polyps Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. The Medico: Multimedia Task focuses on developing an efficient and accurate framework to computer-aided diagnosis systems for automatic polyp segmentation to detect all types of polyps in endoscopic images of the gastrointestinal (GI) tract. We are HCMUS-team approach a solution, which includes combination Residual module, Inception module, Adaptive Convolutional neural network with Unet model and PraNet to semantic segmentation all types of polyps in endoscopic images. We submit multiple runs with different architecture and parameters in our model. Our methods show potential results in accuracy and efficiency through multiple experiments.

HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper15.pdf Rabindra Khadka : Transfer of Knowledge: Fine-tuning for Polyp Segmentation with Attention. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper describes how the transfer of prior knowledge can effectively take on segmentation tasks with the help of attention mechanisms. The UNet model pretrained on brain MRI dataset was fine-tuned with the polyp dataset. Attention mechanism was integrated to focus on relevant regions in the input images. The implemented architecture is evaluated on 200 validation images based on intersection over union and dice score between groundtruth and predicted region. The model demonstrates a promising result with computational efciency.

Fine-tuning for Polyp Segmentation with Attention

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper12.pdf Adrian Krenzer and Frank Puppe : Bigger Networks are not Always Better: Deep Convolutional Neural Networks for Automated Polyp Segmentation. Proc. of MediaEval 2020, 14-15 December 2020, Online. This paper presents our team's (AI-JMU) approach to the Medico automated polyp segmentation challenge. We consider deep convolutional neural networks to be well suited for this task. To determine the best architecture we test and compare state of the art backbones and two different heads. Finally we achieve a Jaccard index of 73.74\% on the challenge test set. We further demonstrate that bigger networks do not always perform better. However the growing network size always increases the computational complexity.

Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper51.pdf Amel Ksibi, Amina Salhi, Ala Alluhaidan and Sahar A. El-Rahman : Insights for wellbeing: Predicting Personal Air Quality Index using Regression Approach. Proc. of MediaEval 2020, 14-15 December 2020, Online. Providing air pollution information to individuals enables them to understand the air quality of their living environments. Thus, the association between people’s wellbeing and the properties of the surrounding environment is an essential area of investigation. This paper proposes Air Quality Prediction through harvesting public/open data and leveraging them to get the Personal Air Quality index. These are usually incomplete. To cope with the problem of missing data, we applied the KNN imputation method. To predict Personal Air Quality Index, we apply a voting regression approach based on three base regressors which are Gradient Boosting regressor, Random Forest regressor, and linear regressor. Evaluating the experimental results using the RMSE metric, we got an average score of 35.39 for Walker and 51.16 for Car.

Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...

multimediaeval

Paper: http://ceur-ws.org/Vol-2882/paper40.pdf YouTube: https://youtu.be/SL5Hvu1mARY Trung-Quan Nguyen, Dang-Hieu Nguyen and Loc Tai Tan Nguyen : Use Visual Features From Surrounding Scenes to Improve Personal Air Quality Data Prediction Performance. Proc. of MediaEval 2020, 14-15 December 2020, Online. In this paper, we propose a method to predict the personal air quality index in an area by using the combination of the levels of the following pollutants: PM2.5, NO2, and O3, measured from the nearby weather stations of that area, and the photos of surrounding scenes taken at that area. Our approach uses the Inverse Distance Weighted (IDW) technique to estimate the missing air pollutant levels and then use regression to integrate visual features from taken photos to optimize the predicted values. After that, we can use those values to calculate the Air Quality Index (AQI). The results show that the proposed method may not improve the performance of the prediction in some cases.

Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

multimediaeval

Mehr von multimediaeval (20)

Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...

HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...

Sports Video Classification: Classification of Strokes in Table Tennis for Me...

Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...

Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task

Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...

Fooling an Automatic Image Quality Estimator

Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...

Pixel Privacy: Quality Camouflage for Social Images

HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching

Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...

HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...

Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...

Deep Conditional Adversarial learning for polyp Segmentation

A Temporal-Spatial Attention Model for Medical Image Detection

HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...

Fine-tuning for Polyp Segmentation with Attention

Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...

Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...

Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

Kürzlich hochgeladen

Just Call Vip call girls Srinagar Jammu Kashmir Escorts ☎️ 8617697112 Starting From 5K to 15K High Profile Escorts In Srinagar Jammu Kashmir ❤Personal Whatsapp Number Jammu Kashmir Call Girls 8617697112 💦✅. There are a number of Srinagar Jammu Kashmir Escorts willing to meet you at an affordable rate, which also possesses high moral standards and humanitarian tendencies. These girls can help satisfy the sexual desires of clients without fail; it is therefore essential that clients select an established service. Our services feature various packages at competitive rates: One shot: ₹2000/in-call, ₹5000/out-call Two shots with one girl: ₹3500/in-call, ₹6000/out-call Body to body massage with sex: ₹3000/in-call Full night for one person: ₹7000/in-call, ₹10000/out-call

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.

Nitya salvi

Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency

Sheetal Arora

Wepresent Atacama Large Millimeter/submillimeter Array 12-m, 7-m, and Total Power Array observations of the FUOrionis outbursting system, covering spatial scales ranging from 160 to 25,000 au. The high-resolution interferometric data reveal an elongated 12CO(2–1) feature previously observed at lower resolution in 12CO(3–2). Kinematic modeling indicates that this feature can be interpreted as an accretion streamer feeding the binary system. The mass infall rate provided by the streamer is significantly lower than the typical stellar accretion rates (even in quiescent states), suggesting that this streamer alone is not massive enough to sustain the enhanced accretion rates characteristic of the outbursting class prototype. The observed streamer may not be directly linked to the current outburst, but rather a remnant of a previous, more massive streamer that may have contributed enough to the disk mass to render it unstable and trigger the FU Orionis outburst. The new data detect, for the first time, a vast, slow-moving carbon monoxide molecular outflow emerging from this object. To accurately assess the outflow properties (mass, momentum, and kinetic energy), we employ 13CO(2–1) data to correct for optical depth effects. The analysis indicates that the outflow corresponds to swept-up material not associated with the current outburst, similar to the slow molecular outflows observed around other FUor and Class I protostellar objects.

Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...

Sérgio Sacani

Pulmonary drug delivery system M.pharm -2nd sem P'ceutics

sakshisoni2385

GBSN - Microbiology (Unit 1)

Areesha Ahmad

Botany is the branch of biology that deals with the scientific study of plants, including their structure, growth, reproduction, metabolism, development, and classification. It encompasses a wide range of topics, from the molecular biology of plant cells to the ecological relationships between plants and their environments. Botany is essential for understanding plant diversity, ecology, evolution, and the roles plants play in ecosystems and human societies.

Botany 4th semester series (krishna).pdf

Sumit Kumar yadav

GBSN - Biochemistry (Unit 1)

Areesha Ahmad

We explore different scenarios to explain the chemical difference found in the remarkable giant-giant binary system HD138202+CD−3012303. For the first time, we suggest how to distinguish these scenarios by taking advantage of the extensive convective envelopes of giant stars. Methods. We carried out a high-precision determination of stellar parameters and abundances by applying a full line-by-line differential analysis on GHOST high-resolution spectra. We used the FUNDPAR program with ATLAS12 model atmospheres and specific opacities calculated for an arbitrary composition through a doubly iterated method. Physical parameters were estimated with the isochrones package and evolutionary tracks were calculated via MIST models. Results. We found a significant chemical difference between the two stars (∆[Fe/H]∼0.08dex), which is largely unexpected considering the insensitivity of giant stars to planetary ingestion and diffusion effects. We tested the possibility of engulfment events by using several different combinations of stellar mass, ingested mass, metallicity of the engulfed object and different convective envelopes. However, the planetary ingestion scenario does not seem to explain the observed differences. For the first time, we distinguished the source of chemical differences using a giant-giant binary system. By ruling out other possible scenarios such as planet formation and evolutionary effects between the two stars, we suggest that primordial inhomogeneities might explain the observed differences. This remarkable result implies that the metallicity differences that were observed in at least some main-sequence binary systems might be related to primordial inhomogeneities rather than engulfment events. We also discuss the important implications of finding primordial inhomogeneities, which affect chemical tagging and other fields such as planet formation. We strongly encourage the use of giantgiant pairs. They are a relevant complement to main-sequence pairs for determining the origin of the observed chemical differences in multiple systems.

Disentangling the origin of chemical differences using GHOST

Sérgio Sacani

Forensic Biology & Its biological significance.pdf

rohankumarsinghrore1

RecoveringancientrecordsofEarth'smagneticfieldisessentialfordeterminingtheroleofthemagnetosphereinprotectingearlyEarthfromcosmicradiationandatmosphericescape.WepresentpaleomagneticfieldtestshintingthatarecordofEarth's3.7‐billion‐year(Ga)oldmagneticfieldmaybepreservedinthenortheasternIsuaSupracrustalBeltasachemicalremanentmagnetizationacquiredduringamphibolite‐grademetamorphisminthebandedironformation.MultiplepetrologicalandgeochronologicallinesofevidenceindicatethatthenorthernmostpartofIsuahasnotexperiencedmetamorphictemperaturesexceeding380°CsincetheEoarchean,suggestingtherockshavenotbeensignificantlyheatedsincemagnetizationwasacquired.Weuse“pseudo”bakedcontacttests(intrusionsemplaced3.26–3.5Gaago)andafoldtest(folding3.6Gaago)todemonstratethatsomesamplespreserveaca.3.7Garecordofthemagneticfield.Werecoverafieldstrengthof>15μT.ThissuggeststhatEarth'smagneticfieldmayhavebeenweakenoughtoenhanceatmosphericescapeduringtheArchean

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...

Sérgio Sacani

Context. Determining the size distribution of asteroids is key to understanding the collisional history and evolution of the inner Solar System. Aims. We aim to improve our knowledge of the size distribution of small asteroids in the main belt by determining the parallaxes of newly detected asteroids in the Hubble Space Telescope (HST) archive and subsequently their absolute magnitudes and sizes. Methods. Asteroids appear as curved trails in HST images because of the parallax induced by the fast orbital motion of the spacecraft. Taking into account the trajectory of this latter, the parallax effect can be computed to obtain the distance to the asteroids by fitting simulated trajectories to the observed trails. Using distance, we can obtain the absolute magnitude of an object and an estimation of its size assuming an albedo value, along with some boundaries for its orbital parameters. Results. In this work, we analyse a set of 632 serendipitously imaged asteroids found in the ESA HST archive. Images were captured with the ACS/WFC and WFC3/UVIS instruments. A machine learning algorithm (trained with the results of a citizen science project) was used to detect objects in these images as part of a previous study. Our raw data consist of 1031 asteroid trails from unknown objects, not matching any entries in the Minor Planet Center (MPC) database using their coordinates and imaging time. We also found 670 trails from known objects (objects featuring matching entries in the MPC). After an accuracy assessment and filtering process, our analysed HST asteroid set consists of 454 unknown objects and 178 known objects. We obtain a sample dominated by potential main belt objects featuring absolute magnitudes (H) mostly between 15 and 22 mag. The absolute magnitude cumulative distribution logN(H > H0) ∝ αlog(H0) confirms the previously reported slope change for 15 < H < 18, from α ≈ 0.56 to α ≈ 0.26, maintained in our case down to absolute magnitudes of around H ≈ 20, and therefore expanding the previous result by approximately two magnitudes. Conclusions. HST archival observations can be used as an asteroid survey because the telescope pointings are statistically randomly oriented in the sky and cover long periods of time. They allow us to expand the current best samples of astronomical objects at no extra cost in regard to telescope time.

Hubble Asteroid Hunter III. Physical properties of newly found asteroids

Sérgio Sacani

Bollworms are among the most damaging pests in cotton cultivation, affecting the bolls where the cotton fibers are formed. There are several species of bollworms, each capable of causing significant yield loss and quality degradation if not effectively managed. Here’s a detailed look at the primary bollworm species affecting cotton: Cotton Bollworm (Helicoverpa armigera): Also known as the corn earworm or the Old World bollworm, this pest is found in many regions around the world. It is highly polyphagous (feeds on many different plants) and poses a threat not only to cotton but also to maize, tomatoes, and legumes. The larvae bore into the cotton bolls, feeding on the developing seeds and fibers, which can lead to boll rot. Pink Bollworm (Pectinophora gossypiella): A significant pest of cotton, the pink bollworm larvae infest the cotton bolls, feeding on the seeds and lint. This can severely damage or destroy the bolls. In regions where pink bollworms are prevalent, they have been a major driver for the adoption of genetically engineered Bt cotton, which expresses a bacterium gene toxic to certain insects. Tobacco Budworm (Heliothis virescens): Closely related to the cotton bollworm, the tobacco budworm primarily attacks tobacco but is also a common pest in cotton. It primarily damages the flowers and bolls of the cotton plant. Differentiating between the tobacco budworm and the cotton bollworm based on appearance can be challenging, but it is crucial for effective management. American Bollworm (Helicoverpa zea): Known in some regions as the corn earworm, it is similar in behavior to Helicoverpa armigera and poses a threat to a variety of crops, including cotton. The larvae attack the cotton bolls, leading to direct damage to the cotton lint and seeds. Management Strategies: Cultural Controls: Crop rotation, destruction of crop residues, and deep plowing can help break the pest’s life cycle. Timing of planting can also be adjusted to avoid peak pest infestation. Biological Controls: Natural enemies like Trichogramma wasps, which parasitize bollworm eggs, and predators such as lacewings and ladybugs can be encouraged. Bacillus thuringiensis (Bt) products can also be sprayed, which are particularly effective against young larvae. Chemical Controls: Insecticides may be required when infestation levels exceed economic thresholds. However, resistance management must be considered, alternating modes of action to avoid developing resistance. Genetic Approaches: Bt cotton, genetically modified to express Bacillus thuringiensis toxin, has been highly effective in controlling bollworms and has dramatically reduced the reliance on chemical insecticides. Monitoring and Scouting: Regular field scouting and using pheromone traps to monitor adult populations can help in timely and targeted application of control measures. The effective management of bollworms often requires an integrated approach

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf

PirithiRaju

Biological Classification BioHack (3).pdf

muntazimhurra

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...

Lokesh Kothari

COST ESTIMATION FOR A RESEARCH PROJECT.pptx

FarihaAbdulRasheed

Recombination DNA Technology (Nucleic Acid Hybridization )

aarthirajkumar25

CELL -Structural and Functional unit of life.pdf

Nistarini College, Purulia (W.B) India

Context. WASP-76 b has been a recurrent subject of study since the detection of a signature in high-resolution transit spectroscopy data indicating an asymmetry between the two limbs of the planet. The existence of this asymmetric signature has been confirmed by multiple studies, but its physical origin is still under debate. In addition, it contrasts with the absence of asymmetry reported in the infrared (IR) phase curve. Aims. We provide a more comprehensive dataset of WASP-76 b with the goal of drawing a complete view of the physical processes at work in this atmosphere. In particular, we attempt to reconcile visible high-resolution transit spectroscopy data and IR broadband phase curves. Methods. We gathered 3 phase curves, 20 occultations, and 6 transits for WASP-76 b in the visible with the CHEOPS space telescope. We also report the analysis of three unpublished sectors observed by the TESS space telescope (also in the visible), which represents 34 phase curves. Results. WASP-76 b displays an occultation of 260±11 and 152±10 ppm in TESS and CHEOPS bandpasses respectively. Depending on the composition assumed for the atmosphere and the data reduction used for the IR data, we derived geometric albedo estimates that range from 0.05 ± 0.023 to 0.146 ± 0.013 and from <0.13 to 0.189 ± 0.017 in the CHEOPS and TESS bandpasses, respectively. As expected from the IR phase curves, a low-order model of the phase curves does not yield any detectable asymmetry in the visible either. However, an empirical model allowing for sharper phase curve variations offers a hint of a flux excess before the occultation, with an amplitude of ∼40 ppm, an orbital offset of ∼−30◦ , and a width of ∼20◦ . We also constrained the orbital eccentricity of WASP-76 b to a value lower than 0.0067, with a 99.7% confidence level. This result contradicts earlier proposed scenarios aimed at explaining the asymmetry observed in high-resolution transit spectroscopy. Conclusions. In light of these findings, we hypothesise that WASP-76 b could have night-side clouds that extend predominantly towards its eastern limb. At this limb, the clouds would be associated with spherical droplets or spherically shaped aerosols of an unknown species, which would be responsible for a glory effect in the visible phase curves.

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b

Sérgio Sacani

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...

ssifa0344

Biopesticide (2).pptx .This slides helps to know the different types of biop...

RohitNehra6

Kürzlich hochgeladen (20)

❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.

Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency

Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...

Pulmonary drug delivery system M.pharm -2nd sem P'ceutics

GBSN - Microbiology (Unit 1)

Botany 4th semester series (krishna).pdf

GBSN - Biochemistry (Unit 1)

Disentangling the origin of chemical differences using GHOST

Forensic Biology & Its biological significance.pdf

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...

Hubble Asteroid Hunter III. Physical properties of newly found asteroids

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf

Biological Classification BioHack (3).pdf

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...

COST ESTIMATION FOR A RESEARCH PROJECT.pptx

Recombination DNA Technology (Nucleic Acid Hybridization )

CELL -Structural and Functional unit of life.pdf

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b

TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...

Biopesticide (2).pptx .This slides helps to know the different types of biop...

MediaEval 2016 - BUT Zero-Cost Speech Recognition

1. ﬁt speech fit BUT Zero-Cost 2016 Speech Recognition Miroslav Skácel, Martin Karaﬁát, Lucas Ondel, Albert Uchytil, Igor Szöke BUT Speech@FIT Brno University of Technology Czech Republic MediaEval 2016 Workshop, October 20-21, 2016, Hilversum, Netherlands

2. Overview Our ideas for this task were: • to use previous knowledge of low-resource languages • to use existing systems from Babel1 • to adapt that on Vietnamese • to follow zero-cost requirements We ended up with: • lots of data processing and cleaning • 2x LVCSR sub-task systems • 1x subword sub-task systems • suggestions for further improvements 1https://www.iarpa.gov/index.php/research-programs/babel 1

3. Data preparation • audio downsampled from 16kHz to 8kHz • got Vietnamese alphabet from wordlist • cleaned other characters (punctuation marks, brackets, etc.) • numerals expanded to textual form 2

4. Data segmentation • audio longer than 1 minute caused troubles • alignment from ﬁrst training stages used • data split when: • silence longer than 0.5s occurs • segment would be longer than 15s 3

5. Language models 3 LMs were created: 1) cleaned transcriptions from train set 2) cleaned subtitles 3) cleaned text from websites • 3 or more words • we combined them together 4

6. LVCSR systems - GMM/DNN2 2Martin Karaﬁát et al. BUT Neural Network Features for Spontaneous Vietnamese in BABEL. In Proceedings of ICASSP 2014, pages 5659–5663. IEEE Signal Processing Society, 2014. 5

7. LVCSR systems - BLSTM3 • 16kHz audio, original transcriptions • Kaldi/TNet toolkits 3Martin Karaﬁát et al. Multilingual BLSTM and Speaker-Speciﬁc Vector Adaptation in 2016 BUT Babel System. Accepted at SLT 2016, 2016. 6

8. Subword system • Acoustic Unit Discovery4 (AUD) model • unlabeled data - no transcription needed • like phone-loop model, fully Bayesian, Dirichlet distribution • no ﬁxed number of components like in HMM • can learn complexity of model 4Lucas Ondel et al. Variational Inference for Acoustic Unit Discovery. In Procedia Computer Science, volume 2016, pages 80–86. Elsevier Science, 2016. 7

9. Results System Devel [WER] Test [WER] all (ELSA / Forvo / RhinoSpike) all (ELSA / Forvo / RhinoSpike / YouTube) Kaldi Baseline 60.4 (45.3 / 99.8 / 73.5) 75.5 (46.7 / 98.1 / 76.2 / 97.1) P-BUT - Babel Kaldi BLSTM 16kHz 17.9 (6.4 / 58.1 / 15.8) 48.0 (4.9 / 55.7 / 35.4 / 87.2) L-BUT - Babel Kaldi BLSTM 16kHz - LM tune 17.6 (6.2 / 56.4 / 16.9) 46.3 (4.6 / 52.6 / 32.2 / 84.7) L-BUT - Babel GMM/DNN 8kHz 36.1 (29.7 / 68.5 / 23.4) 55.7 (28.0 / 59.3 / 44.9 / 81.4) System Devel [NMI] Test [NMI] all (ELSA / Forvo / RhinoSpike) all (ELSA / Forvo / RhinoSpike / YouTube) Kaldi Baseline 5.48 (7.88 / 10.44 / 13.8) 4.29 (7.49 / 11.25 / 15.99 / 6.35) P-BUT AUD phone-loop 5.08 (6.45 / 8.76 / 14.19) 4.56 (5.52 / 9.59 / 18.49 / 7.59) • BLSTM overperformed GMM/DNN system • exception is unseen data (YouTube test set) 8

10. Further work Ideas for future experiments: • ﬁnd better way to clean original transcriptions • methods to detect inappropriate audio/transcriptions • adding noise to audio for more robust system • audio reverberation using RIR 9

11. Thanks for your attention! 9

MediaEval 2016 - BUT Zero-Cost Speech Recognition

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (16)

Ähnlich wie MediaEval 2016 - BUT Zero-Cost Speech Recognition

Ähnlich wie MediaEval 2016 - BUT Zero-Cost Speech Recognition (20)

Mehr von multimediaeval

Mehr von multimediaeval (20)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

MediaEval 2016 - BUT Zero-Cost Speech Recognition