UNIVPM: Guida insegnamenti

[W001232] - COMPUTER VISION E DEEP LEARNINGCOMPUTER VISION AND DEEP LEARNING
Emanuele FRONTONI
Lingua di erogazione: ITALIANOLessons taught in: ITALIAN
Laurea Magistrale - [IM11] INGEGNERIA ELETTRONICA (Curriculum: Electronic Systems for Digital Audio Applications) Master Degree (2 years) - [IM11] ELECTRONICS ENGINEERING (Curriculum: Electronic Systems for Digital Audio Applications)
Dipartimento: [040040] Dipartimento Ingegneria dell'InformazioneDepartment: [040040] Dipartimento Ingegneria dell'Informazione
Anno di corsoDegree programme year : 1 - Secondo Semestre
Anno offertaAcademic year: 2019-2020
Anno regolamentoAnno regolamento: 2019-2020
Crediti: 9
Ore di lezioneTeaching hours: 72
TipologiaType: D - A scelta dello studente
Settore disciplinareAcademic discipline: ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI

LINGUA INSEGNAMENTO LANGUAGE

Italiano

Italian

PREREQUISITI PREREQUISITES

Fondamenti di Informatica

Fundamentals of Computer Science

MODALITÀ DI SVOLGIMENTO DEL CORSO DEVELOPMENT OF THE COURSE

Lezioni frontali sugli argomenti in programma, esercitazioni di Computer Vision e Deep Learning svolte in classe. Gli studenti dovranno possedere un computer personale sul quale eseguire gli esercizi pratici di programmazione si sistemi di elaborazione e interpretazione di immagini e video.
Il corso sarà suddiviso in lezioni (60%) ed esercitazioni (40%).

Lectures on topics scheduled, Computer Vision and Deep Learning exercises carried out in class. Students will have to own a personal computer to complete practical Computer Vision assignments.
Lectures will be divided in frontal lectures (60%) and workshops (40%).

RISULTATI DI APPRENDIMENTO ATTESI LEARNING OUTCOMES

Conoscenze e comprensione.

Acquisire basi teoriche, conoscenze metodologiche e tecnologiche nonché esperienze pratiche riguardanti l'area della Computer Vision con particolare riferimento all'acquisizione, l'elaborazione, l'analisi e la comprensione del contenuto di immagini e video. Conoscenze avanzate di metodi di Deep Learning per la Computer Vision, sia per l’estrazione di feature che per la loro classificazione.

Capacità di applicare conoscenze e comprensione.

Tramite queste conoscenze e numerosi esempi applicativi, lo studente verrà messo in grado di progettare un sistema di visione artificiale in vari ambiti applicativi: l'ispezione industriale, la sorveglianza, l'identificazione biometrica (impronte digitali, retiniche, immagini del viso, ecc.), l'analisi del movimento umano, l'analisi del territorio da immagini aeree o da satellite, la scansione 3D, la navigazione robotica.

Competenze trasversali.

L’esecuzione di un progetto su un argomento concordato con il docente e che verrà svolto in un gruppo di 3-4 studenti, contribuirà a migliorare sia il grado di autonomia di giudizio in generale, sia la capacità comunicativa che deriva anche dal lavoro in gruppo, sia la capacità di apprendimento in autonomia e di trarre conclusioni.

Knowledge and Understanding.

The course enables students to acquire the fundamental notions and advanced knowledge of Object-Oriented Programming, also through the study of a reference programming languages. This knowledge, by integrating the knowledge gained during the previous programming courses will form the insights that will enrich the understanding of the theory of languages and of the advantages of different programming paradigms. Advanced knowledge of Deep Learning methods for Computer Vision, both for the extraction of features and for their classification.

Capacity to apply Knowledge and Understanding.

At the end of the course students will be able to use properly the principles of object-oriented programming and the syntax of the language used in the course to develop software of medium complexity, characterized by flexibility, compliance with the requirements and efficiency, choosing the most appropriate data structures and algorithms for the particular problem at hand.

Transversal Skills.

The execution of a project, which will be played in groups or independently and that will lead to the drafting of a report, will help improve both the communication skills that also stems from teamwork, and autonomous learning and problem-solving skills.

PROGRAMMA PROGRAM

Introduzione alla Computer Vision

I seguenti argomenti saranno trattati attraverso lezioni teoriche (L) ed esercitazioni (E):

Pixels, Features e Camere
- Pixels e Filtri (L)
- Edge detection, RANSAC (L)
- Feature detectors, Harris (L)
- Difference of Gaussians, SIFT (L+E)
- Panorama Stitching (L+E)

Regioni di Immagini e Segmentazione
- Introduzione alla Segmentation (L)
- K-means Clustering (L)
- Feature tracking (L+E)
- Linear Classifiers & Classification (L+E)
- Visual Tracking (L+E)

Riconoscimento di facce e oggetti
- Introduzione al Riconoscimento, Nearest Neighbor Match (L+E)
- PCA e Eigenfaces (L+E)

Deep Learning per la Computer Vision
- Le reti Deep: Autoencoders, Convolutional Neural Networks, Recurrent and Recursive Networks, Practical Methodology, Performance Metrics and baseline models, Selecting hyper-parameters (L+E)
- Training di reti Deep: Objective functions, Activation Functions, Regularization, Gradient-based optimization (L+E)
- Cenni di reti generative (L+E)

Le esercitazioni saranno svolte per i vari argomenti attraverso esempi applicativi di elaborazioni di immagini, utilizzando le librerie OpenCV ed il linguaggio Python.

Introduction to Computer Vision

The following contents will be provided by frontal lectures (L) and workshops (W).

Pixels, Features, and Cameras
- Pixels and Filters (L)
- Edge detection, RANSAC (L)
- Feature detectors, Harris (L)
- Difference of Gaussians, SIFT (L+W)
- Panorama Stitching (L+W)

Regions of Images, and Segmentation
- Basic Concepts of Segmentation (L)
- K-means Clustering (L)
- Feature tracking (L+W)
- Linear Classifiers & Classification (L+W)
- Visual Tracking (L+W)

Recognizing Faces and Objects
- Basic Concepts in Recognition, Nearest Neighbor Match (L+W)
- PCA and Eigenfaces (L+W)
- Introduction to Deep Learning (L+W)

Deep Learning for Computer Vision
- Focus on Deep Networks: Autoencoders, Convolutional Neural Networks, Recurrent and Recursive Networks, Practical Methodology, Performance Metrics and baseline models, Selecting hyper-parameters (L+E)
- Training Deep Networks: Objective functions, Activation Functions, Regularization, Gradient-based optimization (L+E)
- Generative Networks: introduction (L+E)

Workshops are based on the practical use of Computer Vision libraries (i.e. OpenCV) and the Python programming language.

MODALITÀ DI SVOLGIMENTO DELL'ESAME DEVELOPMENT OF THE EXAMINATION

Modalità di valutazione dell'apprendimento.

La valutazione del livello di apprendimento degli studenti consiste in due prove:
- una prova pratica, consistente nella soluzione di esercizi di Computer Vision proposti su argomenti trattati nel corso, da completare in una o due ore;
- una prova orale, consistente in domande di approfondimento sugli esercizi della prova scritta o su qualsiasi altro argomento trattato nel corso.
La prova scritta è propedeutica alla prova orale, per accedere alla quale lo studente deve aver ottenuto almeno la sufficienza nella prova scritta.
La prova orale deve essere sostenuta nello stesso appello della prova scritta. Nel caso di esito negativo per la prova orale, lo studente deve ripetere anche la prova scritta.

Criteri di valutazione dell'apprendimento.

Per superare con esito positivo la valutazione dell'apprendimento, lo studente deve dimostrare, attraverso le prove prima descritte, di aver ben compreso i concetti esposti nel corso, possedendo quindi una conoscenza di base delle metodologie trattate nel corso, nonché avendo sviluppato una adeguata familiarità con metodi e librerie di Computer Vision al fine di completare in autonomia una prova pratica nell'ambito della visione.

Criteri di misurazione dell'apprendimento.

Ad ognuna delle prove prima indicate è assegnato un punteggio compreso tra zero e trenta. Il voto complessivo, in trentesimi, è dato dalla media dei voti ottenuti nelle due prove, con arrotondamento all'intero per eccesso.

Criteri di attribuzione del voto finale.

Perché l'esito complessivo della valutazione sia positivo, lo studente deve conseguire almeno la sufficienza, pari a diciotto punti, in ognuna delle prove prima descritte.
La valutazione massima è raggiunta dimostrando una conoscenza approfondita dei contenuti del corso nell'ambito delle prove.
La lode è riservata agli studenti che, avendo svolto tutte le prove in modo corretto e completo, abbiano dimostrato una particolare brillantezza nella esposizione orale e/o nello svolgimento di progetti software.

Learning Evaluation Methods.

The assessment of student learning consists of two parts:
- A written test, consisting in the solution of exercises on Computer Vision topics covered in the course, to be completed in an hour or two;
- An oral, consisting of in-depth questions on the written test exercises or any other topic covered in the course.
The written test is in preparation for the oral exam, access to which the student must have obtained at least a pass in the written test.
The oral examination must be supported in the same session of the written test. In case of failure of the oral exam, the student must also repeat the written test.

Learning Evaluation Criteria.

To successfully pass the assessment of learning, the student must demonstrate, through the tests described above, to possess a full understanding of the methods covered in the course, as well as to master methods and libraries of Computer Vision to solve autonomously a test on one of the field of application of the Computer Vision

Learning Measurement Criteria.

For each one of the tests specified before it is assigned a score between zero and thirty. The overall grade is the average of grades obtained in the two tests.

Final Mark Allocation Criteria.

In order for the overall outcome of the evaluation to be positive, the student must rate at least 18/30 points in each of the tests described above.
Praise is given to students who, having done all the tests correctly, have demonstrated a particular brilliance in argumentation and/or in the quality of code produced.

TESTI CONSIGLIATI RECOMMENDED READING

Digital Image Processing, Global Edition, 4/E
Rafael C. Gonzalez, University of Tennessee
Richard E. Woods, MedData Interactive
ISBN-10: 1292223049 • ISBN-13: 9781292223049
©2018 • Pearson • Paper, 1024 pp
Published 08 Nov 2017

https://learn.univpm.it

E-LEARNING E-LEARNING

https://learn.univpm.it

Scheda insegnamento erogato nell’A.A. 2019-2020
Le informazioni contenute nella presente scheda assumono carattere definitivo solo a partire dall'A.A. di effettiva erogazione dell'insegnamento. Academic year 2019-2020

Guida degli insegnamenti

Syllabus

Partially translatedTradotto parzialmente