|
اُردو / English |
[ Courses Offered ]
[ CLE Post-Graduate Certificates ]
|
|
[ 2015 ]
[ 2016 ]
[ 2017 ]
[ 2019 ]
[ 2024 ]
|
|
CLE Post-Graduate Certificate in Text to Speech Synthesis & Speech Corpus Development |
|
Date: |
15th - 16th November & 22nd - 23rd, November, 2024 |
|
Venue: |
Class room #1, Main Auditorium, University of Engineering & Technology (UET), Lahore |
Registration Deadline: |
28th October, 2024 |
|
|
|
|
|
| |
|
Introduction |
Center for Language Engineering (CLE), Al-Khawarizmi Institute of Computer Science (KICS), University of Engineering and Technology (UET), in collaboration with Society for Natural Language Processing (SNLP), is conducting CLE Post-Graduate Certificate in “Text to Speech Synthesis & Speech Corpus Development”. This program is specifically designed for faculty and researchers from disciplines including Computer Science, Electrical Engineering (Language Processing), Artificial Intelligence, Data Science, Applied Linguistics and related areas and is organized on weekends to encourage industry and academia participation.
|
Coverage
TTS Synthesis:
- Hands-On Development of Text-to-Speech System
- Training of Tacotron 2 Synthesizer using Speech Corpus
- Subjective Evaluation of the Synthesized Speech
Speech Corpus Development:
- Design of Speech Corpus
- Review of Vocalic and Consonantal sounds in Urdu\English
- Urdu Phonetic Inventory and Lexicon
- Temporal and Spectral Analysis of Urdu Audios using PRAAT
Target Audience
This training is targeted for faculty and researchers in the following areas:
- Computer Science (Speech Processing)
- Electrical and Computer Engineering (Speech Processing)
- Artificial Intelligence
- Data Science
- Applied Linguistics
Program
Day 1: Friday-15th November, 2024
Time Duration |
Task |
12:00 - 02:00 |
Registration |
02:00 - 03:00 |
Introduction to Text-to-Speech (TTS) system |
03:00 - 03:15 |
Design of TTS speech corpus |
03:15 - 03:30 |
Design and development of TTS text corpus |
03:30 - 04:00 |
Lab session on text corpus development |
04:00 - 04:15 |
Tea Break |
04:15 - 04:30 |
TTS development frameworks |
04:30 - 05:00 |
Mozilla TTS setup |
05:00 - 05:30 |
Lab session on TTS setup and dataset prepration |
Day 2: Saturday-16th November, 2024
Time Duration |
Task |
09:30 - 11:30 |
Audio signal processing (Fourier Transform & MFCCs) |
11:30 - 11:45 |
Tea Break |
11:45 - 12:15 |
Training a TTS model |
12:15 - 01:00 |
Lab session on TTS configuration and training |
01:00 - 02:00 |
Lunch Break |
02:00 - 02:30 |
Review of vocalic and consonantal sounds in Urdu\English |
02:30 - 03:00 |
Design and development of Lexicon & IPA to CISAMPA mapping |
03:00 - 04:00 |
Lab session on CISAMPA transcription |
04:00 - 04:15 |
Tea Break |
04:15 - 04:30 |
Speaker selection process for TTS Audio Recordings |
04:30 - 05:30 |
Introduction to PRAAT Temporal and spectral analysis of vocalic and consonantal sounds |
Day 3: Friday-22nd November, 2024
Time Duration |
Task |
02:00 - 02:15 |
Recap |
02:15 - 04:00 |
Lab session on temporal and spectral analysis of sounds using PRAAT |
04:00 - 04:15 |
Tea Break |
04:15 - 04:30 |
Audio recording process |
04:30 - 05:30 |
Lab session on speech corpus development using audiobooks |
Day 4: Saturday-23rd November, 2024
Time Duration |
Task |
09:30 - 10:00 |
TTS deployment and evaluation |
10:00 - 10:30 |
Lab session on deployment and testing of the trained voice |
10:30 - 10:45 |
Evaluation of the synthesized speech |
10:45 - 11:30 |
Lab session on subjective evaluation of synthesized speech |
11:30 - 11:45 |
Tea Break |
11:45 - 12:15 |
TTS for a language other than English |
12:15 - 12:45 |
Text analysis module implementation |
12:45 - 01:30 |
Lunch Break |
01:30 - 05:00 |
Open house and closing ceremony |
Registration Fees
Early registrations for the professional category are available.
Registration Type |
Early (1st-20th Oct, 2024) |
Regular (21st-28th Oct, 2024) |
Professional (Industry/Non-Faculty) |
PKR 75,000 |
PKR 100,000 |
Professional (Faculty) |
PKR 30,000 |
PKR 50,000 |
Student (PhD & MPhil) |
PKR 15,000 |
PKR 15,000 |
Registration Deadline:
28th October, 2024
- Participation will be on first come first served basis
- Faculty and students are requested to show their valid original university identity cards for registration
- On-site registration will depend on the availability of seats
- Registration fees is non-refundable and non-transferable
Payment Procedure:
Participants are requested to pay the registration fee in order to confirm their seat. Payments are to be made in any one of the following ways:
- Visit CLE, at University of Engineering and Technology, G.T. Road, Lahore, make payment to Ms. Kashaf Shahzad, Assistant Manager Accounts and Admin, CLE and get a payment receipt.
- Pay through JazzCash. Get your payment receipt from CLE, on the first day of the program.
- Book payment for Receiver Name: Muhammad Kamran Khan, CNIC: 35202-2789993-7, Phone No. 0300-4441100.
- Pay through Bank Deposit.
Banking Information is as follows:-
Account Title: KICS-CLE projects
Account No. 0063-100-000365-4
IBAN. PK74-ASCM-0000-6310-0000-3654
Currency: PKR
Bank Name: Askari Bank Limited
Branch: (0063), Baghbanpura Branch
Address: 9-A, Shalimar Link Road Baghbanpura, Lahore, Pakistan
Venue
Class room #1, Main Auditorium,
University of Engineering & Technology (UET), Lahore
Contact Us
For queries please contact:
CLE Post-Graduate Certificate Program
Center for Language Engineering
Al-Khawarizmi Institute of Computer Science
University of Engineering and Technology, G.T. Road,
Lahore, Pakistan
Tel: +92-42-99029450, +92-42-36821444
Email: pgcertificate@cle.org.pk
|
|
|
|