Skip to content

Interpreting Chatter

Facebook
Twitter
LinkedIn
Andrea DeMarco writes about the best decision of his life

In 2008, I ventured into the area of speech technology during the final year of my B.Sc. I.T. (Hons)(Melit.). I did not know what I was getting into. Through my four-year course, I studied Artificial Intelligence. However, I ventured into speech technology (as well as human language technology) because there seemed to be a renewed interest, and an increased reliance of speech and language technology on statistical modelling techniques. The particular area I studied was speaker identification, which is when you identify a person from a voice sample.

A little over a year later I enrolled for a Master of Science by research in Computer Science at the University of Malta in the same topic. The reasons were two-fold. Firstly, I liked the idea of research but was unsure about the long-term commitment required for a Ph.D. Secondly, my undergraduate research project spawned many ideas that I had no time to implement. However, this M.Sc. gave me time to do so. I was inspired by cognitive scientists on how language and phonetics could be processed in humans. I then loosely applied these ideas to algorithmic equivalents for speaker identification.

In this project I discovered salient fractions of phrases which are important for algorithms to identify speakers. I cut down the amount of data required for proper identification making it more efficient. A few months before completing my M.Sc. I started contacting a number of research labs in the UK. By that time I had realised that I loved solving research problems. That was when I felt ready for a longer-term commitment to research, and started a Ph.D. at the University of East Anglia, in the field of native accent and speaker identification.

During my Ph.D. I developed a state-of-the-art classifier for accent identification from speech. The classifier does not require any speech transcription, which is how accent identification usually works. I collaborated with researchers from the University of Birmingham to adapt baseline speech recognition to work better for regional accents. I am now exploring the combination of accent identification with speaker identification systems.

I am currently in the final year of my Ph.D. studies at the University of East Anglia, and employed as a senior researcher in algorithms that can identify emotions. We are developing a mobile app that tracks and keeps a diary of your mood using your voice. This project is funded by a Technology Strategy Board grant. Taking the leap from artificial intelligence into speech technology might have been the best decision of my life. 

Andrea DeMarco was awarded a STEPS scholarship for his Masters studies, and a UEA Vice-Chancellor’s scholarship for his Ph.D. studies.

Author

More to Explore

Reproducibility in Science – Why It Matters More Than Ever

Have you ever scrolled past a viral claim online and thought, ‘That sounds true’? Maybe it was a headline about a miracle cure or a diet hack. Or noticed how quickly new ideas spread online – sometimes before anyone has checked if they’re real? From scientific labs to TikTok feeds, we’re constantly asked to decide: What is true? And more often than not, the answer lies in one underappreciated scientific principle – reproducibility.

A Pocket Guide on Dumplings

Who doesn’t love a tight, little meat package? Before the prudes boo me off stage, I’m talking about dumplings. These delectable morsels are found all over the world. In fact, it could be argued that every country or region has their own type of dumpling. Because, when you get down to eat, few things can match that universal, carnal appeal of a mouthful of warm, juicy meat (okay, that was the last one, I promise). While this is by no means an exhaustive list, we’ve selected six scrumptious dumplings to whet your appetite!

Life Against Entropy

Most of us move through life with a quiet certainty that being alive is self-evident. We grow, think, love, worry, plan. We distinguish instinctively between what lives and what does not. A person is alive; a stone is not. A dog is alive; a machine is not. The line feels obvious, until someone asks us to explain it.

Comments are closed for this article!