Blog

Do You Really Know Who Is Calling?

How AI Voice Fraud Is Exploiting Our Trust in Familiar Voices

Introduction

For generations, we believed that recognising a familiar voice was enough.

A voice meant identity.

A voice meant trust.

Artificial intelligence has quietly changed that assumption, and organised criminals have moved faster than most people realise.

Today, a few seconds of publicly available audio can be enough to create a convincing digital copy of someone’s voice.

Combined with social engineering, that technology is creating a new generation of fraud where hearing is no longer believing.

Imagine this scene.

It is two o’clock in the morning.

A loud, aggressive phone ring wakes you from a deep sleep.

The screen lights up, showing the name of your son or daughter. Before you even answer, your stomach tightens.

You pick up the phone.

On the other end, you hear a panicked, crying voice.

“Mum, I’ve had a car accident… I’m at the hospital with a friend. I urgently need money to cover the costs; otherwise, they’ll lock me up. Please send it immediately to this account.”

The voice is identical.

The tone.

The pitch.

That familiar accent.

Even the way they swallow words when they are under stress.

Everything sounds real.

Your heart pounds.

Your hands shake.

Your finger hovers over the emergency transfer button in your banking app.

Of course you would pay.

Almost every parent would.

Now take a deep breath.

Put the phone down.

That call, which just took ten years off your life, most likely did not come from your child.

On the other end of the line, a stranger, possibly thousands of kilometres away, is using AI-powered voice cloning software. All they needed was a few seconds of your child’s voice and a computer.

Welcome to our new reality.

Technology has advanced to the point where hearing someone’s voice is no longer proof of identity.

Scammers knew this issue long before most of us did.

This is an epidemic that does not discriminate against its victims

After years investigating fraud and analysing criminal behaviour, nothing surprises me.

What does concern me, however, is the volume of intelligence now pointing to a rapidly growing wave of AI-enabled fraud.

Voice cloning has quickly become one of the most convincing forms of modern social engineering, combining artificial intelligence with timeless psychological manipulation.

The statistics reveal a concerning trend.

According to the FBI’s Internet Crime Complaint Centre (IC3), cyber-enabled fraud continues to cause billions of dollars in losses each year, with artificial intelligence increasingly appearing as an enabling tool.

Europol has also warned that generative AI is making impersonation scams more scalable, convincing and accessible to organised criminal groups.

Such fraud is no longer a problem; it affects only people who “don’t understand technology.”.

The victims are mothers and fathers.

Business owners.

Finance directors.

Professionals.

Retirees.

Even experienced investigators.

One of the greatest misconceptions about fraud is that intelligence alone protects us.

It does not.

Successful scams rarely exploit a lack of intelligence.

They exploit trust.

Artificial intelligence has not changed human psychology.

It has simply made deception faster, cheaper and far more persuasive.

The technology has become so sophisticated that most people can no longer reliably distinguish a genuine human voice from an AI-generated copy.

That should concern every one of us.

How Machines Stole Our Voice (And Why They Only Need Three Seconds)

You might be asking yourself the following:

“How did they even get my child’s voice?”

The answer is simple—and uncomfortable.

In many cases, we gave it to them ourselves.

Remember that holiday video you shared on Instagram? A voice message in a public WhatsApp or Telegram group? Your daughter’s TikTok video? Or perhaps a podcast interview?

Every recording contributes to your digital footprint.

Today, the most advanced voice-cloning models require only three to five seconds of clear audio to create a remarkably convincing digital copy.

Just a few years ago, when companies such as OpenAI demonstrated early voice-cloning technology, reproducing a voice required significantly longer recordings, and access was tightly controlled.

Today, equivalent open-source tools are readily available. The technical barrier has almost disappeared. Anyone with basic computer skills and access to the internet can produce convincing synthetic speech.

For organised crime groups, that is an extraordinary opportunity.

Scammers typically obtain voice samples in several ways:

Social media: Videos, stories and live streams published on TikTok, Instagram, Facebook, and YouTube.

Voice Messages: Audio recordings shared in public or poorly secured messaging groups.

Direct Collection: An unknown caller asks a simple question such as, “Can you hear me?” You answer, “Yes, I can.” Then, you might ask, “Who is this caller?” In those few seconds, the caller has captured enough audio to begin building a digital voice profile.

One lesson I have learned throughout years of investigations is that criminals rarely rely on a single piece of information.

They collect fragments.

A voice.

A name.

A phone number.

A social media profile.

Separately, each appears harmless.

Together, they create a convincing deception.

Real-Time Magic

Earlier versions of voice-cloning technology had obvious limitations.

The scammer typed text.

The software generated an audio recording.

The result often sounded robotic, with awkward pauses and unnatural speech patterns.

Those limitations have largely disappeared.

Modern systems operate in real time.

The criminal speaks into a microphone, while artificial intelligence simultaneously transforms that speech into the voice of your loved one. The delay is often less than 200 milliseconds—so small that it is virtually impossible to detect during a normal conversation.

Perhaps even more concerning is that these systems no longer reproduce only the sound of a voice.

They reproduce emotion.

Crying.

Breathlessness.

Fear.

Anger.

Panic.

That matters because successful scams are built on emotion rather than logic.

Behavioural research consistently shows that under stress; people make faster decisions and are less likely to critically evaluate information.

Criminals understand these principles exceptionally well. Their objective is not to simply convince you that you recognise the voice.

Their objective is to trigger an emotional response before your rational mind has time to question what is happening.

When you hear what sounds like your child crying in distress, your first instinct is to protect them, not to investigate.

It is to protect.

That instinct has not changed.

Artificial intelligence simply provides criminals a far more convincing way to exploit it.

How Scammers Weave the Web

After analysing numerous fraud investigations, I found that, although technology continues to evolve, the underlying methodology seldom changes.

Artificial intelligence is simply the latest tool.

The real weapon remains human psychology.

In hundreds of cases, four behavioural patterns occur repeatedly. Each targets a specific psychological vulnerability, encouraging victims to act emotionally rather than rationally.

Technology may have changed. Human nature has not.

Family Emergency

The call usually comes late at night.

A “child” or “grandchild” claims they have been involved in a car accident, have been arrested or are sitting in a hospital emergency department. They urgently need money for bail, medical treatments, or legal costs.

Every minute counts.

There is no time to think.

Only time to act.

Why it works

Late-night calls immediately create emotional shock. A parent’s instinct to protect overrides rational thinking, while urgency discourages verification.

Artificial intelligence makes the deception even more convincing by allowing criminals to reproduce the voice of someone you trust most.

CEO Scam

An employee in the finance department receives an urgent call from what appears to be the managing director or chief executive officer.

The familiar voice requests an immediate and confidential payment relating to an important commercial transaction.

The instruction is clear.

Transfer the money.

Tell no one.

One widely reported case in Hong Kong resulted in a company losing approximately US$25 million after employees believed they were acting on instructions from senior executives during an AI-enabled impersonation scam.

Why it works

The scam exploits authority, workplace hierarchy, and the fear of making expensive mistakes.

The instruction to keep the payment confidential removes the one safeguard most likely to expose the fraud—independent verification.

Bank Customer Service

Your phone displays what appears to be your bank’s official number.

The caller calmly explains that suspicious activity has been detected on your account and asks you to confirm security codes or verify your identity.

Everything sounds professional.

Everything appears legitimate.

Why it works

Modern scams frequently combine AI voice cloning with caller ID spoofing, creating multiple layers of credibility.

People are not simply trusting the voice.

They are trusting the institution.

Fake Police Inspector

A caller introduces themselves as a police investigator.

They explain that your bank account has allegedly been linked to money laundering or another criminal investigation.

To protect your funds, they ask you to verify personal information or transfer money while the investigation continues.

The conversation is formal.

Professional.

Authoritative.

Why it works

Authority has always been one of the strongest tools in social engineering.

Most people naturally cooperate with police during stressful situations.

Artificial intelligence enhances that trust by creating a genuine-sounding voice, thereby increasing the chances of victims complying without questioning the information they receive.

What Every Scam Has in Common

Although these scams appear different, they follow the same operational pattern.

Each relies on one or more predictable human responses:

Urgency prevents careful thinking.
Authority discourages questions.
Fear narrows attention.
Trust lowers scepticism.
Familiarity replaces verification.

This is why I describe artificial intelligence as an enabler rather than the problem itself.

Social engineering has existed for decades.

AI has simply made it faster, cheaper, and significantly more convincing.

The lesson is simple.

Technology has not changed our behaviour.

It changed the scale at which criminals can exploit it.

Five Golden Rules of Defence

Although artificial intelligence continues to evolve at remarkable speed, scammers still leave clues. The challenge is learning to recognise them before emotions take over.

The objective is not to become suspicious of everyone.

It is to develop the habit of verification.

Look for the Delay

Even the most advanced real-time voice-cloning systems require a fraction of a second to process speech.

In a genuine conversation, responses flow naturally.

If you notice a slight delay every time the caller responds—particularly when you interrupt them or suddenly change the subject—treat it as a warning sign.

A simple technique is to ask an unexpected question that only the real person would answer immediately.

For example:

“What did we have for lunch yesterday?”

“What is the name of our next-door neighbour?”

Do not focus only on the answer.

Pay attention to how naturally it comes.

Listen to the emotion.

People under genuine stress rarely speak with perfectly consistent emotion.

They hesitate.

Pause.

Breathe heavily.

They may lose their focus.

Artificial intelligence has become remarkably adept at reproducing emotion, but many cloned voices still maintain the same emotional intensity throughout the conversation.

If someone sounds equally panicked from the first second to the last, without natural variation, question what you are hearing.

Listen Beyond the Voice

A hospital is noisy.

So is a police station.

So is the scene of a serious traffic accident.

Listen carefully to the background.

Can you hear people talking?

Footsteps?

Medical equipment?

Passing traffic?

Artificial intelligence may successfully reproduce a voice, but recreating a believable environment is far more difficult.

Many scammers use looped background recordings that repeat without the listener realising it.

Sometimes the strongest clue is not the voice itself.

Everything is happening around it.

Create a Family Code Word

This is one of the simplest and most effective security measures any family can adopt.

Agree on a private word or phrase known only to close family members.

Do not use birthdays.

Pet names.

Avoid using anything that can be seen on social media.

Choose something completely unrelated that cannot be guessed or researched online.

If a family member ever calls asking for urgent financial help, ask one simple question:

“What’s our code word?”

If they hesitate or avoid answering, end the conversation immediately.

As one victim later reflected:
“I’d rather have my family laugh because I asked for the password than lose my savings to someone pretending to be my child.”

Hang Up and Call Them Yourself

This remains the single most effective defence against AI voice-cloning scams.

No matter how convincing the voice sounds…

No matter what name appears on your screen…

Hang up.

Next, please access your contacts and place the call to the individual directly.

Scammers can spoof incoming numbers.

They cannot usually intercept a genuine outbound call that you initiate independently.

If the caller insists,

“Don’t hang up.”

That is often the clearest indication that you should.

How to Secure Yourself for the Long Run

Responding to individual scams is no longer enough.

The better strategy is to reduce your exposure before criminals ever target you.

First Layer: Reduce Your Digital Voice Footprint

Completely disappearing from the internet is unrealistic.

Being more selective is not.

Review your social media privacy settings.

Limit who can view your videos.

Avoid posting unnecessary voice recordings publicly.

In large messaging groups, consider typing rather than sending voice messages.

Keep your conversations brief when you receive calls from strangers.

Every second of clear audio is valuable training material for modern voice-cloning systems.

Second Layer: Use Technology to Your Advantage

A phone call is often only the first stage of the scam.

The next step usually involves a fraudulent website or payment link.

Before clicking any link received during an unexpected conversation, verify it independently.

Use reputable spam filters, caller identification services and browser security tools.

Technology helped create this problem.

Technology can also play a role in solving this problem.

Third Layer: Educate the People Around You

Awareness is the strongest defence against social engineering.

Share this information with your parents.

Your grandparents.

Your children.

Your colleagues.

Older people remain frequent targets because they often place enormous trust in institutions and family relationships.

A short conversation today may prevent a devastating financial loss tomorrow.

Emergency Plan

If this article resonates with you and you realise that a scam has already targeted you, don’t blame yourself.

Modern AI-enabled fraud is designed to exploit human trust and emotion. Intelligent, experienced people become victims every day.

The important thing is to act quickly.

Please contact your bank immediately.

Inform your bank that you suspect a scam and ask them to freeze your account, as well as the recipient’s account if possible. Rapid reporting may enable the recovery of funds in certain situations.

Report the matter to police.

Provide every piece of information you have, including the telephone number, the time of the call, your bank account details, and any websites or links you were asked to visit.

Preserve the evidence.

Save screenshots, SMS messages, emails, call logs and payment confirmations. Every detail may assist investigators in tracing criminal activity.

Inform your family.

If scammers have cloned the voice of one family member, they may attempt to target others within the same network. A simple warning may prevent someone else from becoming the next victim.

What Awaits Us?

Artificial intelligence will continue to evolve.

Unfortunately, so will the criminals who misuse it.

Voice cloning is unlikely to be the final stage of AI-enabled fraud.

Investigators are already seeing increasingly sophisticated combinations of synthetic voices, manipulated videos, deepfake identities, and automated social engineering.

Soon, the scam may extend beyond a phone call.

You may receive a video call in which you both hear and see a family member apparently asking for urgent financial help.

Artificial intelligence also allows criminals to automate attacks on an unprecedented scale.

Using data obtained through previous breaches, bots can contact thousands of potential victims, tailoring conversations to everyone with remarkable accuracy.

That sounds confronting.

It should.

But fear is not the answer.

Awareness is.

Conclusion

Technology has fundamentally changed one of humanity’s oldest assumptions.

For generations, we believed that recognising a familiar voice meant identifying it.

That is no longer true.

Artificial intelligence has made it possible to imitate voices with exceptional accuracy, giving organised criminals a powerful new tool for deception.

Yet despite all the technological advances, the objective of every scam remains the same.

The goal is to prompt you to act impulsively.

The strongest defence is not better software.

It is better judgement.

Pause.

Verify.

Ask questions.

Call the person back using a trusted number.

These brief moments could protect your savings, your family, and your mental well-being.

There was a time when hearing a familiar voice was enough.

Today, trust is no longer something we hear.

It is something we verify.

This post was written by Mario Bekes

Corporate Investigations and
Risk Management

Blog

Do You Really Know Who Is Calling?

How AI Voice Fraud Is Exploiting Our Trust in Familiar Voices

Introduction

This is an epidemic that does not discriminate against its victims

How Machines Stole Our Voice (And Why They Only Need Three Seconds)

Real-Time Magic

How Scammers Weave the Web

Family Emergency

Why it works

CEO Scam

Why it works

Bank Customer Service

Why it works

Fake Police Inspector

Why it works

What Every Scam Has in Common

Five Golden Rules of Defence

How to Secure Yourself for the Long Run

First Layer: Reduce Your Digital Voice Footprint

Second Layer: Use Technology to Your Advantage

Third Layer: Educate the People Around You

Emergency Plan

Please contact your bank immediately.

Report the matter to police.

Preserve the evidence.

Inform your family.

What Awaits Us?

Conclusion

It is something we verify.

Recent Post

The 4 AM Rebellion

The Price of a Dress

The Digital Trap

Do You Really Know Who Is Calling?

The Digital Soul Catcher

Our Services