✓ RPA Challenge 36 - Redacting Sensitive Data


Hello again and welcome back!

I would like to invite you to join to our challenges after summer! :sun_with_face:

Let’s keep practicing together and building better and stronger robots! :muscle:

This new challenge is associated with documents, using regex and a little document understanding.

We have this document (contract) General_Contracting.pdf (84.0 KB)

And the purpose

of today’s robot is help us to find a sensitive data and hide them
:black_circle: We take as a sensitive data: names, emails, bank account numbers, phone numbers, sums of money, …
:heavy_plus_sign: For those who want it more complex, take addresses and signatures as well

:question: HINTS:
– names: if there are two words following each other with each word is capital letter, this may be name
– dates: we expect to be things like dd.MM.yyyy or else dd/MM/yy etc …

How to participate?

Simply send me your code and don’t forget to support/challenge other participants a bit - you can post your processed documents do the comments section.

First three correct, original solvers earn 3 robopoints, next get 1 robopoint.
I will hold a leaderboard with your score and update it.

The challenge deadline is 25. 9. by the end of the day. You can still submit your projects after the deadline but you cannot earn robopoints. A new challenge is published every 2 weeks so don’t worry if you did not catch this one. Keep in mind, your submitted files may be published after the deadline so others can learn from solutions.

Looking forward to your solution :eyes:

Tick tock, tick tock :alarm_clock:

Time is over and our leaderboard is still the same.
Even though we didn’t have any enthusiasts for this challenge you have at least my quick solution and simple approach using RegEx.
BlackOutSensitiveData.zip (79.8 KB)

All used patterns you can find in Multiple assign activity. But a bit tricky it is with names - my pattern takes all two words with capital letters (this could be also an address or on the other hand it will not count with names like Jasmijn van de Weterink, etc…)

I helped myself with PDF Redactor - RPA Component

  • it means you need to at first add this package to your project UiPathTeam.PDFRedaction.Activities via Manage packages image

I also found a RegEx builder page as very helpful.

So this is all form me. I hope it helped you to learn something new :slight_smile:

…and see you next time! :wave:

1 Like