
32:13
yes we have started it

32:44
Thank you

46:40
What languages are supported by Document Understanding?

47:25
I guess it's mentioned in the taxonomy json file

50:26
Supported Languages :"SupportedLanguages": [{"Name": "Undetermined Language","Code": "und"},{"Name": "Bulgarian","Code": "bul"},{"Name": "Cantonese","Code": "yue"},{"Name": "Danish","Code": "dan"},{"Name": "Dutch","Code": "nld"},{"Name": "English","Code": "eng"},{"Name": "French","Code": "fra"},{"Name": "Georgian","Code": "kat"},{"Name": "German","Code": "deu"},{"Name": "Greek","Code": "ell"},{"Name": "Hebrew","Code": "heb"},{"Name": "Hindi","Code": "hin"},{"Name": "Italian","Code": "ita"},{"Name": "Japanese","Code": "jpn"},{"Name": "Korean","Code": "kor"},{"Name": "Macedonian","Code": "mkd"},{"Name": "Mandarin","

51:02
is studio pro a mandate for taxonomy ?

51:06
Yes it can be found in Taxanomy.json

52:04
No Mukesh - StudioPro is not mandate - Either Studio or StudioPro will work

53:43
You forgot ADD HEADERS in write range activity

53:54
Ismith

53:59
I have two questions: 1. This can extract data from image file too? 2. What if we have multiple dataset available in raw file ? eg . DATE 01/02/2020 and Date 11/02/2020

55:21
can we train the extracted data by human after the next run?

55:24
yes we can extract data file too

55:39
from image file

55:56
Can we use any other Ocr apart from omni ?

56:01
What in case the OCR doesn't reads the scanned file well?

56:08
Ty

56:15
or the scanned copy is not of good quality?

56:16
yes u can use other OCR

56:33
is this licensed ?

56:35
How to train Regex Extractor?

56:39
Can we train our bot by providing sample invoices and we don't require to validate always?

56:52
yes Mukesh we can user microsoft ocr google ocr

56:54
that is why we use present validation, so that human could give us the correct value

56:55
yes sourav

56:59
what if we want to process multiple pdfs . every time user have to validate it or we can create task in orchestrator for same?

56:59
Its not mandatory to validate

57:02
For regex build : https://regexr.com/

57:21
No Rohan

57:31
For Regex Build I'll suggest: https://regex101.com/

57:53
Rohan If the values are having any error we cannot supply it to the client right?

58:40
Hi Parth

58:52
hey Sourav

59:00
we do the data extraction through Kofax too so how beneficial is UIPath in compare to Kofax

59:11
its ur choice to validate or not......u can validate either in stdio or in orchestrator in action center

59:20
how does the bot "feels" like which template to choose/use? Can you please elaborate on the logic?

59:21
It looks like we can use multiple extractors in Data Extraction Scope. How does that work?

59:22
I have seen Ismith webinar earlier with the same ppt.

59:32
lost words..

01:00:13
okay Sourav

01:00:22
Can we train our bot by providing sample invoices and we don't require to validate always?

01:00:22
@Sahil - For Each page we need to provide 5 distinct fields to identify the page and BOT uses this logic

01:00:42
Can we train bot to capture data similar to IQ Bot?

01:01:02
yes, but which template it uses.....if there a multiple that's my question. How does the bot decide which template to use out of the 5

01:01:03
If your using multiple extractors(ex:regex,form) in same data extraction scope ..it choose best one based on confidence level....we can even select which one to choose

01:01:38
and what if the confidence level is same? then which gets preference?

01:01:43
https://youtu.be/H8y65QbsMRw @sahil

01:01:45
The Best Extraction Result and it has accuracy/Confidence score numbers obtained after getting each field data

01:01:46
go through this

01:01:49
Sahil We have classify document scope activity for the identifying the tempate to be used

01:02:33
not able to see the ishmeet scree

01:02:59
Parth that's what happens with me

01:03:19
Sourav can't get your doubt

01:03:21
The same as happening with Ismith

01:03:31
share recording

01:03:40
Great session so far. what to do incase if the document contains handwritten text and needs to be extracted ?

01:04:06
handwritten documents could be extracted using ocr

01:04:11
we can extract hand written document has well...bot extractor is diffrent

01:04:35
is everyone able to see the screen ?

01:04:36
intelligent form extractor....wont remember exactly

01:04:40
I can't able to see

01:04:57
yes..we are able to see screen

01:05:00
visible to me

01:05:02
yes i am able to see

01:05:06
thanks

01:05:26
@Sankar : Can you please restart and check your settings

01:05:36
We tried handwritten documents (signature,place,date) but failed to extract

01:06:05
SIgnature it can extract,but it can detect whether signature is present or not in document

01:06:16
*signature it cannot extract

01:06:48
.it can detect wheather signature is present or not in document

01:06:53
Oneplus6: it depends upon ocr

01:06:53
Ok any idea on checkboxes and radio buttons

01:07:22
we can do for checkbox as well..not sure about radio button

01:07:55
you can train your ocr engine.. we use abbyy

01:07:57
in machine extractor even option for checkbox....

01:08:04
Again we tried but failed

01:08:34
anyone who needs help with du we can get in touch post session on 7620344502

01:10:27
So we can train the classifier. How can we train the extractor?

01:10:29
If any1 working on Document Undestanding or ABBYY ..can reach out to me for any help 9900220066

01:13:38
Upcoming Sessions:16th July: UiPath AI Computer Vision: Automation without limitations: https://communityevents.uipath.com/e/mja9vy/18th July: Deep Dive into UiPath RE Framework : https://communityevents.uipath.com/e/m2udyg/25th July: End-to-End Automation of your ITSM (ServiceNow and UiPath Working Together)https://communityevents.uipath.com/e/mvz6zt/

01:24:50
If no one takes an action in orchestrator , will the bot be in idle state ? and till what time

01:24:57
Please shown extraction for multiple invoices ..which is the main issue to be solved

01:25:05
show

01:25:27
https://youtu.be/H8y65QbsMRw @Sourav

01:26:18
Parth Is it your youtube channel link

01:26:42
Hi Ishmeet, I've two questions1) How about handwritten text (including signatures)Can we achieve with the same approach? (Set of activites) or is there any other OCR/ Package has to be used for that?2) What about multiple invoice templates (Un- Structured), how to achieve that

01:27:59
https://youtu.be/H8y65QbsMRw: For multiple invoices of different formats

01:28:18
2) What about multiple invoice templates (Un- Structured), how to achieve thatsame question

01:28:41
any suggestions for extracting informations from an email body ?

01:28:46
you can configure templates

01:28:54
can you send a dummy invoice

01:29:47
you will have to create taxonomy and template for each type

01:30:22
Ismith You haven't show yet invoice data extraction for multiple invoices?

01:31:04
https://youtu.be/i6Qv11tbU34: for extraction of multiple invoices @Sourav

01:32:14
your link is wrong Parth

01:32:24
youtu.be

01:32:35
Ishmeet, is it possible to train the extractor activities like the classifier activities?

01:32:36
using ML how confidently it hass extraxted first date only?

01:32:37
Will the ML extract help me get unstructured data as well ?

01:32:42
https://www.youtube.com/watch?v=i6Qv11tbU34

01:33:47
Parth I have seen your video even that way I'm unable to do that

01:34:02
what error are you getting

01:34:40
Its only extrating data from one nvoice that is provided for Template

01:38:40
can we use document understanding to extract information from email ?

01:39:27
Will this document understanding work with Acord forms ?

01:39:29
is this licensed versio

01:40:15
is the bot attended or unattended when we create a validation station?

01:40:28
will it work for multiple invoices ?