15 videos • 📅 2025-01-27 09:00:00 America/Bahia_Banderas
24:24
2025-01-27 13:13:59
2:06:12
2025-01-27 13:42:41
3:36:29
2025-01-28 09:08:14
4:33
2025-01-28 13:48:42
55:46
2025-01-28 14:06:51
2:02
2025-01-29 10:22:33
1:02:14
2025-01-29 10:25:14
2:10
2025-01-29 11:38:26
2:26
2025-01-29 12:03:00
1:23:37
2025-01-29 12:05:56
35:40
2025-01-29 15:01:26
1:40:43
2025-01-30 09:07:07
1:08:48
2025-01-30 11:20:20
1:10:50
2025-01-30 13:15:56
3:50:03
2025-01-31 07:20:07

Visit the Talend Big Data Integration y Ecosistema de Talend course recordings page

United Arab Emirates - Talend Big Data Integration

                WEBVTT

00:00:29.060 --> 00:00:29.740
Thank you very much.

00:00:58.460 --> 00:00:59.740
Thank you very much.

00:01:10.000 --> 00:01:13.640
Thank you very much.

00:01:13.640 --> 00:01:18.060
Thank you very much.

00:01:18.120 --> 00:01:20.160
Thank you very much.

00:02:01.360 --> 00:02:13.420
Thank you very much.

00:02:43.520 --> 00:02:43.620
Thank you very much.

00:03:13.620 --> 00:03:17.720
So we're going to start with the example from yesterday,

00:03:18.540 --> 00:03:22.180
in which we're going to integrate the Big Data part with IA.

00:03:22.240 --> 00:03:24.220
How would we do that integration process?

00:03:25.680 --> 00:03:29.220
At first, we had already done a job that allowed us to

00:03:29.220 --> 00:03:30.700
connect to an IA and ask questions.

00:03:31.260 --> 00:03:34.840
The idea now is, let's say, to integrate some work done

00:03:34.840 --> 00:03:38.480
with Big Data, and then we can do an analysis or some

00:03:38.480 --> 00:03:42.440
projection using IA, in this case, the OpenAI.

00:03:43.380 --> 00:03:47.280
Next, we're going to continue with Hive, which is an engine.

00:03:48.560 --> 00:03:53.260
Basically, Hive is used a lot to form Data Warehouse or Data

00:03:53.260 --> 00:03:53.940
Lighthouse.

00:03:54.880 --> 00:03:57.600
And we continue with Spark, although with Spark I'm not going

00:03:57.600 --> 00:03:58.160
to give an example.

00:03:58.180 --> 00:03:59.080
I'm going to show you how it's done.

00:03:59.120 --> 00:04:00.960
Actually, it's very simple in the commercial version.

00:04:01.100 --> 00:04:02.140
It's nothing out of this world.

00:04:03.300 --> 00:04:05.940
And we end then with, oh, well,

00:04:05.940 --> 00:04:08.160
I'll give you an example of what the version part would be

00:04:08.160 --> 00:04:10.180
like with the proprietary tool.

00:04:12.200 --> 00:04:16.780
And we end with the part of the study case, evaluable.

00:04:17.100 --> 00:04:19.580
In the end, then, we do it in the last two hours.

00:04:20.660 --> 00:04:23.320
We're going to continue then, as I mentioned with the first

00:04:23.320 --> 00:04:26.180
example, which was the integration of Big Data with

00:04:26.180 --> 00:04:26.620
IA.

00:04:27.020 --> 00:04:29.280
For that, then, I'm going to connect now to the

00:04:29.280 --> 00:04:31.040
virtual machine, which I haven't connected yet.

00:04:31.400 --> 00:04:32.260
I'm going to see here.

00:04:32.320 --> 00:04:33.100
I think I started it.

00:04:33.200 --> 00:04:33.880
Yes, perfect.

00:04:34.520 --> 00:04:37.320
I'm going to share the virtual machine.

00:04:37.980 --> 00:04:38.760
It's here.

00:04:38.800 --> 00:04:39.280
Perfect.

00:04:39.980 --> 00:04:41.680
And the first thing we're going to do, then,

00:04:41.680 --> 00:04:44.380
is we're going to upload the Hadoop cluster that we have in

00:04:44.380 --> 00:04:44.820
Hyper-V.

00:04:45.520 --> 00:04:50.800
Then we go here to the direct access of Hyper-V,

00:04:50.880 --> 00:04:54.120
the administrator, and I just give it to start.

00:04:56.920 --> 00:04:57.480
Perfect.

00:04:57.520 --> 00:05:00.800
And I should leave it in the state that the virtual machine

00:05:00.800 --> 00:05:01.300
had yesterday.

00:05:03.400 --> 00:05:10.100
And then we're going to upload the one in Big Data,

00:05:11.560 --> 00:05:13.420
which we have here in the desktop.

00:05:15.020 --> 00:05:15.320
And that's it.

00:05:15.380 --> 00:05:17.440
Those are the two initial tools that we're going to see.

00:05:17.720 --> 00:05:18.820
We're done with this example.

00:05:18.860 --> 00:05:20.940
We're going to add a new tool called Hype,

00:05:21.020 --> 00:05:21.740
as I was saying.

00:05:22.240 --> 00:05:23.640
For that, I created a container.

00:05:24.440 --> 00:05:26.140
In this container, I have version 4 of Hype.

00:05:26.200 --> 00:05:28.300
I have to explain how the architecture is,

00:05:29.260 --> 00:05:32.360
how it's currently installed in each virtual machine.

00:05:34.020 --> 00:05:35.320
Let's go over here.

00:05:35.400 --> 00:05:35.700
Done.

00:05:35.780 --> 00:05:37.060
And there are some that are going up.

00:05:37.220 --> 00:05:38.120
There are some that already have it.

00:05:38.220 --> 00:05:38.480
Perfect.

00:05:38.540 --> 00:05:41.960
There are some that already have the one in Big Data installed.

00:05:42.780 --> 00:05:43.820
Perfect, then.

00:05:47.040 --> 00:05:47.560
Perfect.

00:05:47.740 --> 00:05:49.660
Here, then, it has gone up.

00:05:50.240 --> 00:05:52.600
And we're looking for the last job we did,

00:05:52.820 --> 00:05:55.980
which was to get the three products less sold.

00:05:57.020 --> 00:06:00.980
And we had made an initial filter by city and by gender.

00:06:02.320 --> 00:06:05.720
Particularly, I took as a city Yangon and as a male gender.

00:06:07.240 --> 00:06:10.040
We're going to look for that last job we did,

00:06:12.240 --> 00:06:14.740
which is here in the Big Data project.

00:06:15.740 --> 00:06:20.740
And it's called listing 10 products less sold.

00:06:20.780 --> 00:06:24.580
But in the end, with the filter, I had six records left.

00:06:24.580 --> 00:06:29.440
So we ended up only filtering the last three,

00:06:29.600 --> 00:06:32.000
that is, the three less sold.

00:06:46.160 --> 00:06:47.760
Perfect, then.

00:06:48.000 --> 00:06:49.760
Here I have that part.

00:06:49.920 --> 00:06:50.300
Done.

00:06:50.360 --> 00:06:52.500
I'm going to wait for some to be here.

00:06:53.520 --> 00:06:55.780
Well, this job, what it does,

00:06:56.160 --> 00:06:58.580
I had commented on a case with Netflix,

00:06:58.780 --> 00:07:00.520
and it's something very popular with Big Data.

00:07:01.540 --> 00:07:05.540
Normally, Big Data is used a lot for real-time

00:07:05.540 --> 00:07:07.600
or almost real-time in the case of Netflix,

00:07:07.700 --> 00:07:09.420
which analyzes everything you do.

00:07:10.660 --> 00:07:13.120
There may be a large amount of data that is going to be processed,

00:07:13.140 --> 00:07:16.780
but in the end, the result can be 10, 15, 20 records,

00:07:16.860 --> 00:07:17.620
as in this case.

00:07:18.740 --> 00:07:19.560
For this job,

00:07:19.560 --> 00:07:25.040
take into account that we take as an input from the Hadoop cluster,

00:07:25.360 --> 00:07:27.820
and remember that when we take as an input from the Hadoop cluster,

00:07:27.980 --> 00:07:31.500
what the tool does internally is that it converts that to MacReduce,

00:07:31.980 --> 00:07:34.300
and it would execute it against the cluster that you have configured.

00:07:35.660 --> 00:07:38.040
But this job has something in particular

00:07:38.040 --> 00:07:40.980
in that the output is sent directly to a file.

00:07:41.440 --> 00:07:44.820
From the fact that the file has few records,

00:07:44.820 --> 00:07:50.680
I think it lacks a bit of sense to send it to the Hadoop cluster,

00:07:50.820 --> 00:07:52.280
given the fact that they are 10 records

00:07:52.280 --> 00:07:54.900
and I can easily handle it as a traditional job.

00:07:55.680 --> 00:07:57.900
So that's why I sent it directly to a file

00:07:57.900 --> 00:08:01.500
and I didn't send it directly to the Hadoop cluster.

00:08:01.840 --> 00:08:03.220
Let's do the following then.

00:08:04.100 --> 00:08:08.140
Let's configure our PROM for this part.

00:08:09.300 --> 00:08:11.980
To configure this PROM, let's do the following.

00:08:13.560 --> 00:08:16.200
Go to the folder that we downloaded yesterday

00:08:16.200 --> 00:08:18.300
which is called Credentials OpenAI.

00:08:22.860 --> 00:08:25.960
To vary a bit and also so that it doesn't damage

00:08:25.960 --> 00:08:28.000
the PROM that we had with the previous job,

00:08:29.120 --> 00:08:31.700
we are going to create a file with a new PROM,

00:08:31.780 --> 00:08:32.980
so that we have the two PROMs.

00:08:33.880 --> 00:08:38.380
So there is a file that says prom.json.txt.

00:08:38.880 --> 00:08:41.500
Please take that file and take out a copy.

00:08:43.280 --> 00:08:44.820
Copy. If you want, paste it there.

00:08:46.280 --> 00:08:47.980
So here I am going to change the name.

00:08:53.680 --> 00:08:54.160
PROMPT.

00:08:56.180 --> 00:09:00.860
I am going to call it PROM less sold items.

00:09:02.800 --> 00:09:04.540
There I made a mistake of leaving it in TXT,

00:09:04.740 --> 00:09:06.980
but remember that the tool has no problem with that part.

00:09:09.260 --> 00:09:11.220
In the end, what it does is change the filter,

00:09:11.220 --> 00:09:13.940
but the important thing is the internal format,

00:09:14.400 --> 00:09:15.780
not so much the extension that it has,

00:09:15.900 --> 00:09:17.600
because the tool can read it.

00:09:18.600 --> 00:09:20.240
The idea of this is to create a new file

00:09:20.240 --> 00:09:22.060
so that it does not damage the previous job

00:09:22.060 --> 00:09:23.880
and also to see how we would do

00:09:23.880 --> 00:09:26.500
to change the initial configuration

00:09:26.500 --> 00:09:27.700
that we had of the metadata,

00:09:27.980 --> 00:09:29.680
because we are going to have a new file.

00:09:30.620 --> 00:09:33.220
For this, then, we are going to enter the new file.

00:09:37.820 --> 00:09:39.160
Well, I'm going to write a PROM.

00:09:39.160 --> 00:09:41.160
PROMs are not very good at these PROMs.

00:09:41.960 --> 00:09:43.240
I'm going to write a PROM.

00:09:43.260 --> 00:09:45.160
I don't know if you want to write one of your own

00:09:45.160 --> 00:09:48.240
or follow me in the writing of this PROM.

00:09:48.440 --> 00:09:49.800
Remember that by the format of the file,

00:09:49.920 --> 00:09:52.800
we are going to write the PROM in the double quotes.

00:09:54.340 --> 00:09:56.400
So I'm going to start this way.

00:09:56.720 --> 00:10:07.160
I have a company that sells products in the city

00:10:09.880 --> 00:10:11.620
of Yangon, I don't know what it's called,

00:10:11.720 --> 00:10:13.040
I don't know, it sounds like India.

00:10:13.460 --> 00:10:15.040
I have a company that sells products

00:10:15.040 --> 00:10:17.160
in the city of Yangon

00:10:23.320 --> 00:10:26.080
to men and women.

00:10:29.640 --> 00:10:35.180
I have a company that sells products

00:10:35.180 --> 00:10:37.920
in the city of Yangon to men and women.

00:10:43.660 --> 00:10:51.720
Currently, men consume little of the following products.

00:10:52.660 --> 00:10:54.140
And here is where the trick comes from,

00:10:54.980 --> 00:10:56.720
but this trick is going to be done with Java.

00:10:56.840 --> 00:10:59.420
That is, I am going to insert the data

00:10:59.420 --> 00:11:00.620
that the Big Data brings me

00:11:00.620 --> 00:11:02.420
just after that product.

00:11:03.660 --> 00:11:04.660
Give it two points.

00:11:04.900 --> 00:11:07.180
This is important, regardless of the PROM

00:11:07.180 --> 00:11:08.560
that you want to do,

00:11:09.040 --> 00:11:10.500
you are going to put the following,

00:11:10.740 --> 00:11:12.600
percentage S,

00:11:13.900 --> 00:11:15.000
percentage S,

00:11:15.040 --> 00:11:17.300
two points, percentage S.

00:11:17.700 --> 00:11:18.920
That percentage S, what it's going to do

00:11:18.920 --> 00:11:19.920
is that in execution time,

00:11:20.080 --> 00:11:22.060
I'm going to replace a string

00:11:22.060 --> 00:11:24.220
where that percentage is.

00:11:24.340 --> 00:11:26.700
Currently, men consume little

00:11:27.500 --> 00:11:29.340
of the following products.

00:11:30.340 --> 00:11:38.200
I want you to make me a sales plan

00:11:38.200 --> 00:11:46.260
taking the cultural and socio-economic part

00:11:48.160 --> 00:11:50.280
of the region.

00:11:51.520 --> 00:11:57.640
If you want, I'll send you the text

00:11:57.640 --> 00:11:58.340
that I put here.

00:11:58.580 --> 00:11:59.880
Well, I don't know if you want to do it.

00:11:59.880 --> 00:12:03.040
The important thing is the percentage S.

00:12:03.200 --> 00:12:05.040
That percentage S has nothing to do with talent,

00:12:05.080 --> 00:12:05.840
it's more about Java.

00:12:07.200 --> 00:12:08.280
There I sent you the PROM

00:12:08.280 --> 00:12:09.500
through the chat.

00:12:11.700 --> 00:12:14.220
Please remember to save it.

00:12:15.020 --> 00:12:16.160
And I repeat, the PROM,

00:12:16.320 --> 00:12:17.300
just in case you are writing it.

00:12:17.460 --> 00:12:19.260
I started this way, it is not the best PROM,

00:12:19.380 --> 00:12:22.000
but the idea is the exercise of how we integrate it.

00:12:22.460 --> 00:12:23.700
I have a company that sells products

00:12:23.700 --> 00:12:25.220
in the city of Yangon to men and women.

00:12:25.220 --> 00:12:26.960
Currently, men consume little

00:12:26.960 --> 00:12:27.880
of the following products.

00:12:28.220 --> 00:12:30.100
I want you to make me a sales plan

00:12:30.100 --> 00:12:32.560
taking the cultural and socio-economic part

00:12:32.560 --> 00:12:33.840
of the region.

00:12:34.400 --> 00:12:35.640
The percentage S is important.

00:12:37.040 --> 00:12:38.500
Perfect, ready then.

00:12:39.440 --> 00:12:43.580
Let's save it here.

00:12:45.200 --> 00:12:49.780
Let's go back to Mr. Talent Big Data.

00:12:50.240 --> 00:12:51.620
And look what we are going to do now.

00:12:52.820 --> 00:12:55.340
We are going to take the job.

00:12:55.840 --> 00:12:57.640
We open the job base from yesterday.

00:12:57.660 --> 00:12:58.540
We made the one from yesterday.

00:12:58.780 --> 00:13:00.760
It is called Job Connection API Open AI.

00:13:03.620 --> 00:13:04.560
And copy all the job.

00:13:05.040 --> 00:13:06.100
I think if you right click,

00:13:06.920 --> 00:13:08.740
it gives you to copy all the job.

00:13:08.820 --> 00:13:11.920
Right click on the sub job you have there.

00:13:12.180 --> 00:13:13.600
And you have to copy all the elements

00:13:13.600 --> 00:13:14.300
of that sub job.

00:13:14.880 --> 00:13:16.480
And you are going to paste it

00:13:16.480 --> 00:13:17.220
just

00:13:17.220 --> 00:13:18.080
like that.

00:13:20.200 --> 00:13:20.800
Yes, here it is.

00:13:20.920 --> 00:13:23.360
And you paste it in the bottom part.

00:13:24.900 --> 00:13:26.420
You paste it in the new one

00:13:26.420 --> 00:13:28.960
which is the Job List 10 less sold items.

00:13:29.940 --> 00:13:31.760
It could be a generic job

00:13:31.760 --> 00:13:33.340
given a file

00:13:34.960 --> 00:13:36.380
but I don't think

00:13:36.380 --> 00:13:38.400
I have time for that generic job.

00:13:41.400 --> 00:13:43.200
Try to place it well down here

00:13:43.200 --> 00:13:44.780
so that it doesn't get confused.

00:13:44.780 --> 00:13:46.100
I am going to close

00:13:46.780 --> 00:13:49.460
I am going to add a little more here

00:13:49.460 --> 00:13:51.080
and I am going to minimize

00:13:51.080 --> 00:13:52.820
here in a minus that I have here

00:13:52.820 --> 00:13:54.520
so that it is more expanded.

00:13:55.980 --> 00:13:56.920
Let me know then

00:13:56.920 --> 00:13:58.480
if there are some that are still ready.

00:13:58.680 --> 00:13:59.240
Perfect.

00:14:00.380 --> 00:14:03.120
First, I am going to modify the PROM.

00:14:03.280 --> 00:14:04.780
Why? Because the PROM

00:14:04.780 --> 00:14:06.820
now I have to add a different element

00:14:06.820 --> 00:14:09.920
now. So look what I am going to do here.

00:14:10.460 --> 00:14:10.940
The line

00:14:10.940 --> 00:14:12.680
that connects the PROM with the

00:14:12.680 --> 00:14:14.940
T-Map, you are going to erase it.

00:14:14.980 --> 00:14:16.900
The PROM connection line

00:14:16.900 --> 00:14:19.080
with the T-Map.

00:14:19.700 --> 00:14:20.620
But there is still a moment

00:14:20.620 --> 00:14:22.420
that there are some that are still organizing

00:14:22.420 --> 00:14:23.760
the moment to see.

00:14:27.780 --> 00:14:28.280
It seems that

00:14:28.280 --> 00:14:29.380
a moment has been left.

00:14:30.760 --> 00:14:32.500
It seems that

00:14:32.500 --> 00:14:34.660
there is a moving window that does not allow

00:14:34.660 --> 00:14:36.680
to move here. Give me a moment then

00:14:37.400 --> 00:14:38.640
and I enter interactive

00:14:38.640 --> 00:14:40.160
to see something.

00:14:40.160 --> 00:14:42.160
Ready then.

00:14:43.980 --> 00:14:44.160
Super weird.

00:14:51.060 --> 00:14:51.680
Oops.

00:14:52.840 --> 00:14:54.280
I am going to close then the engineer

00:14:54.280 --> 00:14:55.640
let's see because how weird

00:14:57.240 --> 00:14:58.520
that happens.

00:14:59.420 --> 00:15:01.540
A moment to see then.

00:15:02.120 --> 00:15:03.080
Wait a moment

00:15:03.080 --> 00:15:04.160
please the others.

00:15:06.660 --> 00:15:09.520
I do not know why I have the CPU so

00:15:10.060 --> 00:15:11.740
let's see here. Let's try to open

00:15:11.740 --> 00:15:13.180
again. I think it is the JOK

00:15:13.180 --> 00:15:15.540
I do not know.

00:15:16.140 --> 00:15:17.640
There was another JOK to see.

00:15:21.820 --> 00:15:22.700
This is weird.

00:15:23.200 --> 00:15:24.700
It was here.

00:15:25.660 --> 00:15:26.980
I think it was the tab.

00:15:28.500 --> 00:15:29.080
A moment please.

00:15:29.740 --> 00:15:31.660
Being honest I do not know what happened there.

00:15:31.860 --> 00:15:33.740
But let's try to open

00:15:33.740 --> 00:15:34.700
the other again.

00:15:34.920 --> 00:15:37.060
Being honest I do not know what happened there.

00:15:37.680 --> 00:15:38.900
Let's try to open the JOK

00:15:38.900 --> 00:15:39.360
a moment please.

00:15:39.520 --> 00:15:41.900
Let's try to open the JOK a moment please.

00:15:41.900 --> 00:15:46.040
Being honest I do not know what happened there.

00:15:46.040 --> 00:15:46.640
Being honest I do not know what happened there.

00:15:46.640 --> 00:15:47.640
I am going to copy this man

00:15:47.640 --> 00:15:49.700
and paste it.

00:15:49.700 --> 00:15:52.060
I click here and correct the error.

00:15:52.240 --> 00:15:53.020
I do not know why it happens there.

00:15:54.960 --> 00:15:56.680
Remember to click here because

00:15:56.680 --> 00:15:59.240
it seems that the perspective is not damaged.

00:15:59.440 --> 00:16:00.060
I do not know why.

00:16:02.020 --> 00:16:03.040
Ready then.

00:16:03.060 --> 00:16:05.120
We start from this point here.

00:16:05.120 --> 00:16:06.460
First what we are going to do

00:16:06.460 --> 00:16:09.100
there is a file called PROM

00:16:09.100 --> 00:16:11.160
which is the one that connects to the theme.

00:16:11.940 --> 00:16:13.560
Please, the connection line

00:16:13.560 --> 00:16:15.080
remove it because

00:16:15.080 --> 00:16:17.120
we are going to format that PROM.

00:16:19.560 --> 00:16:21.660
Notice that now when you disconnect it

00:16:21.660 --> 00:16:23.260
it no longer belongs

00:16:23.260 --> 00:16:24.200
to that JOK.

00:16:25.520 --> 00:16:27.100
So if there is any problem

00:16:27.100 --> 00:16:29.140
to be able to connect it, lower the

00:16:29.140 --> 00:16:31.260
JOK a little so that this PROM is free up here.

00:16:31.860 --> 00:16:33.320
Later when you reconnect it

00:16:33.320 --> 00:16:35.200
there is no problem to be able to

00:16:35.200 --> 00:16:36.600
put it in this JOK.

00:16:38.060 --> 00:16:38.780
Just in case.

00:16:41.640 --> 00:16:43.920
Now we are going to place a file.

00:16:44.760 --> 00:16:46.360
This file that I generated here

00:16:46.360 --> 00:16:47.520
which is the one I want to integrate

00:16:47.520 --> 00:16:48.940
this file that is out

00:16:48.940 --> 00:16:51.300
is the one I want to read to integrate it

00:16:51.300 --> 00:16:52.840
and send it over there.

00:16:57.660 --> 00:16:59.420
This is not finished.

00:17:00.320 --> 00:17:01.240
So do the following.

00:17:01.240 --> 00:17:01.780
I am going to do

00:17:01.780 --> 00:17:04.800
I am going to export this man.

00:17:04.980 --> 00:17:05.560
He is an example.

00:17:06.100 --> 00:17:07.820
I am going to export this JOK.

00:17:08.360 --> 00:17:12.200
And I share this JOK with him.

00:17:12.240 --> 00:17:12.640
Perfect.

00:17:15.240 --> 00:17:17.720
I am going to share it

00:17:17.720 --> 00:17:19.480
here in 10.

00:17:26.000 --> 00:17:26.800
Perfect.

00:17:28.800 --> 00:17:30.620
And now let me share

00:17:30.620 --> 00:17:32.920
this 10 here.

00:17:36.300 --> 00:17:38.220
Here is the local project.

00:17:39.780 --> 00:17:40.240
Wait a moment.

00:17:40.520 --> 00:17:43.120
It didn't generate it correctly.

00:17:43.280 --> 00:17:44.440
Wait a moment.

00:17:45.060 --> 00:17:45.380
Export item.

00:17:46.180 --> 00:17:49.880
Here it is.

00:17:50.260 --> 00:17:51.560
JOK connection API.

00:17:51.620 --> 00:17:52.280
Perfect.

00:17:52.900 --> 00:17:53.500
Perfect.

00:17:55.960 --> 00:17:58.360
I am going to compress it.

00:17:59.500 --> 00:18:00.180
And I am going to upload it

00:18:00.180 --> 00:18:01.180
to the shared folder.

00:18:01.400 --> 00:18:03.780
If you wish, go to the shared folder.

00:18:08.740 --> 00:18:11.320
I think I have an open Gmail connection here.

00:18:11.720 --> 00:18:12.020
I don't know.

00:18:15.860 --> 00:18:18.060
I am going to try to send it here

00:18:18.060 --> 00:18:19.680
to my team and

00:18:19.680 --> 00:18:21.280
it is faster for me to upload it here.

00:18:21.460 --> 00:18:23.680
I am going to try to see it.

00:18:42.980 --> 00:18:43.460
Done.

00:18:43.600 --> 00:18:45.480
Let me download it here in my team

00:18:45.480 --> 00:18:46.740
and upload it immediately

00:18:46.740 --> 00:18:48.600
to the shared folder.

00:18:48.920 --> 00:18:51.820
I already told you how to export it.

00:18:51.820 --> 00:18:52.620
To import it.

00:18:53.060 --> 00:18:53.920
If it stays there.

00:18:54.160 --> 00:18:54.420
Done.

00:18:56.460 --> 00:18:58.600
I am going to present your screen

00:18:58.600 --> 00:19:00.080
to see how we can import it.

00:19:00.180 --> 00:19:01.300
Who called me? Brenda, right?

00:19:01.440 --> 00:19:02.320
Yes, Brenda.

00:19:03.320 --> 00:19:05.080
I am on your screen.

00:19:05.460 --> 00:19:07.400
Go to the shared folder.

00:19:09.100 --> 00:19:10.200
Access the Edge.

00:19:11.160 --> 00:19:12.640
Open your Edge browser.

00:19:14.120 --> 00:19:15.820
I think it should be

00:19:15.820 --> 00:19:16.480
in the history.

00:19:16.740 --> 00:19:17.700
It should be there.

00:19:18.120 --> 00:19:18.220
There it is.

00:19:19.440 --> 00:19:20.000
Perfect.

00:19:22.920 --> 00:19:24.460
Close that window, please.

00:19:24.560 --> 00:19:25.180
From the right side.

00:19:25.980 --> 00:19:27.740
That one is perfect.

00:19:28.220 --> 00:19:29.800
Now download that one called

00:19:29.800 --> 00:19:31.720
Job Connection Open.

00:19:32.480 --> 00:19:32.840
Done.

00:19:32.920 --> 00:19:35.460
Go to Downloads and unzip it, please.

00:19:35.620 --> 00:19:37.480
Now go to the Big Data tab

00:19:37.480 --> 00:19:39.820
and it is located in the

00:19:39.820 --> 00:19:41.440
Big Data tab.

00:19:41.860 --> 00:19:43.820
Right click there.

00:19:44.060 --> 00:19:45.520
Give it

00:19:45.520 --> 00:19:46.820
Import Items.

00:19:47.020 --> 00:19:48.600
Select it, please.

00:19:48.860 --> 00:19:50.440
Now I got confused.

00:19:51.820 --> 00:19:52.580
Did you have the Open Eye one?

00:19:52.640 --> 00:19:54.540
Or was it missing the three?

00:19:54.540 --> 00:19:55.340
Which one was missing?

00:19:57.980 --> 00:19:58.900
Ok.

00:19:59.140 --> 00:20:00.920
Choose the Open Eye tab.

00:20:02.180 --> 00:20:02.880
Job Open Eye.

00:20:03.860 --> 00:20:05.440
Select the tab.

00:20:06.000 --> 00:20:07.060
Wait a moment.

00:20:07.060 --> 00:20:08.840
It was missing the other one.

00:20:08.960 --> 00:20:10.940
Wait a moment.

00:20:14.100 --> 00:20:15.060
Yes. Wait a moment.

00:20:17.180 --> 00:20:18.260
Ok. Wait a moment.

00:20:19.380 --> 00:20:21.600
I am going to export this one.

00:20:25.160 --> 00:20:25.720
This one.

00:20:39.720 --> 00:20:41.620
Go to the shared folder again.

00:20:41.980 --> 00:20:42.280
Wait a moment.

00:20:42.280 --> 00:20:43.240
It is going to be here.

00:20:44.020 --> 00:20:45.560
Download the one called

00:20:45.560 --> 00:20:46.560
F5, please.

00:20:46.880 --> 00:20:49.800
The one called Job Listar 10.

00:20:49.820 --> 00:20:51.320
There it is. Job Listar 10.

00:20:51.380 --> 00:20:52.280
Less sold items.

00:20:53.600 --> 00:20:54.740
And unzip it again.

00:20:54.880 --> 00:20:56.840
Perfect. Now go to the Big Data tab again.

00:20:57.140 --> 00:20:58.640
Right click there, please.

00:21:00.260 --> 00:21:00.980
But wait a moment.

00:21:01.400 --> 00:21:02.560
You have to delete it.

00:21:02.560 --> 00:21:03.980
Because I think it is called exactly the same.

00:21:03.980 --> 00:21:05.280
So delete the one you have.

00:21:06.200 --> 00:21:07.440
If you have to close it before deleting it,

00:21:07.460 --> 00:21:08.540
you have to close the job.

00:21:08.740 --> 00:21:12.080
Close it.

00:21:12.540 --> 00:21:13.220
Say no.

00:21:15.120 --> 00:21:15.660
And delete it.

00:21:15.840 --> 00:21:17.280
There is a small problem with the tool.

00:21:18.060 --> 00:21:19.380
Even if I have deleted it,

00:21:20.360 --> 00:21:21.960
it won't let me create it with the same name.

00:21:22.340 --> 00:21:23.000
Delete it there.

00:21:23.120 --> 00:21:25.240
And at the end, it says to delete it.

00:21:25.880 --> 00:21:27.480
Delete it from the folder because

00:21:27.480 --> 00:21:28.620
it is still in the folder.

00:21:29.800 --> 00:21:30.720
Delete it from there.

00:21:30.720 --> 00:21:31.240
Exactly.

00:21:33.040 --> 00:21:34.800
Delete forever.

00:21:37.020 --> 00:21:38.060
Now try again.

00:21:38.780 --> 00:21:40.900
From the folder, click in.

00:21:41.280 --> 00:21:41.980
Import item.

00:21:42.440 --> 00:21:43.620
Look for the download folder.

00:21:44.340 --> 00:21:45.100
It should be called Job.

00:21:48.220 --> 00:21:49.440
That would be that folder.

00:21:50.620 --> 00:21:51.700
Select folder.

00:21:53.240 --> 00:21:53.840
Exactly.

00:21:56.780 --> 00:21:57.900
Select everything there.

00:21:57.900 --> 00:21:59.720
Click on local project.

00:22:00.180 --> 00:22:02.040
Select everything.

00:22:02.600 --> 00:22:06.360
It is in the AXA folder.

00:22:06.540 --> 00:22:07.960
Open it.

00:22:08.820 --> 00:22:09.740
Wait a moment.

00:22:10.740 --> 00:22:11.200
Perfect.

00:22:11.380 --> 00:22:12.900
If you want, go to the AXA folder.

00:22:14.160 --> 00:22:15.780
Go to the AXA folder.

00:22:15.960 --> 00:22:16.720
It is there.

00:22:18.400 --> 00:22:20.420
Go to the AXA folder.

00:22:20.580 --> 00:22:21.540
Go to the Aprovesamiento Big Data folder.

00:22:22.500 --> 00:22:23.000
And take it.

00:22:23.360 --> 00:22:25.240
And upload it to Aprovesamiento Big Data.

00:22:25.240 --> 00:22:25.400
And upload it.

00:22:25.820 --> 00:22:28.080
And upload it exactly so it stays in the route.

00:22:28.880 --> 00:22:30.520
Now delete the folders that

00:22:30.520 --> 00:22:31.300
were left there.

00:22:31.400 --> 00:22:32.180
Delete it.

00:22:33.020 --> 00:22:35.220
And now go to Job.

00:22:35.300 --> 00:22:36.680
Click on 10 less sold items.

00:22:37.140 --> 00:22:41.240
Finally, copy the whole Job connection

00:22:41.800 --> 00:22:42.920
API Open Eye.

00:22:42.920 --> 00:22:43.700
It has it there.

00:22:44.340 --> 00:22:44.940
Right click.

00:22:45.920 --> 00:22:47.080
We are going to copy everything.

00:22:47.140 --> 00:22:49.980
The best thing is to right click on the blue part.

00:22:50.080 --> 00:22:52.520
Within the blue part of the right click there.

00:22:52.520 --> 00:22:54.520
Copy it.

00:22:55.180 --> 00:22:55.580
Ready.

00:22:56.700 --> 00:22:58.520
And paste it here.

00:22:59.900 --> 00:23:00.340
Ready.

00:23:00.780 --> 00:23:02.800
It stayed more to the left.

00:23:03.160 --> 00:23:03.820
There it is.

00:23:04.660 --> 00:23:06.980
Continue more to the left because I need you to delete something.

00:23:07.940 --> 00:23:09.340
Select the prompt component.

00:23:10.580 --> 00:23:11.700
That prompt that is there.

00:23:11.900 --> 00:23:12.520
Select it.

00:23:12.580 --> 00:23:13.380
Click there.

00:23:13.700 --> 00:23:14.500
And it is going to delete it.

00:23:15.500 --> 00:23:18.520
It is going to delete the prompt connection line

00:23:18.520 --> 00:23:19.720
to the subject.

00:23:19.720 --> 00:23:21.940
Now go down a little bit to the sub job

00:23:21.940 --> 00:23:23.320
because later you won't be able to select the prompt.

00:23:23.340 --> 00:23:25.360
Go down the complete sub job.

00:23:26.040 --> 00:23:27.840
Click on the blue and go down.

00:23:28.280 --> 00:23:29.280
No, directly in the blue.

00:23:30.080 --> 00:23:31.060
No, in the blue.

00:23:31.540 --> 00:23:32.860
There it is.

00:23:33.520 --> 00:23:35.620
Well, there it is.

00:23:35.680 --> 00:23:37.780
It minimized it. Now it maximizes it.

00:23:38.480 --> 00:23:40.600
There in the plus it maximizes it.

00:23:41.060 --> 00:23:42.700
In the plus it has the theme 3.

00:23:42.720 --> 00:23:44.500
In the theme 3 it was minimized.

00:23:44.920 --> 00:23:46.460
In the theme 3, give it more there.

00:23:46.800 --> 00:23:48.780
Better go down a little bit.

00:23:48.780 --> 00:23:50.240
Select it with the click.

00:23:50.580 --> 00:23:50.900
Go down.

00:23:52.360 --> 00:23:54.260
And it is going to delete the prompt.

00:23:54.660 --> 00:23:56.020
There it is.

00:23:56.440 --> 00:23:57.760
Well, let's go then.

00:23:57.880 --> 00:23:59.860
If we continue with this part.

00:24:01.620 --> 00:24:03.160
This is the window.

00:24:03.400 --> 00:24:04.560
Perfect, done.

00:24:05.560 --> 00:24:06.520
I told you then.

00:24:06.720 --> 00:24:09.200
The data that I want to send to Mr. OpenEye

00:24:09.200 --> 00:24:11.140
is in this file that is here.

00:24:11.580 --> 00:24:13.360
I already told you why I didn't send it to Duque.

00:24:13.460 --> 00:24:14.820
It makes sense because it is very small.

00:24:15.700 --> 00:24:17.300
I want to read the data

00:24:17.300 --> 00:24:18.280
of this file.

00:24:18.520 --> 00:24:20.620
I need a tefile

00:24:21.300 --> 00:24:22.420
tefile input.

00:24:22.860 --> 00:24:25.480
We add a tefile input delimited.

00:24:27.320 --> 00:24:29.740
Well, here each one organizes it a little differently.

00:24:31.480 --> 00:24:33.780
I need a theme.

00:24:39.300 --> 00:24:41.440
I am going to configure this tefile input delimited.

00:24:41.620 --> 00:24:43.400
Remember that the inputs and outputs

00:24:43.400 --> 00:24:44.440
need a scheme.

00:24:44.440 --> 00:24:46.040
Like the scheme that I need,

00:24:46.240 --> 00:24:47.540
I have it in the tefile output.

00:24:47.880 --> 00:24:49.240
I go to the tefile output

00:24:49.240 --> 00:24:51.880
and I am going to export the scheme.

00:24:52.900 --> 00:24:54.320
Everything that is input and output

00:24:54.320 --> 00:24:56.080
is going to find this edit scheme.

00:24:57.460 --> 00:24:58.420
What is going to vary is

00:24:58.420 --> 00:24:59.780
the position in which it is going to find it.

00:24:59.860 --> 00:25:01.340
So we give here edit scheme

00:25:02.180 --> 00:25:03.860
and we do the export.

00:25:04.380 --> 00:25:06.000
In this case it would be the

00:25:06.000 --> 00:25:07.180
penultimate button.

00:25:09.260 --> 00:25:10.540
I am in the tefile

00:25:10.540 --> 00:25:11.400
output delimited.

00:25:11.400 --> 00:25:13.040
We are going to call it

00:25:13.040 --> 00:25:15.700
menos vendidos.xml

00:25:15.700 --> 00:25:17.580
Importante el xml

00:25:17.580 --> 00:25:19.340
because then it does not read it.

00:25:19.640 --> 00:25:21.400
Venas menos vendidos.

00:25:23.760 --> 00:25:25.400
Menos vendidos.xml

00:25:26.100 --> 00:25:27.900
We give here save.

00:25:31.400 --> 00:25:32.680
And what we do is import

00:25:32.680 --> 00:25:34.200
that scheme that is already exported.

00:25:34.300 --> 00:25:36.020
We import it in the input.

00:25:37.120 --> 00:25:38.460
Edit scheme.

00:25:40.560 --> 00:25:41.280
Importamos

00:25:41.280 --> 00:25:43.460
y buscamos menos vendidos.

00:25:43.560 --> 00:25:44.820
Ahí está.

00:25:46.140 --> 00:25:46.860
Dígame.

00:25:52.000 --> 00:25:53.880
Ok, aquí exportar.

00:25:54.760 --> 00:25:56.660
Ok, perfecto.

00:25:57.840 --> 00:25:58.140
Listo.

00:26:01.380 --> 00:26:02.260
Bueno, listo.

00:26:02.280 --> 00:26:05.320
Y por último, del tefile output delimited

00:26:05.320 --> 00:26:07.360
copie la ruta del archivo de salida

00:26:07.360 --> 00:26:09.120
y la va a pegar en el input

00:26:09.120 --> 00:26:11.160
porque voy a leer exactamente el mismo archivo.

00:26:11.260 --> 00:26:12.460
Entonces voy al output,

00:26:13.100 --> 00:26:15.360
copio la ruta del archivo de salida

00:26:18.500 --> 00:26:21.440
y es la que

00:26:21.440 --> 00:26:23.380
la aquí

00:26:23.380 --> 00:26:25.020
la borro y la pego aquí.

00:26:26.140 --> 00:26:27.780
En este caso recuerde que

00:26:27.780 --> 00:26:29.180
estos componentes

00:26:29.180 --> 00:26:31.740
cuando es un stream debe llevar comillas doble.

00:26:32.060 --> 00:26:33.100
Solamente hemos visto

00:26:33.100 --> 00:26:34.820
dos casos en que

00:26:34.820 --> 00:26:36.160
solamente, perdón, un caso

00:26:36.820 --> 00:26:38.460
en que no necesita comillas dobles, que es cuando

00:26:38.460 --> 00:26:40.580
creer contextos.

00:26:41.180 --> 00:26:42.580
Cuando dice la parte

00:26:42.580 --> 00:26:43.760
final. Ah, sí.

00:26:44.420 --> 00:26:45.700
Sí, la de salida, sí, exactamente.

00:26:46.340 --> 00:26:48.440
Aunque de verdad que estoy mal que la salida es

00:26:48.440 --> 00:26:49.740
igual a la de salida.

00:26:50.100 --> 00:26:51.180
Es diferente a la salida.

00:26:52.420 --> 00:26:52.860
Exactamente.

00:26:55.200 --> 00:26:56.940
Menos vendidos.xml

00:26:56.940 --> 00:26:57.920
pero está en descarga.

00:26:59.260 --> 00:26:59.960
No está en temporal.

00:27:01.400 --> 00:27:02.680
Tendría que buscarlo después

00:27:02.680 --> 00:27:03.560
que está ahí en descarga.

00:27:06.040 --> 00:27:07.000
Y después

00:27:08.060 --> 00:27:09.020
lo importa

00:27:09.020 --> 00:27:10.800
en el file delimited.

00:27:11.360 --> 00:27:12.980
Parece que le falta agregar el component

00:27:12.980 --> 00:27:14.580
de file input, exactamente,

00:27:14.840 --> 00:27:16.680
de file input delimited.

00:27:17.820 --> 00:27:18.700
Entra al esquema del

00:27:18.700 --> 00:27:20.420
file input delimited. Listo.

00:27:21.240 --> 00:27:23.060
Ahora por favor vamos entonces

00:27:23.060 --> 00:27:24.600
a unir el prom

00:27:24.600 --> 00:27:26.720
con el TMAG4. En este caso

00:27:26.720 --> 00:27:28.380
a mí me marcó como TMAG4.

00:27:28.520 --> 00:27:30.340
No sé si de pronto usted le marcó con el mismo

00:27:30.340 --> 00:27:30.500
nombre.

00:27:32.280 --> 00:27:33.420
Conectamos el prom

00:27:35.340 --> 00:27:36.220
y el

00:27:36.220 --> 00:27:38.060
file input delimited con el tema

00:27:40.740 --> 00:27:42.280
Como el orden puede

00:27:42.280 --> 00:27:44.140
influir con el caso, creo que fue de Luis,

00:27:44.200 --> 00:27:46.200
no se fue Alejandro que de pronto lo colocó en un orden

00:27:46.200 --> 00:27:48.160
diferente. Para saber

00:27:48.160 --> 00:27:50.360
cómo lo vamos a organizar vamos a cambiar el nombre

00:27:50.360 --> 00:27:52.080
de la flecha que es importante como buena

00:27:52.080 --> 00:27:54.540
práctica. La flecha que conecta

00:27:54.540 --> 00:27:56.140
el prom con el TMAG4

00:27:56.140 --> 00:27:58.340
la voy a llamar

00:28:00.480 --> 00:28:01.140
plantilla

00:28:01.140 --> 00:28:02.340
on the record prom

00:28:03.580 --> 00:28:04.240
PT

00:28:04.240 --> 00:28:06.480
Recuerde cómo cambiarla, le da click

00:28:06.480 --> 00:28:07.880
y después nuevamente le da click.

00:28:09.060 --> 00:28:10.600
Y la que conecta

00:28:10.600 --> 00:28:12.840
el file input con el TMAG4

00:28:12.840 --> 00:28:14.620
la voy a llamar resultado

00:28:14.620 --> 00:28:15.740
big data.

00:28:18.080 --> 00:28:20.540
Listo, perfecto. Ya estamos terminando

00:28:20.540 --> 00:28:22.700
esta configuración. No

00:28:22.700 --> 00:28:23.620
queda mucho.

00:28:26.060 --> 00:28:26.700
Vamos ahora

00:28:26.700 --> 00:28:28.960
entonces de aquí donde juega muy importante

00:28:28.960 --> 00:28:30.740
para lo que tenga en cuenta a nivel práctico

00:28:30.740 --> 00:28:32.680
este señor que había colocado como

00:28:32.680 --> 00:28:34.640
porcentaje S. En ese

00:28:34.640 --> 00:28:36.720
porcentaje S voy a reemplazar el

00:28:36.720 --> 00:28:38.140
valor que me devolvió el big data.

00:28:39.240 --> 00:28:40.440
Entonces, ¿cómo hago eso?

00:28:40.460 --> 00:28:42.240
Me voy nuevamente a mi talent

00:28:43.200 --> 00:28:44.200
mi talent

00:28:44.200 --> 00:28:45.400
big data

00:28:46.720 --> 00:28:48.400
voy al TMAG4

00:28:48.400 --> 00:28:51.040
y lo voy a configurar.

00:28:51.920 --> 00:28:52.900
Voy a agregar aquí

00:28:52.900 --> 00:28:53.740
una variable

00:28:54.520 --> 00:28:55.680
el nombre, la variable

00:28:55.680 --> 00:28:56.920
la voy a llamar prompt

00:28:57.740 --> 00:28:59.560
el nombre y va a ser tipo string

00:28:59.560 --> 00:29:01.560
muy pendiente cuando cree esas variables

00:29:01.560 --> 00:29:03.620
porque el tipo tiene que

00:29:03.620 --> 00:29:05.820
coincidir con la expresión.

00:29:06.320 --> 00:29:07.580
Si la expresión devuelve un entero

00:29:07.580 --> 00:29:09.600
pues el tipo tiene que ser entero. En este caso la

00:29:09.600 --> 00:29:11.520
expresión va a volver un string.

00:29:12.860 --> 00:29:13.860
Por último entonces

00:29:13.860 --> 00:29:15.640
vamos aquí a lo siguiente.

00:29:18.180 --> 00:29:19.880
Hay un pequeño

00:29:20.620 --> 00:29:21.920
un momento que no me va

00:29:21.920 --> 00:29:23.980
a salir de la forma en que aquí

00:29:23.980 --> 00:29:26.040
me toca hacer un pequeño cambio. Un momento

00:29:26.040 --> 00:29:27.580
a ver si lo hago aquí directamente.

00:29:33.700 --> 00:29:35.860
Pero un momento a ver, me toca hacer aquí

00:29:35.860 --> 00:29:38.000
un pequeño cambio porque

00:29:38.440 --> 00:29:39.720
de si lo dejo de esta forma

00:29:39.720 --> 00:29:41.720
solamente me va a leer fila por fila

00:29:41.720 --> 00:29:43.920
y como hay tres registros me va a

00:29:43.920 --> 00:29:45.300
hacer tres llamados a lo que hay.

00:29:46.020 --> 00:29:47.780
Me va a tocar hacer aquí un pequeño cambio

00:29:47.780 --> 00:29:49.700
para adaptar que este

00:29:49.700 --> 00:29:51.220
file input delimited

00:29:51.220 --> 00:29:52.120
en realidad

00:29:53.900 --> 00:29:56.580
en realidad que este file input delimited

00:29:56.580 --> 00:29:57.740
solamente me lea

00:29:57.740 --> 00:29:59.840
todos los registros para que envíe

00:29:59.840 --> 00:30:01.760
todos de un solo

00:30:01.760 --> 00:30:03.860
llamado al OpenAI.

00:30:04.120 --> 00:30:05.740
Me va a tocar hacer aquí una pequeña modificación

00:30:05.740 --> 00:30:06.780
disculpa

00:30:07.480 --> 00:30:09.720
vamos aquí a la metadata mejor y lo hacemos

00:30:09.720 --> 00:30:10.240
acá.

00:30:11.400 --> 00:30:14.200
Vaya aquí a file delimited

00:30:14.200 --> 00:30:15.680
y va a crear

00:30:15.680 --> 00:30:17.960
un archivo delimitado acá en el metadata.

00:30:17.960 --> 00:30:18.900
Ahora

00:30:19.720 --> 00:30:21.700
salte un momentico Alejandro, ahí vamos a crear

00:30:21.700 --> 00:30:23.980
en la metadata que aquí fue un error mío aquí

00:30:23.980 --> 00:30:25.320
inicialmente de

00:30:26.360 --> 00:30:28.160
vamos a llamarlo ahora

00:30:28.160 --> 00:30:30.380
a este nuevo archivo, vamos a llamarlo

00:30:30.380 --> 00:30:31.960
salida big data

00:30:34.220 --> 00:30:35.380
big data

00:30:36.440 --> 00:30:37.960
vamos a darle aquí next

00:30:39.760 --> 00:30:41.960
busque por favor el fichero

00:30:43.120 --> 00:30:44.280
donde quedó

00:30:44.280 --> 00:30:46.380
creo que en el caso mío quedó

00:30:46.380 --> 00:30:47.180
sin temporal

00:30:47.180 --> 00:30:50.380
se llama producto menos vendido

00:30:56.040 --> 00:30:58.340
dale por favor next

00:31:01.940 --> 00:31:04.080
no le digas que tiene encabezado

00:31:04.080 --> 00:31:05.200
ya le digo, déjelo así

00:31:05.200 --> 00:31:07.300
sí dígame

00:31:09.100 --> 00:31:11.600
ya voy un momento claro que sí

00:31:14.240 --> 00:31:14.680
listo

00:31:15.240 --> 00:31:15.680
antes

00:31:15.680 --> 00:31:17.260
se lo voy a compartir inmediatamente

00:31:17.260 --> 00:31:19.960
no le coloque encabezado y le voy a decir cuál es el truco

00:31:19.960 --> 00:31:21.920
que quiero hacer. Esto lo pude haber hecho

00:31:21.920 --> 00:31:23.580
de una forma más complicada

00:31:23.580 --> 00:31:25.540
leyendo el archivo pero creo que esta es la forma

00:31:25.540 --> 00:31:27.280
más fácil de hacerlo

00:31:27.720 --> 00:31:29.860
quiero leer todo el archivo

00:31:29.860 --> 00:31:31.660
en un solo llamado

00:31:31.660 --> 00:31:33.240
o sea en una sola fila

00:31:33.240 --> 00:31:35.560
porque si lo dejo así por defecto el componente

00:31:35.560 --> 00:31:38.000
de este file input me lee fila por fila

00:31:38.000 --> 00:31:39.780
ya un momento, ya lo recupero

00:31:39.780 --> 00:31:42.280
entonces lo que quiero es que todos los registros

00:31:42.280 --> 00:31:43.780
se lean en una sola fila

00:31:44.960 --> 00:31:46.120
entonces el truquito

00:31:46.120 --> 00:31:48.160
un momentito ya le pongo esto acá

00:31:48.160 --> 00:31:49.260
ya lo descargo acá

00:31:49.260 --> 00:31:52.120
y lo voy a colocar entonces en talent

00:31:53.680 --> 00:31:54.240
td4

00:31:55.000 --> 00:31:55.560
listo

00:31:55.560 --> 00:31:57.180
puede descargarlo en la carpeta compartida

00:31:57.180 --> 00:31:59.780
mire si ya le aparece del f5

00:32:02.200 --> 00:32:03.020
¿qué te parece?

00:32:06.800 --> 00:32:07.920
déjame para ver

00:32:07.920 --> 00:32:09.860
así, ya está, un momento

00:32:11.720 --> 00:32:13.920
no es cierto un momento que pasó aquí

00:32:15.420 --> 00:32:17.240
es el momento que no lo

00:32:17.920 --> 00:32:19.740
me la veo y ahora mire para ver

00:32:19.740 --> 00:32:21.920
si le aparece o no

00:32:22.560 --> 00:32:23.900
ok, por favor, descargue

00:32:26.340 --> 00:32:27.380
mire lo siguiente

00:32:27.380 --> 00:32:29.760
por defecto dese cuenta que aquí me está leyendo

00:32:29.760 --> 00:32:31.920
en dos columnas y en cuatro filas

00:32:32.600 --> 00:32:34.080
yo quiero que todo el

00:32:34.080 --> 00:32:35.920
contenido me lo lea en una sola fila

00:32:35.920 --> 00:32:37.680
¿por qué? porque quiero mandar

00:32:37.680 --> 00:32:39.560
en una sola request, quiero mandar

00:32:39.560 --> 00:32:41.260
todos esos datos al OpenI

00:32:41.260 --> 00:32:43.160
entonces el truco que voy a hacer es lo siguiente

00:32:43.920 --> 00:32:45.860
en el separador de campo que está aquí

00:32:45.860 --> 00:32:47.540
le voy a colocar aquí

00:32:47.540 --> 00:32:49.880
custom utf8

00:32:49.880 --> 00:32:51.500
y le voy a colocar arroba

00:32:51.500 --> 00:32:53.720
y en el separador de fila

00:32:53.720 --> 00:32:55.760
le voy a colocar custom string

00:32:55.760 --> 00:32:57.860
y también le coloco arroba

00:32:57.860 --> 00:32:59.280
al cambiar esto

00:32:59.280 --> 00:33:02.320
me debe leer todo en una sola fila

00:33:02.320 --> 00:33:03.160
lele refresh

00:33:03.160 --> 00:33:04.600
y mire como lo lee

00:33:05.160 --> 00:33:07.140
lo lee como si fuese una sola fila

00:33:07.140 --> 00:33:09.160
este es el truco para poder enviarlo entonces

00:33:10.780 --> 00:33:11.960
dele aquí por favor

00:33:12.640 --> 00:33:13.160
next

00:33:16.000 --> 00:33:17.480
y cambie el nombre

00:33:17.480 --> 00:33:19.160
de la columna por resultado

00:33:21.840 --> 00:33:23.320
aqui le dejo nuevamente

00:33:23.320 --> 00:33:25.420
lo que cambie fue el separador de campo

00:33:25.420 --> 00:33:27.140
en custom utf8

00:33:27.140 --> 00:33:28.920
y le mande una arroba

00:33:28.920 --> 00:33:31.160
y cambie el separador de fila

00:33:31.160 --> 00:33:32.800
por una arroba

00:33:32.800 --> 00:33:34.420
y tome, entonces que hace eso?

00:33:34.440 --> 00:33:37.160
que todos los datos del archivo lo mande en una sola linea

00:33:39.180 --> 00:33:39.800
next

00:33:41.760 --> 00:33:42.380
aqui

00:33:43.160 --> 00:33:44.460
cambielo al resultado

00:33:46.380 --> 00:33:47.480
y aqui

00:33:48.300 --> 00:33:48.920
finish

00:33:52.120 --> 00:33:53.280
borra ahora

00:33:53.280 --> 00:33:55.300
al final el tefallinput

00:33:55.300 --> 00:33:56.280
que creamos

00:33:56.280 --> 00:33:58.640
y tome el tefallinput del metadata

00:33:59.180 --> 00:34:00.280
que lo llame

00:34:00.280 --> 00:34:01.300
salida big data

00:34:07.920 --> 00:34:08.760
y ahora

00:34:08.760 --> 00:34:10.300
lo une con el tema

00:34:12.900 --> 00:34:13.740
ahora si

00:34:15.060 --> 00:34:16.360
ese lo pude haber hecho

00:34:16.360 --> 00:34:18.200
en realidad eso que hice lo pude haber hecho

00:34:18.200 --> 00:34:20.000
directamente pero lo que pasa es que si lo hacia

00:34:20.640 --> 00:34:22.640
directamente sobre el archivo no voy a entender por que lo hizo

00:34:23.340 --> 00:34:24.820
al final el movimiento

00:34:24.820 --> 00:34:26.300
que hice o la configuracion que hice

00:34:26.300 --> 00:34:28.040
fue simplemente para que todos los datos

00:34:28.040 --> 00:34:30.080
que me arrojara big data

00:34:30.080 --> 00:34:31.880
estuviese en una sola fila

00:34:31.880 --> 00:34:33.840
de tal forma que le mandara un solo request

00:34:33.840 --> 00:34:35.980
todos los datos al señor

00:34:35.980 --> 00:34:37.020
open eye

00:34:38.720 --> 00:34:39.680
si digame

00:34:40.300 --> 00:34:41.980
menos vendido

00:34:41.980 --> 00:34:43.620
el menos vendido

00:34:43.620 --> 00:34:45.200
creo que usted no lo tiene

00:34:45.200 --> 00:34:47.300
porque usted no lo ha ejecutado

00:34:47.300 --> 00:34:49.880
asi como el caso de, no tenia ese archivo

00:34:49.880 --> 00:34:52.500
entonces descargue lo por favor, haga lo siguiente

00:34:53.040 --> 00:34:54.620
descargue lo del

00:34:54.620 --> 00:34:56.620
de la carpeta compartida

00:34:57.380 --> 00:34:58.780
que verdad que usted no lo ha ejecutado

00:34:58.780 --> 00:35:01.000
todavia, usted no lo tiene

00:35:01.000 --> 00:35:02.400
dale este 5 por favor

00:35:02.400 --> 00:35:04.540
ahi esta producto menos vendido, ese que esta ahi

00:35:04.540 --> 00:35:06.620
descargue lo por favor, si esta en aval de

00:35:07.280 --> 00:35:08.740
lo descargue en el temporal

00:35:09.380 --> 00:35:11.680
bueno, deje de ir en descarga, no hay problema

00:35:11.680 --> 00:35:13.020
vayas entonces al

00:35:13.020 --> 00:35:13.880
tal en big data

00:35:14.580 --> 00:35:16.440
y busque en descargas ahora si

00:35:16.440 --> 00:35:18.840
voy a mostrar como va abriendo de pronto alguno que va por aqui

00:35:18.840 --> 00:35:20.860
abra por favor ahi, dale next

00:35:20.860 --> 00:35:22.660
ahora cambie el separador

00:35:22.660 --> 00:35:23.320
separador de campo

00:35:23.320 --> 00:35:26.760
esa codificacion esta bien porque esta en ingles no hay mucho problema

00:35:26.760 --> 00:35:28.320
separador de cambio

00:35:28.320 --> 00:35:30.240
dele click y escoja la ultima opcion

00:35:30.240 --> 00:35:32.560
que es custom utf8

00:35:32.560 --> 00:35:34.500
y cambielo para arroba

00:35:34.500 --> 00:35:36.420
porque lo cambio por arroba porque el carater

00:35:36.420 --> 00:35:38.340
actualmente es punto y coma entonces

00:35:38.340 --> 00:35:39.880
para que me de una sola columna

00:35:39.880 --> 00:35:42.060
y el separador de fila

00:35:42.060 --> 00:35:44.060
dele custom nuevamente custom

00:35:44.060 --> 00:35:46.960
y pongale ahi

00:35:46.960 --> 00:35:48.160
y cambielo arroba

00:35:48.160 --> 00:35:50.140
ese slash n lo cambia arroba

00:35:50.140 --> 00:35:52.320
dele un refresh y deberia entonces

00:35:52.320 --> 00:35:55.280
traer todos los datos en una sola fila

00:35:55.280 --> 00:35:55.900
perfecto

00:35:55.900 --> 00:35:58.340
dele next ahora y llame la columna resultado

00:35:58.340 --> 00:36:00.060
la columna 0 que esta y llamela

00:36:00.060 --> 00:36:02.140
para que tenga mas sentido resultado

00:36:02.140 --> 00:36:04.580
dele enter ahora listo dele finish ahora

00:36:04.580 --> 00:36:06.040
ahora en el job borre

00:36:06.040 --> 00:36:08.640
ese te file input del limited borrelo por favor

00:36:08.640 --> 00:36:10.740
ese te file input del limited del job

00:36:10.740 --> 00:36:12.220
lo borra si el te file

00:36:12.220 --> 00:36:13.800
input del limited que esta y lo borra

00:36:14.300 --> 00:36:16.080
suprimir dele con el

00:36:16.080 --> 00:36:18.140
tecle suprimir y se arroba listo

00:36:18.140 --> 00:36:20.080
ahora vais a la metadata

00:36:20.080 --> 00:36:21.280
del file del limited

00:36:22.140 --> 00:36:23.940
y busque el archivo que se llama salida big data

00:36:23.940 --> 00:36:26.280
lo suelte en el canvas y une

00:36:26.280 --> 00:36:28.140
ese señor con el te map

00:36:29.460 --> 00:36:30.960
este file input

00:36:30.960 --> 00:36:32.400
ahora une el señor

00:36:32.400 --> 00:36:34.500
con el te map, cambie el row 7

00:36:34.500 --> 00:36:36.400
por resultado de score big data

00:36:36.400 --> 00:36:37.820
el row 7

00:36:37.820 --> 00:36:40.460
sabe como cambiarlo la linea

00:36:41.460 --> 00:36:42.580
nuevamente click

00:36:42.580 --> 00:36:44.080
y lo llama resultado

00:36:44.080 --> 00:36:45.040
de score big data

00:36:46.560 --> 00:36:47.840
big data

00:36:47.840 --> 00:36:49.100
big data

00:36:49.100 --> 00:36:51.540
enter y ahora enter al te map

00:36:51.540 --> 00:36:53.040
y es por donde estamos entonces

00:36:54.860 --> 00:36:56.540
listo un momento por favor

00:36:56.540 --> 00:36:58.480
le hago aqui el cambio

00:36:59.320 --> 00:36:59.960
resultado

00:36:59.960 --> 00:37:02.460
under score big data

00:37:02.460 --> 00:37:03.800
vamos aqui al te

00:37:03.800 --> 00:37:04.960
al te map 4

00:37:05.840 --> 00:37:07.140
y ahora si

00:37:07.140 --> 00:37:08.380
esta como yo quiero

00:37:09.040 --> 00:37:11.040
entonces vamos aqui a la expresion

00:37:11.880 --> 00:37:12.860
y recuerde que

00:37:12.860 --> 00:37:14.380
para poder modificar ese

00:37:14.380 --> 00:37:16.520
porcentaje ese voy a hacer la siguiente funcion

00:37:16.520 --> 00:37:18.860
string.format

00:37:20.480 --> 00:37:22.220
le va a mandar entonces

00:37:22.220 --> 00:37:24.840
la plantilla como primera opcion

00:37:26.160 --> 00:37:28.500
y despues le va a mandar resultado big data

00:37:28.500 --> 00:37:30.220
entonces como te va a quedar esto en grande

00:37:30.220 --> 00:37:31.340
te va a quedar de esta manera

00:37:31.340 --> 00:37:32.720
te lo voy a colocar

00:37:33.720 --> 00:37:37.220
te va a quedar esto asi

00:37:39.000 --> 00:37:40.500
lo que debes copiar es

00:37:40.500 --> 00:37:42.580
string.format

00:37:42.580 --> 00:37:44.600
ya el nombre de la variable

00:37:44.600 --> 00:37:45.760
lo toma directamente

00:37:46.940 --> 00:37:48.800
del te block que esta al ladito

00:37:48.800 --> 00:37:50.060
en una variable

00:37:50.060 --> 00:37:53.180
quien me hablo para guiarla mejor

00:37:53.180 --> 00:37:54.500
quien me hablo

00:37:55.140 --> 00:37:56.440
Beatriz vamos entonces

00:37:57.320 --> 00:37:57.880
exactamente

00:37:58.500 --> 00:37:59.440
entonces vamos a ver Beatriz

00:38:00.560 --> 00:38:02.560
entra al te map exactamente

00:38:02.560 --> 00:38:03.940
dentro de la expresion por favor

00:38:03.940 --> 00:38:05.820
dele click en la expresion en la primera

00:38:07.260 --> 00:38:08.800
y copia string

00:38:08.800 --> 00:38:09.740
con ese mayuscula

00:38:09.740 --> 00:38:11.900
string.format

00:38:12.840 --> 00:38:14.700
ahi esta el primerito

00:38:15.540 --> 00:38:16.040
ahora

00:38:17.300 --> 00:38:19.060
reemplaza ese null por plantilla

00:38:19.060 --> 00:38:20.800
pero donde el lado derecho estan las variables

00:38:22.160 --> 00:38:22.660
mejor

00:38:22.660 --> 00:38:24.500
tome la rayada que sale mas facil

00:38:25.940 --> 00:38:26.800
doble click ahi

00:38:27.820 --> 00:38:29.000
selecciona el siguiente null

00:38:29.000 --> 00:38:30.380
doble click en el siguiente null

00:38:30.380 --> 00:38:32.220
y vaya el resultado ese que esta ahi

00:38:34.660 --> 00:38:35.160
perfecto

00:38:35.160 --> 00:38:36.040
ya quedo esta ahi

00:38:36.040 --> 00:38:37.220
ok listo

00:38:37.220 --> 00:38:40.080
por favor si esta mal yo tambien lo voy a dar aqui

00:38:41.820 --> 00:38:42.380
ok

00:38:43.780 --> 00:38:44.340
ok

00:38:44.340 --> 00:38:46.040
voy a crear aqui una salida

00:38:46.800 --> 00:38:47.800
una salida

00:38:47.800 --> 00:38:49.880
que la voy a llamar

00:38:49.880 --> 00:38:51.540
prompt pt

00:38:51.540 --> 00:38:54.040
salida prompt pt

00:38:55.260 --> 00:38:56.980
aqui le da en el mas de aca arriba

00:38:57.940 --> 00:39:02.620
y manda la variable hacia esa salida

00:39:03.800 --> 00:39:04.920
perfecto prompt

00:39:04.920 --> 00:39:06.640
y manda la variable

00:39:11.020 --> 00:39:13.320
le damos aqui ok

00:39:13.320 --> 00:39:15.940
y unimos el tema 4

00:39:15.940 --> 00:39:17.020
con el tema 3

00:39:17.020 --> 00:39:19.620
es decir entonces vamos a unir este señor

00:39:19.620 --> 00:39:21.540
por la salida

00:39:21.540 --> 00:39:23.260
dele click derecho porque despues no le sale

00:39:23.880 --> 00:39:25.560
situa en el tema click derecho

00:39:25.560 --> 00:39:27.360
y debe aparecerle cuando aparece en fila

00:39:27.360 --> 00:39:28.920
debe aparecerle la salida que configuro

00:39:29.520 --> 00:39:30.880
tome esa salida prompt

00:39:30.880 --> 00:39:32.420
y la suelta en el tema 3

00:39:35.700 --> 00:39:37.220
listo ahi

00:39:37.220 --> 00:39:38.800
tema 3 la suelta

00:39:38.800 --> 00:39:40.760
ya nos falta ya una confersion

00:39:40.760 --> 00:39:42.940
sencilla que es la del tema 3

00:39:42.940 --> 00:39:44.920
voy aqui al tema 3

00:39:48.220 --> 00:39:49.880
y en realidad

00:39:49.880 --> 00:39:51.160
aqui

00:39:51.160 --> 00:39:52.920
listo modifico lo siguiente

00:39:54.920 --> 00:39:56.920
aqui como no tengo los nombres de

00:39:57.740 --> 00:39:58.500
prompt

00:39:59.520 --> 00:40:00.280
payload

00:40:01.420 --> 00:40:02.180
rose

00:40:02.180 --> 00:40:04.020
la madre rose

00:40:06.060 --> 00:40:09.420
expande la expresion esta por favor

00:40:12.060 --> 00:40:13.640
mire por favor como llamo

00:40:13.640 --> 00:40:16.120
este es lo importante de colocarle los nombres

00:40:16.120 --> 00:40:18.080
a estas filas

00:40:18.080 --> 00:40:20.400
mire como el nombre de esta fila que esta aqui

00:40:20.400 --> 00:40:22.080
la segunda que esta aqui en el caso mio

00:40:22.080 --> 00:40:22.500
rose

00:40:24.480 --> 00:40:25.920
entonces nada mas cambio

00:40:25.920 --> 00:40:28.060
aca tengo rose 5 y cambio rose 5

00:40:28.060 --> 00:40:30.640
cambio el 5 por 6 para que lo una

00:40:30.640 --> 00:40:32.680
mire como se llama aca

00:40:32.680 --> 00:40:34.660
y cambia el encabezado

00:40:34.660 --> 00:40:36.120
porque la variedad se llama igual

00:40:46.700 --> 00:40:48.060
rose 6

00:40:48.640 --> 00:40:51.120
y cambia

00:40:51.120 --> 00:40:52.800
momento momento momento

00:40:52.800 --> 00:40:54.600
perdon no no es rose 6

00:40:54.600 --> 00:40:55.460
es la de arriba

00:40:55.460 --> 00:40:57.460
es que

00:40:58.280 --> 00:40:59.440
haga lo siguiente

00:40:59.440 --> 00:41:00.640
para no equivocarlo

00:41:01.460 --> 00:41:02.880
con estos nombres me equivoque

00:41:03.460 --> 00:41:05.840
vamos a nombrarlo bien

00:41:05.840 --> 00:41:07.460
cierre un momento y vamos a nombrarlo bien

00:41:08.760 --> 00:41:09.960
el payload

00:41:09.960 --> 00:41:11.960
que une con tema 3

00:41:11.960 --> 00:41:13.600
cambie ese nombre a payload

00:41:13.600 --> 00:41:15.480
para ubicarme bien

00:41:15.480 --> 00:41:17.200
la linea que une

00:41:17.200 --> 00:41:19.860
payload con tema 3 cambiela a payload

00:41:19.860 --> 00:41:21.460
si si

00:41:22.000 --> 00:41:22.660
payload

00:41:24.340 --> 00:41:25.580
para ubicarme

00:41:25.580 --> 00:41:26.660
mejor

00:41:26.660 --> 00:41:28.780
que ese nombre road 1 road 3

00:41:28.780 --> 00:41:30.680
te pega una vez en una partida 1

00:41:30.680 --> 00:41:32.920
road payload listo

00:41:32.920 --> 00:41:34.660
y ahora si vamos a entrar a configurarlo

00:41:38.700 --> 00:41:40.800
bien aqui esta bien

00:41:41.920 --> 00:41:43.200
y en realidad

00:41:43.200 --> 00:41:44.400
la segunda

00:41:44.400 --> 00:41:46.840
la segunda columna no es payload

00:41:46.840 --> 00:41:48.560
sino prompt

00:41:50.040 --> 00:41:51.040
que seria

00:41:51.040 --> 00:41:51.800
la de arriba

00:41:51.800 --> 00:41:52.480
prompt

00:41:54.900 --> 00:41:56.380
ahora si prompt

00:41:56.380 --> 00:41:59.080
porque el payload va aca y el prompt va de segundo

00:42:00.280 --> 00:42:01.420
para verificar

00:42:01.420 --> 00:42:02.960
que todo esta bien

00:42:02.960 --> 00:42:05.540
deberia aparecer 2 lineas

00:42:05.540 --> 00:42:07.040
uniéndose a la variable

00:42:07.040 --> 00:42:08.960
aqui esta una linea amarilla

00:42:08.960 --> 00:42:11.280
de la parte abajo y una linea amarilla de la parte arriba

00:42:11.280 --> 00:42:13.040
si no estan esta mal configurado

00:42:15.520 --> 00:42:16.720
solamente cambie

00:42:16.720 --> 00:42:18.480
la segunda

00:42:19.080 --> 00:42:19.560
encabezado

00:42:19.560 --> 00:42:21.480
lo cambia a promptpt

00:42:22.180 --> 00:42:23.760
y yo creo que ya

00:42:23.760 --> 00:42:25.640
verifique por favor cuando termine aqui

00:42:25.640 --> 00:42:27.760
que las 2 filas

00:42:27.760 --> 00:42:29.200
las 2 entradas

00:42:29.200 --> 00:42:32.140
se unan en la variable con una linea amarilla

00:42:32.140 --> 00:42:34.080
diciendo que estan relacionadas a esas variables

00:42:34.080 --> 00:42:35.280
la entrada con la variable

00:42:35.280 --> 00:42:37.880
vamos a darle aqui ok

00:42:39.660 --> 00:42:40.140
listo

00:42:40.140 --> 00:42:42.360
no lo hago usted aca lo voy a hacer solamente yo aca

00:42:42.360 --> 00:42:43.740
voy a agregar aqui un telo road

00:42:43.740 --> 00:42:46.560
para que me muestre como quedo el payload

00:42:47.360 --> 00:42:47.880
anterior

00:42:48.820 --> 00:42:50.520
y ya lo ultimo que va a hacer

00:42:50.520 --> 00:42:52.400
usted nada mas alla es unir

00:42:52.400 --> 00:42:53.780
los subjogs

00:42:53.780 --> 00:42:56.880
para que se ejecute primero el de arriba y despues el de abajo

00:42:56.880 --> 00:42:58.820
como unimos los subjogs

00:42:58.820 --> 00:43:00.380
va al primer elemento del

00:43:00.380 --> 00:43:02.940
subjog 1 que quiere unir

00:43:02.940 --> 00:43:04.440
y lo une con el

00:43:04.440 --> 00:43:06.420
disparador o un subjog ok con el primer

00:43:06.420 --> 00:43:08.100
componente del subjog 2

00:43:08.100 --> 00:43:10.560
en este caso primero tiene que ejecutar

00:43:11.080 --> 00:43:12.080
el input de arriba

00:43:12.080 --> 00:43:14.160
para que de la salida y despues

00:43:14.160 --> 00:43:15.540
se ejecuta hacia lo que hay

00:43:15.540 --> 00:43:16.280
voy para ver

00:43:16.280 --> 00:43:19.100
vamos a ver

00:43:19.100 --> 00:43:20.400
ya se rojo

00:43:22.940 --> 00:43:23.420
perfecto

00:43:23.420 --> 00:43:25.340
la pregunta es que hay veces que

00:43:25.340 --> 00:43:27.040
se queda

00:43:27.760 --> 00:43:30.400
por la conexion

00:43:30.400 --> 00:43:32.160
bueno voy a probar

00:43:34.720 --> 00:43:35.840
voy a ejecutar

00:43:35.840 --> 00:43:37.500
voy a comprobar que este sea el jog

00:43:38.160 --> 00:43:39.200
listar 10 articulos

00:43:39.900 --> 00:43:40.620
vamos a ver

00:43:40.620 --> 00:43:42.600
vamos a ejecutarlo inicialmente para ver

00:43:43.540 --> 00:43:44.780
y digo para ver

00:43:44.780 --> 00:43:46.560
si hay algun error aqui

00:43:49.760 --> 00:43:50.400
bueno

00:43:50.400 --> 00:43:52.620
empezó haciendo la parte de big data

00:43:53.400 --> 00:43:54.660
leyendo directamente

00:43:54.660 --> 00:43:56.620
del dejado vamos a ver

00:43:57.780 --> 00:43:58.640
empezó a hacer

00:43:58.640 --> 00:44:00.620
el pron listo

00:44:01.180 --> 00:44:01.720
ah

00:44:03.960 --> 00:44:04.880
que pena

00:44:04.880 --> 00:44:05.600
un momento

00:44:06.400 --> 00:44:08.520
se nos olvido cambiar

00:44:08.520 --> 00:44:10.600
el archivo

00:44:10.600 --> 00:44:12.520
pron que lo modificamos creamos uno nuevo

00:44:14.200 --> 00:44:15.120
se nos olvido

00:44:15.120 --> 00:44:16.820
se nos olvido a mi

00:44:16.820 --> 00:44:17.820
vayase al pron

00:44:17.820 --> 00:44:20.380
pron pt el archivo

00:44:20.380 --> 00:44:22.100
y note que esta

00:44:22.920 --> 00:44:24.880
direccionando al pron del día de ayer

00:44:24.880 --> 00:44:26.300
hoy hicimos un nuevo archivo

00:44:26.300 --> 00:44:28.720
para cambiarlo nada mas vaya aqui donde dice

00:44:28.720 --> 00:44:30.460
los 3 punticos al final

00:44:32.840 --> 00:44:34.840
y cuando le aparezca este

00:44:34.840 --> 00:44:35.640
dele chain

00:44:35.640 --> 00:44:37.500
no le de un day sino chain

00:44:37.500 --> 00:44:39.440
y el le va a permitir cambiar

00:44:39.440 --> 00:44:41.300
la ubicacion del archivo

00:44:42.060 --> 00:44:43.640
nuevamente aqui le doy

00:44:44.620 --> 00:44:45.660
despues que doy chain

00:44:45.660 --> 00:44:47.460
tengo que dar nuevamente click

00:44:47.460 --> 00:44:49.560
y busco el nuevo pron

00:44:49.560 --> 00:44:51.640
que yo lo llame

00:44:54.400 --> 00:44:57.740
pron articulos menos vendidos

00:45:00.600 --> 00:45:01.200
listo

00:45:01.200 --> 00:45:01.860
porque estaba

00:45:01.860 --> 00:45:03.680
ejecutando el pron anterior

00:45:03.680 --> 00:45:06.360
vamos a ver ahora con este nuevo pron

00:45:06.360 --> 00:45:07.680
a ver si

00:45:10.160 --> 00:45:11.520
si aqui lo mando

00:45:11.520 --> 00:45:12.240
no chain

00:45:13.800 --> 00:45:14.900
quien me dijo

00:45:14.900 --> 00:45:16.000
Brenda fue Brenda

00:45:17.360 --> 00:45:19.060
voy a para que lo muestre

00:45:20.460 --> 00:45:21.680
dele chain primero

00:45:22.960 --> 00:45:23.780
y listo

00:45:23.780 --> 00:45:25.180
ahora escoja el nuevo pron

00:45:26.060 --> 00:45:27.460
que seguramente lo tienen

00:45:27.460 --> 00:45:28.580
credenciales open eye

00:45:28.580 --> 00:45:31.180
ese menos vendido listo

00:45:31.180 --> 00:45:31.940
dele abrir

00:45:32.680 --> 00:45:34.500
y dele ejecutar usted inicialmente

00:45:34.500 --> 00:45:37.300
para ver si no tiene ningun problema

00:45:37.300 --> 00:45:39.420
tengo un pequeño error

00:45:39.420 --> 00:45:41.180
el pron esta bien

00:45:41.700 --> 00:45:42.860
pero tengo un error

00:45:42.860 --> 00:45:44.060
vamos a ver si a todo le sale el mismo error

00:45:44.060 --> 00:45:45.800
me quede con la ventana aqui de Beatriz

00:45:45.800 --> 00:45:47.940
para ver si le sale el mismo error

00:45:48.480 --> 00:45:50.100
hay algunos que ya estan ejecutando

00:45:50.100 --> 00:45:51.320
vamos a ver

00:45:51.320 --> 00:45:53.280
vamos con Brenda tiene un error

00:45:55.780 --> 00:45:56.220
exactamente

00:45:56.220 --> 00:45:58.660
dele ok

00:45:58.660 --> 00:45:59.820
para buscarlo

00:46:00.660 --> 00:46:01.720
vaya donde dice code

00:46:01.720 --> 00:46:04.140
vaya al code del job

00:46:04.140 --> 00:46:06.200
y debe marcarle algun punto rojo

00:46:06.200 --> 00:46:07.340
que esta mal configurado

00:46:09.140 --> 00:46:11.000
payload dice que no existe eso

00:46:11.000 --> 00:46:12.020
entonces vayamos

00:46:12.020 --> 00:46:14.220
eso esta en el tema

00:46:14.220 --> 00:46:15.460
vayas al designer

00:46:15.460 --> 00:46:18.280
y entre al tema que une

00:46:18.280 --> 00:46:19.720
ese tema 4 que esta ahi

00:46:19.720 --> 00:46:21.520
ah no perdon es en el 3

00:46:23.920 --> 00:46:24.480
bien

00:46:24.480 --> 00:46:25.780
dese cuenta lo siguiente

00:46:25.780 --> 00:46:27.260
lo que pasa es que

00:46:27.260 --> 00:46:29.260
en primer lugar dese cuenta que row 8

00:46:29.260 --> 00:46:30.700
no esta en amarillo verdad

00:46:31.580 --> 00:46:33.200
y no esta

00:46:33.780 --> 00:46:34.940
no esta porque

00:46:34.940 --> 00:46:37.040
porque ahi su conexion se llama

00:46:37.040 --> 00:46:38.440
row 8

00:46:38.440 --> 00:46:40.160
significa en pocas palabras lo siguiente

00:46:40.160 --> 00:46:43.200
vamos a hacer lo siguiente

00:46:44.160 --> 00:46:45.120
dele ok

00:46:45.780 --> 00:46:46.920
dele ok mejor ahi

00:46:46.920 --> 00:46:48.760
y vamos a cambiar el nombre de la linea

00:46:48.760 --> 00:46:49.560
dele ok

00:46:49.560 --> 00:46:51.320
y la linea que conecta

00:46:51.320 --> 00:46:52.880
baja un poquito por favor

00:46:52.880 --> 00:46:54.920
esa linea que conecta ese row 8

00:46:54.920 --> 00:46:57.060
que esta ahi cambia el nombre a payload

00:46:57.060 --> 00:46:58.060
por favor si estan amable

00:46:58.060 --> 00:47:01.320
ahora si entre al tema 3 y asi nos vamos a ubicar

00:47:02.020 --> 00:47:03.040
listo tema 3

00:47:03.040 --> 00:47:05.320
doble click por favor listo

00:47:06.480 --> 00:47:07.700
entre si estan amable

00:47:07.700 --> 00:47:08.720
ahora si

00:47:08.720 --> 00:47:11.220
dese cuenta que ahora si le marco bien

00:47:11.220 --> 00:47:13.880
payload y prom ahora si esta bien

00:47:13.880 --> 00:47:15.660
dele ok vuelve a ejecutar

00:47:15.660 --> 00:47:16.840
parece que ese nomas le da error

00:47:16.840 --> 00:47:17.880
vuelve a ejecutar nuevamente

00:47:17.880 --> 00:47:20.660
vamos a ver que error le sale a usted

00:47:20.660 --> 00:47:21.920
empezó ya

00:47:21.920 --> 00:47:23.460
la parte de big data

00:47:27.000 --> 00:47:28.000
por donde va

00:47:28.000 --> 00:47:28.680
se esta demorando

00:47:28.680 --> 00:47:31.840
por donde va

00:47:31.840 --> 00:47:33.880
subo un poquito por donde va

00:47:38.420 --> 00:47:39.380
esta por ahi

00:47:39.380 --> 00:47:41.540
que raro que haya demorado en ese punto

00:47:41.540 --> 00:47:43.880
note que ahi

00:47:44.460 --> 00:47:46.240
aparece starting

00:47:46.240 --> 00:47:48.140
si nota que esta en ese punto

00:47:48.140 --> 00:47:50.020
ahi que se ha demorado que raro

00:47:50.020 --> 00:47:52.240
que se haya demorado de esa forma

00:47:52.240 --> 00:47:54.380
espera un momento para ver si de pronto

00:47:54.380 --> 00:47:55.980
ahora vuelvo con usted

00:47:55.980 --> 00:47:58.620
no se si alguno le salio

00:48:01.080 --> 00:48:02.640
y que le salio

00:48:02.640 --> 00:48:03.900
muestre para ver

00:48:03.900 --> 00:48:05.440
ya voy para ahi

00:48:06.320 --> 00:48:08.500
listo vayase al designer

00:48:08.500 --> 00:48:10.240
porque le si copio el

00:48:11.340 --> 00:48:12.240
subo por favor

00:48:14.300 --> 00:48:16.120
vayase al tema 2

00:48:16.120 --> 00:48:18.760
tema 2

00:48:19.380 --> 00:48:20.500
ese que esta ahi

00:48:21.160 --> 00:48:22.960
no se porque esta ahi

00:48:22.960 --> 00:48:24.260
pero bueno

00:48:24.260 --> 00:48:26.560
note el product line de

00:48:26.560 --> 00:48:27.600
hay dos tablas

00:48:27.600 --> 00:48:29.940
la izquierda y la derecha que esta en la parte de abajo

00:48:30.720 --> 00:48:32.280
la tabla de la derecha

00:48:32.280 --> 00:48:33.860
le dice los archivos de salida

00:48:33.860 --> 00:48:35.940
ese product line tiene que ser a intero

00:48:35.940 --> 00:48:38.020
no se porque se cambio a este

00:48:38.020 --> 00:48:39.860
cambielo ahi a intero

00:48:39.860 --> 00:48:41.820
que es tan amable

00:48:41.820 --> 00:48:43.080
dele enter

00:48:43.840 --> 00:48:45.020
listo dele ok

00:48:45.020 --> 00:48:47.220
y ahora vuelve a ejecutar para ver

00:48:47.220 --> 00:48:49.360
Brenda detenga ese job

00:48:49.360 --> 00:48:52.160
mate los kills y vuelve a ejecutar nuevamente porque esta como rara

00:48:52.160 --> 00:48:53.800
ya no esta que demora tanto

00:48:53.800 --> 00:48:55.780
bueno creo que hay algunos listo vamos a ver

00:48:55.780 --> 00:48:57.860
ok dele ok un momento ahi

00:48:58.980 --> 00:48:59.100
ok

00:48:59.100 --> 00:49:01.620
entra nuevamente al tema 2 a ver que paso ahi

00:49:01.620 --> 00:49:04.300
porque hay una conversion extraña al tema 2

00:49:10.140 --> 00:49:12.520
no que pena ahi me equivoque

00:49:12.520 --> 00:49:14.900
fue yo ahi en realidad el stream estaba bien

00:49:14.900 --> 00:49:16.240
deje el stream por favor

00:49:16.240 --> 00:49:17.760
deje yo acceder un momento

00:49:17.760 --> 00:49:19.960
que fue rovino

00:49:19.960 --> 00:49:21.320
un momento enter

00:49:22.060 --> 00:49:23.860
ok ahi nuevamente

00:49:26.440 --> 00:49:28.060
yes no el cuento

00:49:28.060 --> 00:49:29.760
era con

00:49:29.760 --> 00:49:30.900
ya se

00:49:32.920 --> 00:49:33.820
accedo un momento

00:49:33.820 --> 00:49:35.620
espere un momento que accedo al studio

00:49:36.820 --> 00:49:38.100
porque el problema

00:49:38.100 --> 00:49:40.040
creo que con el tipo de datos no era doble

00:49:40.040 --> 00:49:41.700
era que era

00:49:41.700 --> 00:49:44.260
un momento a ver

00:49:44.260 --> 00:49:46.000
este señor que esta aqui

00:49:47.440 --> 00:49:48.040
es

00:49:48.040 --> 00:49:49.860
total doble no veo ningun problema

00:49:51.400 --> 00:49:52.380
permíteme verlo

00:49:52.380 --> 00:49:54.340
el error está

00:49:55.520 --> 00:49:56.120
en

00:49:56.120 --> 00:49:57.980
dice que es stream ah ya

00:49:57.980 --> 00:50:00.340
aqui hay algo

00:50:00.340 --> 00:50:02.500
extrañamente raro ya se que le pasa

00:50:02.500 --> 00:50:03.640
este

00:50:03.640 --> 00:50:06.320
esta fila tenia que mandarla

00:50:06.320 --> 00:50:06.740
aca

00:50:07.480 --> 00:50:09.780
para cambiar la posición va a colocarla de primero

00:50:09.780 --> 00:50:11.620
simplemente por cuestión de gusto

00:50:13.560 --> 00:50:14.120
ok

00:50:14.120 --> 00:50:15.800
y no se porque

00:50:15.800 --> 00:50:17.800
en ese job pasa esto aqui

00:50:17.800 --> 00:50:19.900
no era pro online sino

00:50:19.900 --> 00:50:22.220
era rovno voy a guardar

00:50:22.220 --> 00:50:23.700
y verificar que el cambio

00:50:23.700 --> 00:50:25.720
esta aqui listo no hay ningun error

00:50:25.720 --> 00:50:27.700
voy a proceder a ejecutar

00:50:28.300 --> 00:50:30.020
perfecto listo este es el error que tengo

00:50:30.020 --> 00:50:31.840
un cuarto ciento de menos deja ahi por favor

00:50:31.840 --> 00:50:34.180
brenda usted tiene la máquina virtual arriba

00:50:34.180 --> 00:50:35.540
que pronto toda tiene arriba

00:50:36.700 --> 00:50:38.460
ah un momento ya voy aqui

00:50:38.460 --> 00:50:40.140
prendo el caso suyo

00:50:40.140 --> 00:50:41.540
se quedo ahi verdad

00:50:41.540 --> 00:50:44.520
permita un momento voy a acceder a su máquina de manera

00:50:44.520 --> 00:50:46.280
interactiva y voy a comprobar un par de

00:50:46.800 --> 00:50:48.260
listo voy a ver algo

00:50:48.260 --> 00:50:50.260
aqui no es que la máquina no tiene encendida

00:50:50.260 --> 00:50:52.080
entonces intentaba conectarte a esa máquina

00:50:52.080 --> 00:50:53.860
y por eso se bloqueo

00:50:53.860 --> 00:50:56.100
vamos a encenderla aqui iniciar y

00:50:56.100 --> 00:50:57.980
voy a bloquear un momento

00:50:57.980 --> 00:51:00.300
listo ya esta esperamos un momento

00:51:00.300 --> 00:51:02.360
vamos a ver vamos a ejecutar entonces

00:51:02.360 --> 00:51:03.780
ahora si nuevamente por haber

00:51:04.360 --> 00:51:06.360
ahora si vean este cuenta que va por aqui

00:51:07.800 --> 00:51:08.660
listo este era el error

00:51:08.660 --> 00:51:10.360
que queria que no saliera listo

00:51:11.600 --> 00:51:12.760
quien mas tiene errores

00:51:12.760 --> 00:51:14.780
diferentes a este cuatrocientos que nos devolvio

00:51:14.780 --> 00:51:16.360
nadie mas listo voy a corregir este error

00:51:16.360 --> 00:51:18.080
cuatrocientos que todavia no se que vamos a ver

00:51:18.080 --> 00:51:19.960
cuatrocientos que dice

00:51:21.920 --> 00:51:22.320
ahhhh

00:51:22.320 --> 00:51:24.480
este es el de Beatriz este es el mio

00:51:26.180 --> 00:51:27.380
a ver el

00:51:27.380 --> 00:51:28.500
cuatrocientos por que se da

00:51:28.500 --> 00:51:30.500
en pocas palabras me esta diciendo que mande

00:51:30.500 --> 00:51:32.240
de forma RadalJason

00:51:32.800 --> 00:51:34.500
este es el payload que estoy enviando

00:51:36.620 --> 00:51:38.620
ah listo lo que pasa es que este señor

00:51:38.620 --> 00:51:40.100
OpenAI es un poquito sensible

00:51:40.820 --> 00:51:42.800
que tiene que tener en cuenta lo siguiente

00:51:42.800 --> 00:51:44.380
dese cuenta que yo modifique

00:51:44.380 --> 00:51:46.500
la lectura del archivo

00:51:47.200 --> 00:51:48.700
pero no modifique el archivo

00:51:48.700 --> 00:51:50.880
como tal el archivo es de salida big data

00:51:51.380 --> 00:51:52.740
y parece ser que

00:51:52.740 --> 00:51:54.020
este señor no le gusta

00:51:55.020 --> 00:51:56.500
los saltos de linea

00:51:58.220 --> 00:51:58.700
entonces

00:51:58.700 --> 00:52:00.560
que vamos a hacer

00:52:00.560 --> 00:52:01.760
vallase al tema

00:52:02.700 --> 00:52:04.700
parece que es eso vallase al tema 3

00:52:05.860 --> 00:52:07.540
perdon al tema 4

00:52:07.540 --> 00:52:07.620
perdon al tema 4

00:52:09.540 --> 00:52:10.700
perdon al tema 4

00:52:11.980 --> 00:52:13.240
vallase a la expresion del tema 4

00:52:13.240 --> 00:52:15.140
vallase a la expresion del tema 4

00:52:17.180 --> 00:52:19.040
dele aqui al final

00:52:19.040 --> 00:52:21.180
un punto replace

00:52:22.100 --> 00:52:23.300
punto replace

00:52:24.180 --> 00:52:25.360
ya le coloco

00:52:25.360 --> 00:52:26.680
como queda en grande

00:52:26.680 --> 00:52:28.680
va a agregar esto

00:52:30.420 --> 00:52:31.820
al final

00:52:33.060 --> 00:52:34.680
un punto replace

00:52:36.280 --> 00:52:38.520
abro parenthesis, comilla doble

00:52:38.520 --> 00:52:40.720
slash n y reemplazo los

00:52:40.720 --> 00:52:42.580
slash n por

00:52:42.580 --> 00:52:43.600
espacio en blanco

00:52:43.600 --> 00:52:46.160
que al señor no le gusta

00:52:46.680 --> 00:52:48.460
seguramente en el jason que vaya un

00:52:48.460 --> 00:52:49.860
salto de linea

00:52:49.860 --> 00:52:52.580
porque bueno ya es cuestión muy propia

00:52:52.580 --> 00:52:54.300
de el de porque no debe aceptar eso

00:52:54.300 --> 00:52:56.360
agrega al final un punto

00:52:56.360 --> 00:52:57.200
replace

00:52:58.820 --> 00:53:00.100
mientras usted

00:53:00.100 --> 00:53:03.720
voy ejecutando para ver si ese es el error

00:53:04.300 --> 00:53:06.300
creeria que ese es el error

00:53:15.660 --> 00:53:16.420
perfecto

00:53:16.420 --> 00:53:18.420
me respondio debe de haber la respuesta

00:53:18.420 --> 00:53:20.300
en el correo voy a confirmar

00:53:20.300 --> 00:53:21.540
para ver

00:53:22.060 --> 00:53:23.300
efectivamente ya me respondio

00:53:23.300 --> 00:53:25.220
para aumentar la mente de productos entre hombres en yangon

00:53:25.220 --> 00:53:26.940
es importante tener en cuenta la parte cultural

00:53:27.540 --> 00:53:29.760
investigar las preferencias y tendencias de consumo de hombres en yangon

00:53:29.760 --> 00:53:32.360
bueno ahi el pron si estuvo mal

00:53:32.360 --> 00:53:34.140
la idea es saber como integrar

00:53:34.140 --> 00:53:35.740
bueno a quien no le funciono a ver

00:53:35.740 --> 00:53:38.200
me regala dos minutico y busco un cafecito

00:53:38.640 --> 00:53:39.500
y mientras usted va

00:53:39.500 --> 00:53:41.540
ejecutando ahi busco un cafe ya vengo

00:53:43.860 --> 00:53:44.940
bueno volvi que pena

00:53:45.640 --> 00:53:47.820
cuenteme si a quien no le funciono

00:53:47.820 --> 00:53:48.780
si le funciono a todos

00:53:51.080 --> 00:53:52.120
ah ok

00:53:53.600 --> 00:53:54.740
si creo que

00:53:54.740 --> 00:53:55.500
no se si le canso

00:53:55.500 --> 00:53:57.000
vaya a ser al tema 4

00:53:57.000 --> 00:54:00.400
debe tenerlo como tema 4

00:54:00.400 --> 00:54:01.900
este que esta ahi

00:54:01.900 --> 00:54:02.780
tema 4

00:54:04.880 --> 00:54:05.660
por cierto

00:54:05.660 --> 00:54:07.840
para cambiarle el nombre tambien a los componentes

00:54:07.840 --> 00:54:09.560
que de pronto

00:54:09.560 --> 00:54:10.340
con tema 4

00:54:10.340 --> 00:54:13.120
hace lo mismo como cambiamos las flechitas

00:54:13.120 --> 00:54:14.900
le da click en el componente

00:54:14.900 --> 00:54:16.800
que quiere cambiarle el nombre en el label

00:54:16.800 --> 00:54:18.400
click nuevamente

00:54:18.400 --> 00:54:19.600
y le puede cambiar el nombre

00:54:20.640 --> 00:54:21.840
si acaso

00:54:21.840 --> 00:54:24.800
vallense a la expresion por favor si es tan amable Brenda

00:54:26.140 --> 00:54:27.100
ahi falto

00:54:28.040 --> 00:54:28.640
listo

00:54:28.640 --> 00:54:30.820
al final del parentesi coloque un punto

00:54:30.820 --> 00:54:33.240
replace ahi esta el primero del enter

00:54:33.240 --> 00:54:35.100
abra comilla doble ahi

00:54:35.100 --> 00:54:36.680
slash n al

00:54:36.680 --> 00:54:37.820
el otro

00:54:38.920 --> 00:54:41.020
el otro el otro el

00:54:41.020 --> 00:54:41.660
ese

00:54:41.660 --> 00:54:44.160
cierre comilla doble

00:54:44.160 --> 00:54:46.040
y ahora esta el 0

00:54:46.040 --> 00:54:47.940
coloque comilla doble

00:54:47.940 --> 00:54:51.160
espacio comilla doble para reemplazar ese caracter ahi

00:54:51.160 --> 00:54:52.160
espacio comilla doble

00:54:52.760 --> 00:54:54.180
dele ok

00:54:54.180 --> 00:54:56.160
por favor si es tan amable

00:54:57.160 --> 00:54:58.020
dele ok

00:54:58.020 --> 00:54:59.660
cierre la entana y vuelve a ejecutar para ver

00:54:59.660 --> 00:55:02.160
listo ya esta por la linea

00:55:03.940 --> 00:55:05.560
consumiendo listo

00:55:05.560 --> 00:55:08.100
rese su correo por favor

00:55:08.820 --> 00:55:10.200
bueno ya claro que el prod

00:55:10.200 --> 00:55:12.160
no fue el mejor pero la idea es

00:55:12.160 --> 00:55:14.260
creo que ahi tenemos el ejemplo

00:55:14.260 --> 00:55:16.000
de como podemos combinar todo

00:55:16.000 --> 00:55:18.160
resultado con la parte de ahi

00:55:18.160 --> 00:55:20.260
lo que tendria que mejorar es el prom

00:55:20.260 --> 00:55:22.180
y pues evidentemente saber

00:55:22.180 --> 00:55:24.200
como manejar los datos tiene que tener en cuenta que cuando

00:55:24.200 --> 00:55:26.220
manda un json ese open ahi es muy

00:55:26.220 --> 00:55:28.020
celoso con algunos caracteres

00:55:28.020 --> 00:55:29.980
ya iremos con el caso del

00:55:29.980 --> 00:55:32.260
salto de linea y puede haber unos caracteres

00:55:32.260 --> 00:55:34.160
que de pronto no los acepta

00:55:34.960 --> 00:55:36.080
si le llego

00:55:36.080 --> 00:55:38.000
Brenda ok

00:55:41.700 --> 00:55:42.140
listo

00:55:42.140 --> 00:55:44.620
por un lado y nos vamos con Hai

00:55:46.960 --> 00:55:47.840
explico entonces

00:55:50.420 --> 00:55:52.260
no vamos a lo siguiente mejor

00:55:53.000 --> 00:55:54.320
mientras voy explicando

00:55:54.320 --> 00:55:56.160
porque que no se si tenemos memoria

00:55:56.160 --> 00:55:58.140
voy a cerrar este señor por favor

00:55:59.760 --> 00:56:00.200
abra

00:56:01.440 --> 00:56:02.200
algo que se llama

00:56:02.200 --> 00:56:03.700
el docker de stock

00:56:03.700 --> 00:56:06.260
se busca aqui docker aqui esta no aqui esta

00:56:06.260 --> 00:56:08.140
esta en el escritorio dice docker de stock

00:56:08.680 --> 00:56:10.120
le da doble clic por favor

00:56:10.120 --> 00:56:12.120
para que abra la interfaz

00:56:13.660 --> 00:56:15.660
listo lo deja ahi que vaya abriendo un momento

00:56:15.660 --> 00:56:18.200
la interfaz mientras yo voy explicando

00:56:18.200 --> 00:56:19.780
como la arquitectura que estamos manejando

00:56:21.380 --> 00:56:21.840
actualmente

00:56:21.840 --> 00:56:24.240
listo mientras eso va subiendo mire mi pantalla

00:56:24.240 --> 00:56:26.240
un momento y le voy a hacer el diagrama de lo que tenemos

00:56:27.140 --> 00:56:27.600
entonces

00:56:28.120 --> 00:56:28.620
diagrama aqui

00:56:28.620 --> 00:56:32.120
en blanco perfecto

00:56:32.800 --> 00:56:34.440
bien veamos por un lado

00:56:34.440 --> 00:56:36.260
tenemos la maquina

00:56:36.260 --> 00:56:37.700
virtual que es la de windows

00:56:37.700 --> 00:56:38.980
listo esta maquina virtual

00:56:38.980 --> 00:56:41.080
tiene direccion ip

00:56:41.700 --> 00:56:43.240
estatica perdon

00:56:43.240 --> 00:56:43.940
ip

00:56:43.940 --> 00:56:47.880
10.0.3.15

00:56:47.880 --> 00:56:49.860
sobre windows

00:56:49.860 --> 00:56:50.400
tenemos

00:56:51.700 --> 00:56:53.700
otra maquina virtual con hyper v

00:56:54.420 --> 00:56:55.620
que se llama

00:56:55.620 --> 00:56:58.040
hadut y que esta maquina virtual

00:56:58.040 --> 00:56:58.960
tiene direccion

00:56:58.960 --> 00:57:01.680
10.0.3.16

00:57:01.680 --> 00:57:03.580
ahora sobre

00:57:03.580 --> 00:57:05.380
windows tengo

00:57:05.380 --> 00:57:07.260
instalado el docker que es

00:57:07.260 --> 00:57:08.880
un motor de contenedores

00:57:09.700 --> 00:57:11.400
y sobre docker

00:57:11.400 --> 00:57:13.640
tenemos montado lo siguiente

00:57:13.640 --> 00:57:15.820
tengo montado tres contenedores

00:57:15.820 --> 00:57:17.300
un contenedor de

00:57:17.300 --> 00:57:19.200
postgres que ya les digo para que sirve eso

00:57:19.200 --> 00:57:21.620
postgres sql

00:57:21.620 --> 00:57:23.300
un contenedor que se llama

00:57:25.200 --> 00:57:25.380
metastore

00:57:26.280 --> 00:57:27.000
metastore

00:57:27.000 --> 00:57:29.600
y un componente

00:57:29.600 --> 00:57:30.980
que es el que tiene

00:57:30.980 --> 00:57:32.860
instalado el hi explico entonces

00:57:32.860 --> 00:57:33.940
inicialmente

00:57:33.940 --> 00:57:36.840
este contenedor no tiene direccion ip

00:57:36.840 --> 00:57:39.320
aunque se le puede configurar pero normalmente

00:57:39.320 --> 00:57:40.720
la ip seria

00:57:40.720 --> 00:57:43.340
la misma ip de la maquina que este el señor

00:57:43.340 --> 00:57:45.960
instalado el contenedor

00:57:45.960 --> 00:57:47.120
explico entonces

00:57:47.120 --> 00:57:48.720
como es la historia ahora

00:57:48.720 --> 00:57:50.260
en primer lugar hi

00:57:51.400 --> 00:57:53.040
es un motor ahora

00:57:53.040 --> 00:57:54.860
pensado para bodegas de datos

00:57:55.540 --> 00:57:56.500
que utiliza

00:57:56.500 --> 00:57:58.480
tecnología de big data

00:57:58.480 --> 00:58:00.600
hi va a ser el almacenamiento

00:58:00.600 --> 00:58:01.860
en un cluster de hadu

00:58:01.860 --> 00:58:04.760
entonces ya esto esta configurado para que este

00:58:04.760 --> 00:58:06.060
hi guarde los datos

00:58:06.600 --> 00:58:08.400
en la maquina virtual en pocas palabras

00:58:08.400 --> 00:58:10.660
cuando creemos tablas insertemos

00:58:10.660 --> 00:58:12.780
registros ahi van a quedar guardados

00:58:12.780 --> 00:58:14.680
en el cluster de hadu que esta a este

00:58:14.680 --> 00:58:15.160
lado

00:58:16.440 --> 00:58:18.520
para que es ese postgres

00:58:19.240 --> 00:58:20.660
resulta en que lo que

00:58:20.660 --> 00:58:22.600
voy a hacer es que voy a

00:58:23.220 --> 00:58:24.480
colocar la misma capa

00:58:24.480 --> 00:58:26.060
sql que usted conocio

00:58:26.060 --> 00:58:28.420
pero al final cuando yo hago una consulta

00:58:28.420 --> 00:58:29.460
con un sql insert

00:58:29.460 --> 00:58:31.900
al final lo que estoy usando es ya sea

00:58:32.420 --> 00:58:33.740
macredius o ya sea sparr

00:58:33.740 --> 00:58:35.520
para hacer cualquiera de las cosas

00:58:35.520 --> 00:58:38.040
entonces para poder simular

00:58:38.040 --> 00:58:39.240
el sql insert

00:58:40.200 --> 00:58:41.600
el create table

00:58:41.600 --> 00:58:43.720
debo utilizar un motor de base de datos

00:58:43.720 --> 00:58:45.620
en este caso el motor que usamos

00:58:45.620 --> 00:58:48.420
el que estoy usando es postgres

00:58:48.420 --> 00:58:50.280
por lo tanto este señor va

00:58:50.280 --> 00:58:51.760
a tomar la sintaxis de pokres

00:58:51.760 --> 00:58:54.320
para crear tablas

00:58:54.320 --> 00:58:56.080
y como las creas a través

00:58:56.080 --> 00:58:57.900
de un elemento intermedio que se llama

00:58:57.900 --> 00:58:59.580
metastore en pocas palabras

00:59:00.660 --> 00:59:02.280
la estructura de datos

00:59:02.280 --> 00:59:04.240
de tu base de datos va a

00:59:04.240 --> 00:59:06.080
quedar almacenada aqui en postgres

00:59:07.060 --> 00:59:08.080
pero los datos

00:59:08.080 --> 00:59:10.040
van a quedar almacenados no en esta instancia

00:59:10.040 --> 00:59:12.920
de pokres sino en este hadu de aqui

00:59:12.920 --> 00:59:14.540
cuando hago una consulta

00:59:14.540 --> 00:59:15.640
en select aterisco

00:59:15.640 --> 00:59:18.040
por la tabla que tu quieras lo que va a ser

00:59:18.040 --> 00:59:20.320
entonces hay es que va a convertir eso

00:59:20.320 --> 00:59:21.940
o ya sea a macredius

00:59:21.940 --> 00:59:22.900
o ya sea sparr

00:59:22.900 --> 00:59:26.220
en el caso de versiones 3 hacia abajo

00:59:26.220 --> 00:59:28.240
haria y convertiria

00:59:28.240 --> 00:59:29.540
osea practicamente lo que hace es

00:59:29.540 --> 00:59:30.260
big data

00:59:30.900 --> 00:59:32.820
es la unica diferencia que el select aterisco

00:59:32.820 --> 00:59:34.700
que es una postgres sql me la va a convertir

00:59:34.700 --> 00:59:36.680
en un macredius

00:59:36.680 --> 00:59:39.700
y si estoy de 3 hacia arriba

00:59:39.700 --> 00:59:40.900
puede que la consulta

00:59:41.620 --> 00:59:43.760
me la haga con spark

00:59:43.760 --> 00:59:44.920
porque digo que puede

00:59:44.920 --> 00:59:47.080
porque tu puedes configurar el high

00:59:47.080 --> 00:59:48.260
en version 4

00:59:48.260 --> 00:59:50.600
para que ejecute ya sea con macredius

00:59:50.600 --> 00:59:52.600
o con spark como tu desees

00:59:53.300 --> 00:59:54.920
entonces partiendo ese hecho

00:59:54.920 --> 00:59:56.300
esta seria la arquitectura

00:59:56.300 --> 00:59:58.440
vamos a comprobar

00:59:58.440 --> 01:00:00.700
lo siguiente entonces vamos a

01:00:00.700 --> 01:00:02.660
la maquina virtual a ver si ya me subio

01:00:02.660 --> 01:00:04.540
el docker de esto

01:00:05.640 --> 01:00:07.440
ok aqui esta

01:00:07.440 --> 01:00:08.200
por favor

01:00:08.200 --> 01:00:10.760
por defecto estan los

01:00:10.760 --> 01:00:12.040
3 contenedores

01:00:14.640 --> 01:00:15.240
seleccionados

01:00:15.240 --> 01:00:16.660
si digo iniciar puede

01:00:16.660 --> 01:00:18.500
que me daña el contenedor porque tienen que haber

01:00:18.500 --> 01:00:20.880
un orden en los cuales vamos a subir los contenedores

01:00:20.880 --> 01:00:22.580
lo vamos a subir en el siguiente orden

01:00:22.580 --> 01:00:23.280
primero

01:00:24.380 --> 01:00:26.000
seleccionado todos los contenedores

01:00:26.000 --> 01:00:28.260
y el primero que vamos a subir

01:00:28.260 --> 01:00:30.700
se llama high postgres

01:00:30.700 --> 01:00:32.600
que es el que tiene instalado el postgres

01:00:32.600 --> 01:00:34.900
ahi van a quedar las tablas

01:00:34.900 --> 01:00:36.360
dele por favor seleccionale

01:00:36.360 --> 01:00:38.500
y da click aca en el play para iniciar

01:00:40.980 --> 01:00:42.740
listo el verde indica

01:00:42.740 --> 01:00:44.580
en que subio de manera

01:00:44.580 --> 01:00:46.640
correcta ahora suba por favor

01:00:46.640 --> 01:00:48.300
el metastore

01:00:48.300 --> 01:00:49.400
que seria el segundo

01:00:49.400 --> 01:00:52.440
lo selecciona y le da play

01:00:52.440 --> 01:00:54.040
para iniciar

01:00:54.040 --> 01:00:56.980
y no hay ningun problema si digame

01:00:56.980 --> 01:00:58.640
ok creo que hizo esto

01:00:59.880 --> 01:01:02.000
creo que esta asi mas o menos

01:01:02.000 --> 01:01:03.420
esta aqui

01:01:04.380 --> 01:01:06.740
aqui le queda como una pequeña flecha

01:01:06.740 --> 01:01:08.980
que esta aqui no se si me parece

01:01:08.980 --> 01:01:10.180
si no le da click

01:01:10.180 --> 01:01:12.640
ah listo perfecto

01:01:13.780 --> 01:01:14.800
verifique que subo

01:01:14.800 --> 01:01:15.920
ahora el segundo por favor

01:01:15.920 --> 01:01:18.420
por ultimo subamos ahora el de high

01:01:18.420 --> 01:01:21.240
que seria el ultimo el primerito que aparece

01:01:23.600 --> 01:01:24.540
que pasa

01:01:25.880 --> 01:01:27.000
si aqui

01:01:27.000 --> 01:01:28.400
me salio un pequeño error

01:01:28.400 --> 01:01:29.660
no se si a usted le subio

01:01:29.660 --> 01:01:31.780
no subio

01:01:31.780 --> 01:01:33.340
asi no subio

01:01:33.340 --> 01:01:35.700
hagamos lo siguiente entonces

01:01:35.700 --> 01:01:38.640
vamos a volverlo a recrear porque seguramente paso algo

01:01:38.640 --> 01:01:40.140
en alguna configuracion no se

01:01:40.140 --> 01:01:41.000
de la maquina virtual

01:01:41.000 --> 01:01:42.660
de windows

01:01:42.660 --> 01:01:45.140
hagamos lo siguiente estando los 3

01:01:45.140 --> 01:01:47.320
seleccionados primero

01:01:47.320 --> 01:01:49.460
detengalos aqui

01:01:49.460 --> 01:01:51.980
en el cuadrado ese

01:01:51.980 --> 01:01:52.960
detener

01:01:52.960 --> 01:01:55.180
y cuando termine de

01:01:55.180 --> 01:01:57.000
tenerlos por favor

01:01:58.040 --> 01:01:59.460
le va ahora

01:01:59.460 --> 01:02:00.560
en delete

01:02:00.560 --> 01:02:02.940
para borrar los contenidores

01:02:02.940 --> 01:02:05.020
delete forever

01:02:07.540 --> 01:02:09.800
aclaro lo que pasa es que

01:02:09.800 --> 01:02:10.780
si me demo

01:02:10.780 --> 01:02:12.340
me podia demorar mas

01:02:13.960 --> 01:02:15.440
corrigiendo el error porque que pasa

01:02:15.440 --> 01:02:17.120
que volviendolo a crear

01:02:17.120 --> 01:02:20.220
entonces vallase al proveedor de archivo si esta amable

01:02:20.220 --> 01:02:22.140
y en el proveedor de archivo

01:02:22.140 --> 01:02:23.800
en la carpeta de descargas

01:02:24.820 --> 01:02:26.340
va a encontrar una carpeta

01:02:26.340 --> 01:02:27.740
que se llama docker high

01:02:27.740 --> 01:02:29.480
2.0

01:02:29.480 --> 01:02:31.800
si la vio dele doble click por favor

01:02:36.480 --> 01:02:38.120
permiteme confirmar algo aqui

01:02:38.120 --> 01:02:40.440
si perfecto esta bien

01:02:40.440 --> 01:02:42.120
docker high 3.0

01:02:44.720 --> 01:02:46.360
ahora en esta carpeta

01:02:46.360 --> 01:02:48.300
que esta aqui

01:02:48.300 --> 01:02:50.000
vamos a decir este señor me deja

01:02:50.000 --> 01:02:51.700
no no me deja en esta version este es windows

01:02:51.700 --> 01:02:54.400
entonces hago lo siguiente sencillo

01:02:54.400 --> 01:02:55.920
en la barra de direcciones

01:02:55.920 --> 01:02:57.760
que esta aca arriba le va a dar click

01:02:57.760 --> 01:02:59.080
un poquito

01:03:00.180 --> 01:03:02.060
en la misma barra pero

01:03:02.060 --> 01:03:04.480
que no seleccione ninguna letra

01:03:04.480 --> 01:03:05.940
de tal forma que seleccione todos los elementos

01:03:05.940 --> 01:03:07.500
y escriba cmd

01:03:07.500 --> 01:03:08.260
y le da enter

01:03:10.220 --> 01:03:12.760
y me va a abrir entonces una terminal shell

01:03:12.760 --> 01:03:14.140
de cmd

01:03:14.140 --> 01:03:16.420
en esa carpeta que he escogido

01:03:16.420 --> 01:03:17.800
y aqui vamos a escribir lo siguiente

01:03:19.300 --> 01:03:19.760
docker

01:03:22.200 --> 01:03:22.660
compose

01:03:22.660 --> 01:03:23.940
deja y se lo escribo aca entonces

01:03:26.480 --> 01:03:27.900
mejor lo copio

01:03:27.900 --> 01:03:30.240
un momento lo copio de aqui

01:03:33.000 --> 01:03:34.560
y para que usted lo vea mejor

01:03:34.560 --> 01:03:36.560
aca va a copiar

01:03:36.560 --> 01:03:38.140
esto exactamente asi docker

01:03:38.140 --> 01:03:39.600
compose op-d

01:03:39.600 --> 01:03:41.860
le das enter por favor

01:03:49.080 --> 01:03:50.880
y te va nuevamente al docker

01:03:50.880 --> 01:03:51.540
de estos

01:03:52.760 --> 01:03:54.300
y confirma que tenga los tres

01:03:54.880 --> 01:03:56.980
habilitados a poner el comando

01:03:56.980 --> 01:03:58.620
claro como lo puedo perfecto dice

01:03:58.620 --> 01:04:00.240
Fernanda voy a ver para ver que paso

01:04:00.240 --> 01:04:02.000
en el comando y te digo que paso que salio

01:04:02.000 --> 01:04:02.960
vamos a ver un pequeño

01:04:03.900 --> 01:04:06.000
un momento a ver

01:04:06.880 --> 01:04:07.720
falto el op

01:04:07.720 --> 01:04:09.980
falto op despues de compose

01:04:09.980 --> 01:04:11.780
op up

01:04:11.780 --> 01:04:13.500
dale flecha arriba dale flecha arriba

01:04:13.500 --> 01:04:15.940
flechita arriba para que te vuelva

01:04:15.940 --> 01:04:18.000
para que te coloque el comando

01:04:19.280 --> 01:04:20.180
y tu le dices

01:04:20.180 --> 01:04:21.200
nada mas

01:04:21.840 --> 01:04:24.020
entonces devuelve con flecha a la izquierda

01:04:24.020 --> 01:04:25.920
espacio y dale enter ahora si

01:04:25.920 --> 01:04:26.780
espera un momento

01:04:26.780 --> 01:04:29.540
listo vete ahora a tu docker

01:04:30.340 --> 01:04:31.020
listo

01:04:33.200 --> 01:04:33.880
perfecto

01:04:33.880 --> 01:04:35.580
claro que no se que paso inicialmente

01:04:35.580 --> 01:04:37.220
ahora a lo siguiente para

01:04:37.220 --> 01:04:39.540
no entrarle mucho comando y vamos directamente aqui

01:04:40.060 --> 01:04:41.660
vallese por favor aqui

01:04:41.660 --> 01:04:43.540
al que dice high server

01:04:45.220 --> 01:04:47.680
dele click a high server

01:04:49.300 --> 01:04:52.020
y ahora por favor dele click

01:04:52.020 --> 01:04:52.760
ahora

01:04:53.540 --> 01:04:55.980
aqui hay un menu

01:04:55.980 --> 01:04:57.660
vaya donde dice

01:04:57.660 --> 01:04:59.020
exec de ejecutar

01:04:59.020 --> 01:05:10.060
a mi particularmente

01:05:10.060 --> 01:05:11.240
no me gusta esta shell

01:05:11.240 --> 01:05:13.780
me gusta mas coloque la palabra

01:05:13.780 --> 01:05:15.780
bash esa shell no me gusta

01:05:15.780 --> 01:05:17.620
mucho coloque bash

01:05:18.340 --> 01:05:21.160
para cambiar de shell

01:05:21.160 --> 01:05:23.660
me gusta mas esa

01:05:24.820 --> 01:05:25.640
esa shell no me gusta

01:05:25.640 --> 01:05:28.000
entonces vamos listo bash

01:05:30.060 --> 01:05:30.940
bash

01:05:31.640 --> 01:05:34.060
ahora del siguiente comando

01:05:34.060 --> 01:05:35.680
del comando

01:05:35.680 --> 01:05:36.520
high enter

01:05:36.520 --> 01:05:39.640
tiene que aparecerle ahora un pron llamado

01:05:40.340 --> 01:05:41.120
beeline

01:05:41.640 --> 01:05:43.640
que es el

01:05:45.200 --> 01:05:45.760
el driver

01:05:45.760 --> 01:05:47.020
que conecta hacia high

01:05:47.020 --> 01:05:48.140
aparece beeline a todos

01:05:49.720 --> 01:05:50.600
beeline

01:05:51.740 --> 01:05:52.920
ahi esta

01:05:52.920 --> 01:05:55.200
ahora vamos a conectarnos a la base de datos

01:05:55.200 --> 01:05:57.220
como nos conectamos a la base de datos

01:05:57.220 --> 01:05:59.080
tiene que ir al siguiente comando

01:05:59.080 --> 01:06:00.600
slash connect

01:06:00.600 --> 01:06:02.340
un momento aqui

01:06:02.920 --> 01:06:04.100
slash connect

01:06:04.940 --> 01:06:06.920
y entonces le da

01:06:09.940 --> 01:06:10.780
jdbc

01:06:10.780 --> 01:06:12.560
dos puntos

01:06:12.560 --> 01:06:15.380
high dos cero

01:06:15.380 --> 01:06:16.600
dos punto

01:06:16.600 --> 01:06:18.920
slash slash localhost

01:06:20.380 --> 01:06:21.260
asi que

01:06:21.260 --> 01:06:22.160
mejor hagamos lo siguiente

01:06:23.400 --> 01:06:24.920
le agrego este comando mejor

01:06:24.920 --> 01:06:26.920
en la carpeta compartida para que no tenga que escribirlo

01:06:27.620 --> 01:06:28.920
dos punto diez mil

01:06:30.400 --> 01:06:31.000
diez mil

01:06:31.000 --> 01:06:33.160
aqui espera un momento y lo coloco en la carpeta compartida

01:06:33.160 --> 01:06:33.820
este comando

01:06:35.100 --> 01:06:36.300
si de admiracion perfecto

01:06:36.300 --> 01:06:38.320
si de admiracion

01:06:38.920 --> 01:06:41.180
si quiere lo voy a colocar en la carpeta compartida

01:06:41.180 --> 01:06:42.980
mejor para que no este

01:06:42.980 --> 01:06:44.660
copiando eso

01:06:44.660 --> 01:06:46.120
porque voy a hacer esto

01:06:46.120 --> 01:06:47.900
porque dentro de talent

01:06:47.900 --> 01:06:50.240
talent puede hacer muchas cosas con base de datos

01:06:50.240 --> 01:06:52.020
lo unico que no hay un componente directo

01:06:52.020 --> 01:06:53.660
para crear una base de datos, entonces voy a crear aqui

01:06:53.660 --> 01:06:55.040
una base de datos

01:06:55.760 --> 01:06:57.980
espere un momentico y lo subo aqui en talent

01:06:57.980 --> 01:07:00.140
en la carpeta expresiones que habiamos

01:07:00.140 --> 01:07:02.480
usado ya esta

01:07:02.480 --> 01:07:04.060
voy a la carpeta

01:07:04.060 --> 01:07:08.020
compartida aca y la voy a copiar mejor de ahi

01:07:08.020 --> 01:07:09.220
a ver

01:07:10.380 --> 01:07:11.880
seria one drive

01:07:11.880 --> 01:07:13.480
aqui tengo tv4

01:07:18.820 --> 01:07:19.920
expresiones

01:07:22.120 --> 01:07:23.780
copia el comando

01:07:23.780 --> 01:07:25.780
y lo pega directamente aca

01:07:25.780 --> 01:07:27.040
en la terminal

01:07:27.040 --> 01:07:29.060
deberia permitir copiarlo

01:07:36.120 --> 01:07:37.060
perfecto ese es el comando

01:07:38.120 --> 01:07:39.660
si esta mal

01:07:39.660 --> 01:07:41.580
colocalo en edge

01:07:42.120 --> 01:07:43.660
ah perdon

01:07:43.660 --> 01:07:45.020
no tiene enter

01:07:45.020 --> 01:07:45.620
que pena

01:07:47.120 --> 01:07:47.840
no tiene

01:07:47.840 --> 01:07:50.300
no tiene usuario

01:07:51.960 --> 01:07:53.720
al final te va a aparecer un prompt

01:07:53.720 --> 01:07:54.840
que es la linea de conexion

01:07:54.840 --> 01:07:57.660
hacia el motor

01:07:58.580 --> 01:08:00.040
vete por favor ahi

01:08:00.040 --> 01:08:01.580
si quiere escribe ahi one drive

01:08:01.580 --> 01:08:02.780
que seguramente ya lo has usado

01:08:03.600 --> 01:08:05.300
ahi te aparece un momentico

01:08:05.300 --> 01:08:07.100
ya voy a tu pantalla

01:08:07.660 --> 01:08:08.440
listo estas ahi

01:08:08.440 --> 01:08:09.140
entre expresiones

01:08:09.140 --> 01:08:10.140
si esta mal es fernando

01:08:11.140 --> 01:08:12.540
copia de la ultima linea

01:08:12.540 --> 01:08:15.000
la vas a pegar en el shell

01:08:15.000 --> 01:08:16.300
exactamente ahi

01:08:16.300 --> 01:08:17.840
click derecho

01:08:20.020 --> 01:08:21.400
paste ahi

01:08:22.860 --> 01:08:23.580
listo

01:08:23.580 --> 01:08:24.200
enter

01:08:25.140 --> 01:08:26.440
no tiene usuario

01:08:27.140 --> 01:08:28.800
enter y tampoco tiene paso

01:08:28.800 --> 01:08:30.360
entonces enter

01:08:31.140 --> 01:08:33.580
y ahi estamos conectados

01:08:33.580 --> 01:08:35.040
listo entonces

01:08:35.040 --> 01:08:37.140
ahora veamos un par de comandos

01:08:38.760 --> 01:08:39.920
si digame

01:08:40.580 --> 01:08:41.160
ah

01:08:41.160 --> 01:08:43.400
quien me hablo Beatriz si

01:08:43.400 --> 01:08:45.260
y vamos a ver que paso vamos a ver

01:08:45.260 --> 01:08:47.600
porque el comando no funciono

01:08:47.600 --> 01:08:48.800
vamos a ver

01:08:48.800 --> 01:08:50.760
ah primero dele hay enter

01:08:51.460 --> 01:08:53.340
y va a aparecer un prompt que dice

01:08:53.340 --> 01:08:55.140
beeline espera un momento por favor

01:08:55.140 --> 01:08:57.080
listo perfecto ahora

01:08:57.080 --> 01:08:59.280
copia la linea osea arriba

01:08:59.280 --> 01:09:01.340
tiene ya el comando listo

01:09:01.340 --> 01:09:03.140
copielo por favor donde dice

01:09:04.860 --> 01:09:06.100
ah donde dice

01:09:06.100 --> 01:09:07.360
conect jdbc

01:09:07.360 --> 01:09:09.940
dale flecha arriba parece que le coge el comando flecha arriba

01:09:09.940 --> 01:09:11.360
para ver no no un momento no no

01:09:11.360 --> 01:09:13.140
salio salio salio dele hay nuevamente

01:09:14.200 --> 01:09:15.140
sabe porque no le funciono

01:09:15.900 --> 01:09:17.520
porque mando mil ira diez mil

01:09:17.520 --> 01:09:18.980
esta escuchando por el puerto diez mil

01:09:18.980 --> 01:09:20.760
permítame yo le intento a ver un momento

01:09:20.760 --> 01:09:22.420
yo se lo copio para ver eso

01:09:22.420 --> 01:09:24.080
dele hay ahi asi dele enter

01:09:24.080 --> 01:09:26.900
y voy a intentar copiar este comando aqui

01:09:26.900 --> 01:09:29.180
pero un momento le intento copiar

01:09:29.180 --> 01:09:30.860
este comando

01:09:30.860 --> 01:09:32.120
seria este de aqui

01:09:32.120 --> 01:09:34.960
y lo voy a pegar aca

01:09:34.960 --> 01:09:36.860
el problema es que era diez mil

01:09:40.060 --> 01:09:41.260
ahora si

01:09:41.260 --> 01:09:42.860
por limita de un cero nada mas

01:09:43.640 --> 01:09:44.860
listo seguimos entonces

01:09:45.560 --> 01:09:47.000
a ver vamos a dar los siguientes

01:09:47.000 --> 01:09:49.060
comandos entonces

01:09:49.060 --> 01:09:50.920
vamos aqui a hacer

01:09:53.640 --> 01:09:55.240
ah yo si estaba buscando

01:09:55.240 --> 01:09:56.980
y dije cari si listo si gracias

01:09:56.980 --> 01:09:58.480
ah perdido

01:09:58.480 --> 01:10:00.220
primer comando dele el siguiente

01:10:00.220 --> 01:10:02.740
deben terminar en punto y coma los comandos

01:10:02.740 --> 01:10:04.220
select

01:10:04.220 --> 01:10:06.640
curren underscore

01:10:06.640 --> 01:10:08.400
database deberia devolverle

01:10:08.400 --> 01:10:10.860
punto y coma deberia devolverle el nombre

01:10:10.860 --> 01:10:12.480
curren aqui que pena de curren

01:10:13.900 --> 01:10:14.480
curren asi

01:10:15.060 --> 01:10:16.480
actual base de ahi

01:10:17.300 --> 01:10:18.480
le confirmo entonces

01:10:22.300 --> 01:10:23.900
confirmo aqui

01:10:25.140 --> 01:10:26.740
para ver

01:10:27.860 --> 01:10:29.320
espere para ver en que me

01:10:29.320 --> 01:10:30.740
equivoque en el comando creo que era

01:10:30.740 --> 01:10:32.740
curren me parecio que era curren el comando

01:10:34.260 --> 01:10:34.820
dejai

01:10:34.820 --> 01:10:37.260
confirmo para ver ah listo

01:10:37.260 --> 01:10:39.220
el comando necesita terminar

01:10:39.220 --> 01:10:40.740
con parentesis

01:10:41.820 --> 01:10:42.800
creo que es mejor

01:10:42.800 --> 01:10:44.800
estrategia que lo haga en uno pad para poder

01:10:44.800 --> 01:10:46.700
copiar y pegar porque no deja aqui

01:10:46.700 --> 01:10:48.740
ahi asi

01:10:50.180 --> 01:10:50.740
confirmo

01:10:50.740 --> 01:10:51.680
para ver

01:10:52.680 --> 01:10:54.180
aqui me devolvio

01:10:58.640 --> 01:10:59.600
ahora si

01:11:00.100 --> 01:11:01.180
perfecto ahi esta

01:11:01.180 --> 01:11:02.440
en la base datos default

01:11:02.440 --> 01:11:04.080
el comando es con

01:11:05.360 --> 01:11:06.720
listo ahora si llevamos de rapide

01:11:06.720 --> 01:11:08.800
que estamos un poquito atrasados vamos entonces a hacer lo siguiente

01:11:08.800 --> 01:11:10.080
vamos a crear nuestro propio base datos

01:11:10.740 --> 01:11:12.280
create database

01:11:13.640 --> 01:11:14.120
axa

01:11:14.120 --> 01:11:16.380
para conectarse axa le da use

01:11:16.380 --> 01:11:18.860
axa recuerda de terminar en punto y coma si no sale

01:11:25.580 --> 01:11:26.820
vamos a ver

01:11:26.820 --> 01:11:27.680
vamos a ver

01:11:27.680 --> 01:11:28.120
vamos a ver

01:11:28.120 --> 01:11:28.860
vamos a ver

01:11:30.220 --> 01:11:31.900
dale un

01:11:31.900 --> 01:11:32.480
select

01:11:32.480 --> 01:11:33.240
curren database

01:11:35.220 --> 01:11:36.060
ah

01:11:36.060 --> 01:11:38.780
select curren

01:11:38.780 --> 01:11:40.020
ah porque tienes

01:11:40.020 --> 01:11:40.860
data

01:11:43.320 --> 01:11:44.420
listo cambio

01:11:44.420 --> 01:11:46.520
entonces espera un momento a ver

01:11:46.520 --> 01:11:47.120
data

01:11:48.340 --> 01:11:48.820
ah

01:11:48.820 --> 01:11:50.920
si porque que si

01:11:50.920 --> 01:11:53.120
no tuviese conectado

01:11:53.120 --> 01:11:54.760
ahi esta bien conectado perfecto

01:11:54.760 --> 01:11:56.900
vamos a hacer los siguientes pasos entonces

01:11:56.900 --> 01:11:59.120
dame gracias a ti listo voy a salir de aqui

01:11:59.760 --> 01:12:01.140
creamos la base datos recuerda de terminar

01:12:01.140 --> 01:12:03.120
en punto y coma use axa

01:12:04.360 --> 01:12:05.240
vamos a crear aqui

01:12:05.240 --> 01:12:07.440
una tablita sencilla

01:12:07.440 --> 01:12:08.860
create table

01:12:08.860 --> 01:12:10.740
create table

01:12:11.640 --> 01:12:13.280
aqui la ventaja del meta store

01:12:13.280 --> 01:12:15.120
es que puedes usar la sintaxis de

01:12:15.120 --> 01:12:16.760
de postgres

01:12:16.760 --> 01:12:19.000
y tambien una sintaxis de ya

01:12:19.000 --> 01:12:21.120
por ejemplo voy a colocar aqui empleados

01:12:22.160 --> 01:12:23.680
abro parentesi de aqui

01:12:23.680 --> 01:12:25.120
voy a crear aqui identificación

01:12:25.860 --> 01:12:27.420
de tipo integer

01:12:27.420 --> 01:12:29.800
y voy a colocar aqui

01:12:31.020 --> 01:12:31.120
nombres

01:12:31.780 --> 01:12:33.440
integer aqui que vena

01:12:34.360 --> 01:12:35.120
integer

01:12:35.120 --> 01:12:37.120
nombres bar chart

01:12:37.940 --> 01:12:39.080
de 60

01:12:39.080 --> 01:12:41.320
tambien puede ver usados stream

01:12:41.320 --> 01:12:42.220
punto y coma

01:12:42.220 --> 01:12:44.080
y aqui aplicas

01:12:44.080 --> 01:12:45.980
exactamente ya lo que sabes de base

01:12:45.980 --> 01:12:46.800
de datos

01:12:48.480 --> 01:12:50.600
vamos a insertar aqui un registro

01:12:50.600 --> 01:12:52.660
insert quinto

01:12:55.480 --> 01:12:56.040
empleados

01:12:57.300 --> 01:12:57.860
values

01:12:59.180 --> 01:13:00.300
abro aca

01:13:00.300 --> 01:13:02.200
y le digo este señor

01:13:02.200 --> 01:13:04.580
como es entero necesita comida simple 92

01:13:05.220 --> 01:13:06.400
y el nombre suyo

01:13:06.400 --> 01:13:08.260
si usted quiere John aqui y termina

01:13:08.260 --> 01:13:10.020
con punti com si

01:13:10.020 --> 01:13:11.600
o si usted quiere si

01:13:13.020 --> 01:13:14.380
al final lo que necesito

01:13:14.380 --> 01:13:15.580
nada mas es esta base de datos

01:13:15.580 --> 01:13:17.720
si quiere yo hago lo demas para que lo vea

01:13:17.720 --> 01:13:20.160
creo que ahi me va a mostrar

01:13:21.500 --> 01:13:22.000
nada mas

01:13:22.000 --> 01:13:23.540
necesito que cree la base de datos

01:13:25.740 --> 01:13:26.100
listo

01:13:26.100 --> 01:13:27.680
entonces permiteme yo creo la tablita aca

01:13:27.680 --> 01:13:29.580
porque la tabla la vamos a crear ya directamente

01:13:30.100 --> 01:13:31.800
aqui lo que necesitaban nada mas era crear

01:13:31.800 --> 01:13:32.640
la base de datos

01:13:33.340 --> 01:13:34.980
pero permiteme hacer un ejemplo aqui

01:13:36.080 --> 01:13:38.300
y voy a insertar entonces

01:13:38.300 --> 01:13:40.300
voy a crear la tabla

01:13:41.820 --> 01:13:44.160
crear la tabla

01:13:44.160 --> 01:13:45.920
copiar

01:13:45.920 --> 01:13:48.300
listo copiar

01:13:49.980 --> 01:13:52.600
creo la tablita aqui

01:13:54.300 --> 01:13:55.180
si señor

01:13:56.300 --> 01:13:57.660
si señor

01:13:58.300 --> 01:14:00.080
y no tenemos

01:14:00.080 --> 01:14:02.720
y en realidad tengo poca memoria

01:14:02.720 --> 01:14:05.260
osea con el

01:14:05.260 --> 01:14:06.300
docker tengo menos memoria

01:14:06.820 --> 01:14:08.520
que la que tenia anteriormente

01:14:08.520 --> 01:14:10.940
si señor efectivamente

01:14:11.400 --> 01:14:12.580
recordara que normalmente

01:14:12.580 --> 01:14:14.700
cuando yo tengo una base de datos

01:14:16.360 --> 01:14:16.800
normalmente

01:14:16.800 --> 01:14:18.720
los procesos de actualizacion no hay

01:14:18.720 --> 01:14:20.820
mas que todo hay procesos de insertion

01:14:23.940 --> 01:14:24.980
creo que no me creo

01:14:24.980 --> 01:14:26.640
la tabla que fue

01:14:26.640 --> 01:14:28.100
voy a darle aqui show tables

01:14:28.100 --> 01:14:29.180
para que me muestra las tablas

01:14:29.180 --> 01:14:29.760
show tables

01:14:33.100 --> 01:14:33.980
punto y coma

01:14:35.840 --> 01:14:36.920
si empleados

01:14:36.920 --> 01:14:38.320
ah me equivoque yo

01:14:38.320 --> 01:14:40.100
para me equivoque yo fue

01:14:40.720 --> 01:14:41.260
empleados

01:14:43.340 --> 01:14:44.420
si efectivamente

01:14:44.420 --> 01:14:46.840
y creo que bueno ya Alejandro lo tiene

01:14:46.840 --> 01:14:48.760
nota que ahi te aparece un mac reduce

01:14:48.760 --> 01:14:49.460
cierto

01:14:49.980 --> 01:14:51.980
significa que para insertar esta utilizando

01:14:52.560 --> 01:14:53.980
mac reduce

01:14:53.980 --> 01:14:57.460
sobre el nodo de

01:14:57.460 --> 01:14:57.980
de clusters de hadut

01:14:59.500 --> 01:15:00.040
entonces

01:15:00.040 --> 01:15:02.240
si me voy a este nodo ahi esta

01:15:02.240 --> 01:15:04.020
ahi esta exactamente

01:15:04.020 --> 01:15:06.720
si me voy al nodo este

01:15:06.720 --> 01:15:08.460
voy a buscar el nodo

01:15:08.460 --> 01:15:10.340
el cluster en el

01:15:10.340 --> 01:15:12.100
en el browser

01:15:12.100 --> 01:15:14.220
un momento a ver

01:15:14.220 --> 01:15:15.900
voy a ver aqui

01:15:15.900 --> 01:15:17.580
browser file system

01:15:17.580 --> 01:15:20.160
y note que aqui tiene ya un

01:15:20.160 --> 01:15:22.420
propietario hype de dos carpetas

01:15:22.420 --> 01:15:24.480
opt y ten

01:15:24.480 --> 01:15:26.080
indicando entonces

01:15:26.080 --> 01:15:27.540
confirmando que en realidad

01:15:27.540 --> 01:15:29.760
tiene que subir de ax al root

01:15:29.760 --> 01:15:31.060
hay que subir de nivel

01:15:31.060 --> 01:15:33.420
y van a notar entonces que ahi creo

01:15:33.420 --> 01:15:35.840
unas carpetas el usuario hype

01:15:35.840 --> 01:15:38.080
que es donde tiene almacenado

01:15:38.080 --> 01:15:39.500
los datos como tal

01:15:39.500 --> 01:15:41.320
ah bueno eso es un

01:15:41.320 --> 01:15:43.800
interesante el como

01:15:43.800 --> 01:15:45.700
depende de la tabla de como

01:15:45.700 --> 01:15:47.660
se definen hype

01:15:47.660 --> 01:15:49.800
entonces para esas tablas vamos a

01:15:49.800 --> 01:15:51.960
subir para adelantarnos

01:15:51.960 --> 01:15:53.720
esa pregunta deja la abierta

01:15:53.720 --> 01:15:55.600
y te la respondo ahora cuando creemos la

01:15:55.600 --> 01:15:57.820
tabla aca con el hype

01:15:57.820 --> 01:15:59.640
vamos entonces

01:16:00.600 --> 01:16:01.600
aca listo

01:16:01.600 --> 01:16:03.500
lo unico que necesitaba era crear al final

01:16:05.460 --> 01:16:06.460
la tabla

01:16:06.460 --> 01:16:07.760
perdon la base de datos

01:16:08.580 --> 01:16:09.780
y confirmamos aqui

01:16:09.780 --> 01:16:12.020
que ya me esta insertando en

01:16:12.020 --> 01:16:14.140
en el en la maquina virtual

01:16:14.140 --> 01:16:15.840
del cluster dejado aqui no

01:16:15.840 --> 01:16:18.320
voy a hacer mas nada entonces me voy a meter aca con

01:16:18.320 --> 01:16:19.440
el talent big data

01:16:20.120 --> 01:16:23.280
voy a cerrar todos estos señores de aqui

01:16:23.880 --> 01:16:25.680
bueno perdon deja uno abierto

01:16:25.680 --> 01:16:27.980
porque voy a copiar esta conexion de aqui

01:16:28.840 --> 01:16:31.740
no mentira en este ejemplo pequeñito

01:16:32.380 --> 01:16:32.940
no

01:16:32.940 --> 01:16:37.700
en este ejemplo pequeñito para que usted entienda como se hace eso aqui

01:16:37.700 --> 01:16:39.640
bueno listo entonces

01:16:39.640 --> 01:16:41.860
voy a crear una subcarpeta en axa que se llame

01:16:42.620 --> 01:16:43.180
hype

01:16:45.220 --> 01:16:45.780
perfecto

01:16:45.780 --> 01:16:47.800
y vamos entonces aqui en

01:16:47.800 --> 01:16:49.720
hype vamos aqui crear

01:16:49.720 --> 01:16:51.700
job por favor crear job

01:16:51.700 --> 01:16:53.020
dentro de la carpeta de hype

01:16:53.020 --> 01:16:55.520
coloque aqui job

01:16:58.380 --> 01:16:59.020
creacion

01:17:00.240 --> 01:17:00.880
tabla

01:17:00.880 --> 01:17:02.100
hype

01:17:02.760 --> 01:17:04.700
listo voy aqui donde dice

01:17:05.560 --> 01:17:06.200
finish

01:17:07.480 --> 01:17:09.900
y voy a hacer entonces lo siguiente

01:17:11.440 --> 01:17:13.220
este ejemplo voy a tratar de

01:17:13.220 --> 01:17:15.300
combinar varios porque el

01:17:15.300 --> 01:17:17.020
tiempo no nos esta dando vamos entonces

01:17:17.760 --> 01:17:19.500
vamos a crear una conexion

01:17:19.500 --> 01:17:21.380
que no la hemos hecho a una base

01:17:21.380 --> 01:17:21.680
de datos

01:17:22.200 --> 01:17:24.400
hype es una base de datos siendo que utiliza

01:17:24.400 --> 01:17:26.660
la tecnologia de big data para progresar sus datos

01:17:26.660 --> 01:17:28.320
pero es una base de datos

01:17:28.320 --> 01:17:30.480
entonces para hacer conexiones hacia base de datos

01:17:30.480 --> 01:17:33.320
me voy aqui donde dice debe connection

01:17:33.320 --> 01:17:34.700
click derecho

01:17:36.700 --> 01:17:37.580
crear conexion

01:17:37.580 --> 01:17:39.820
vamos a llamarla entonces

01:17:40.900 --> 01:17:41.340
axa

01:17:41.340 --> 01:17:43.620
por costumbre es mas costumbre

01:17:43.620 --> 01:17:44.920
mía por experiencia

01:17:45.660 --> 01:17:47.660
los nombres de conexion de base de datos

01:17:47.660 --> 01:17:49.560
le doy el nombre de base de datos

01:17:49.560 --> 01:17:52.160
y coloco dentro de parentesis el motor que la tiene

01:17:53.240 --> 01:17:54.140
esto por experiencia

01:17:54.140 --> 01:17:55.880
que tengo desde algunos procesos

01:17:55.880 --> 01:17:57.440
que he tenido que a veces me tengo

01:17:57.440 --> 01:17:59.980
dos bases de datos con igual nombre pero a veces me pierdo de ahí

01:17:59.980 --> 01:18:01.700
a que motor me estoy refiriendo

01:18:01.700 --> 01:18:03.400
entonces para costumbre por buena

01:18:03.400 --> 01:18:05.380
practica mía le coloco axa

01:18:05.380 --> 01:18:06.560
parentesis hype

01:18:07.420 --> 01:18:09.680
le coloco el nombre de base de datos y dentro de parentesis

01:18:09.680 --> 01:18:11.760
le coloco el motor en el que

01:18:12.460 --> 01:18:12.860
esta

01:18:12.860 --> 01:18:14.560
vemos la key next por favor

01:18:15.140 --> 01:18:17.520
y alto seguido lo que va a hacer es que vas a

01:18:17.520 --> 01:18:19.100
buscar aqui donde dice debe tie

01:18:19.100 --> 01:18:21.080
el driver de conexion

01:18:21.080 --> 01:18:23.360
hacia la base de datos que necesitas en este caso

01:18:23.360 --> 01:18:25.680
expande y me interesa el que

01:18:25.680 --> 01:18:27.640
se llama hype

01:18:32.120 --> 01:18:33.700
aqui hay un pequeño bug

01:18:33.700 --> 01:18:35.580
con esta pequeña

01:18:35.580 --> 01:18:36.420
bueno no

01:18:39.680 --> 01:18:41.140
login no tiene

01:18:42.300 --> 01:18:43.460
servidores localhost

01:18:43.460 --> 01:18:44.980
porque aunque esta en el contenedor

01:18:44.980 --> 01:18:47.000
recordara que el contenedor tiene

01:18:47.000 --> 01:18:48.940
la misma hype de la máquina en que este

01:18:49.580 --> 01:18:50.920
entonces queda localhost

01:18:50.920 --> 01:18:52.880
el puerto queda igual lo unico diferente

01:18:52.880 --> 01:18:55.080
es que voy a conectarme no a base de datos default

01:18:55.080 --> 01:18:57.360
sino a base de datos que se llama axa

01:18:57.360 --> 01:18:58.400
como tu lo hayas llamado

01:18:58.400 --> 01:19:00.980
tienes que bajar el scroll para que te de la conexion

01:19:04.180 --> 01:19:04.900
dale un

01:19:04.900 --> 01:19:07.420
text connection aqui por favor

01:19:10.100 --> 01:19:10.580
ok

01:19:11.800 --> 01:19:13.020
este exportar como

01:19:13.020 --> 01:19:15.020
el contexto lo que hace es que te crea las variables

01:19:15.020 --> 01:19:16.900
y te las porta como contextos recuerda que

01:19:16.900 --> 01:19:19.200
los contextos son las variables de entorno

01:19:19.860 --> 01:19:21.060
cuat y de produccion

01:19:21.060 --> 01:19:23.020
que ya hicimos una el dia

01:19:23.640 --> 01:19:24.500
antes de ayer

01:19:24.500 --> 01:19:27.000
aqui dele finish no va a ser mas nada finish

01:19:27.000 --> 01:19:28.740
no no le porte porque

01:19:28.740 --> 01:19:30.080
no vamos a crear contextos

01:19:31.580 --> 01:19:33.020
perfecto listo ahora

01:19:33.540 --> 01:19:35.300
si quiere traerse

01:19:35.300 --> 01:19:36.960
expanda por favor la conexion que creo

01:19:36.960 --> 01:19:38.940
de axa y ahi le va a aparecer

01:19:39.980 --> 01:19:40.940
las estructuras

01:19:40.940 --> 01:19:42.480
que pertenecen a esa base de datos

01:19:42.480 --> 01:19:45.340
en este caso te va a mostrar sinonimo si hay

01:19:45.340 --> 01:19:47.060
tablas y vistas

01:19:47.060 --> 01:19:48.900
los que crearon la tabla

01:19:48.900 --> 01:19:51.160
y la quieren ver para ver de este lado

01:19:51.160 --> 01:19:52.940
vamos a hacer lo siguiente aunque no la haya creado

01:19:52.940 --> 01:19:54.040
hagamos toda la misma prueba

01:19:54.040 --> 01:19:56.720
por defecto cuando yo creo la conexion

01:19:56.720 --> 01:19:57.920
no me trae las

01:19:58.720 --> 01:20:00.800
los objetos de esa base de datos

01:20:00.800 --> 01:20:02.900
me toca hacer lo siguiente

01:20:03.460 --> 01:20:06.840
conectase dele click por favor a la conexion

01:20:06.840 --> 01:20:08.160
click derecho ahora

01:20:08.160 --> 01:20:09.720
y digale donde dice

01:20:09.720 --> 01:20:11.960
extraer esquema

01:20:11.960 --> 01:20:14.080
aqui le pregunta

01:20:14.080 --> 01:20:15.880
si desea filtrar algun tipo de objetos

01:20:15.880 --> 01:20:17.700
nada mas le va a mostrar la herramienta

01:20:17.700 --> 01:20:19.560
tablas, vistas y sonodibus

01:20:19.560 --> 01:20:21.960
dele next aqui

01:20:21.960 --> 01:20:23.940
aqui expanda por favor

01:20:23.940 --> 01:20:25.460
y le van a aparecer los objetos

01:20:25.460 --> 01:20:28.160
lo que aparece vacio es porque no crearon la tabla

01:20:28.700 --> 01:20:30.140
pero entonces si deseamos

01:20:30.140 --> 01:20:32.240
exportar esa tabla dele click aqui en

01:20:32.240 --> 01:20:34.000
axa para poder

01:20:34.780 --> 01:20:36.400
seleccionar todos los objetos

01:20:36.400 --> 01:20:38.400
le da next

01:20:40.440 --> 01:20:41.040
finish

01:20:41.920 --> 01:20:43.420
y este señor

01:20:43.420 --> 01:20:45.520
ya le va a traer aqui todos los objetos

01:20:45.520 --> 01:20:47.400
que pertenecen a esa tabla

01:20:48.520 --> 01:20:49.120
ok

01:20:49.120 --> 01:20:51.760
eso funciona exactamente

01:20:51.760 --> 01:20:53.060
igual independiente del motor

01:20:53.060 --> 01:20:55.400
MySQL, Postgres, Oracle SQL Server

01:20:56.100 --> 01:20:57.480
por defecto la herramienta

01:20:57.480 --> 01:20:59.120
no te hace la conexion

01:20:59.120 --> 01:21:01.240
pero tienes que hacer de forma manual extraer

01:21:01.240 --> 01:21:03.540
los diferentes elementos que quieres de esa base de datos

01:21:03.540 --> 01:21:05.440
eso porque lo hacen de esa manera

01:21:05.440 --> 01:21:07.440
porque hay veces que base de datos tiene muchos componentes

01:21:07.440 --> 01:21:08.100
o muchos objetos

01:21:08.100 --> 01:21:09.740
y te puede demorar mucho

01:21:09.740 --> 01:21:12.160
entonces lo ideal es que crees la conexion

01:21:12.160 --> 01:21:14.720
y exportes o traiga los objetos

01:21:14.720 --> 01:21:16.300
que tu quieras manejar

01:21:16.300 --> 01:21:19.480
en la conexion como tal

01:21:19.480 --> 01:21:21.940
vamos entonces a hacer lo siguiente ahora

01:21:21.940 --> 01:21:24.620
vamos a crear aqui un tepre job

01:21:24.620 --> 01:21:26.080
aqui en el job

01:21:26.080 --> 01:21:27.120
un tepre job

01:21:27.780 --> 01:21:29.140
no, no hay problema

01:21:29.140 --> 01:21:31.620
deje la tabla, ahora cuando creemos la tabla aqui en el job

01:21:31.620 --> 01:21:33.320
hace lo mismo ahora

01:21:33.320 --> 01:21:34.180
para que le devuelva

01:21:34.180 --> 01:21:36.220
la tabla que vamos a crear en el job

01:21:37.000 --> 01:21:38.480
no hay problema

01:21:38.480 --> 01:21:41.320
esa practica la hacemos ahora cuando ejecutemos este job

01:21:42.500 --> 01:21:43.400
tengo aqui

01:21:43.400 --> 01:21:44.900
un tepre job por favor

01:21:44.900 --> 01:21:47.820
y al igual que con HDFS

01:21:47.820 --> 01:21:49.700
hay un tehy connection

01:21:49.700 --> 01:21:51.320
tehy connection

01:21:52.180 --> 01:21:53.660
conecta el tepre job

01:21:53.660 --> 01:21:55.320
con el tehy connection

01:22:00.700 --> 01:22:01.360
listo

01:22:01.360 --> 01:22:04.080
ahora vamos a configurar el tehy connection

01:22:05.720 --> 01:22:07.200
si tu quieres puedes

01:22:07.200 --> 01:22:09.300
volver a configurar todo nuevamente

01:22:09.300 --> 01:22:11.120
pero como ya tenemos la configuracion

01:22:11.120 --> 01:22:13.240
y que se encuentra en el repositorio

01:22:13.240 --> 01:22:15.320
tienes que irte a property type

01:22:15.320 --> 01:22:17.480
y le vas a decir ahora que no vas a

01:22:17.480 --> 01:22:19.320
construirla sino que vas a tomar una

01:22:19.320 --> 01:22:21.240
configuracion que esta en el repositorio

01:22:21.240 --> 01:22:23.360
le das aqui repositorio

01:22:24.080 --> 01:22:25.360
te cambia entonces

01:22:26.440 --> 01:22:27.400
aqui listo

01:22:27.400 --> 01:22:29.540
dale click ahi en el mas mas que aparece

01:22:29.540 --> 01:22:31.760
y escoge la conexion

01:22:31.760 --> 01:22:33.820
de hay que esta habilitada

01:22:33.820 --> 01:22:36.320
aqui la escojo y doy ok

01:22:37.540 --> 01:22:39.500
con esta distribucion no hay mucho

01:22:39.500 --> 01:22:41.420
problema porque al fin y al cabo

01:22:41.420 --> 01:22:43.680
quien maneja la conexion con hadut

01:22:43.680 --> 01:22:44.780
es hay

01:22:45.540 --> 01:22:47.440
y no en este caso esta en

01:22:47.440 --> 01:22:49.460
big data entonces lo dejo ahi

01:22:49.460 --> 01:22:51.540
con eso esa configuracion

01:22:54.320 --> 01:22:55.360
agrego un

01:22:55.360 --> 01:22:56.180
tepos job

01:23:02.220 --> 01:23:04.360
y voy a colocar aqui

01:23:04.360 --> 01:23:05.620
un tehy close

01:23:06.660 --> 01:23:08.120
como buena practica

01:23:08.120 --> 01:23:09.800
para cerrar la conexion porque si no

01:23:09.800 --> 01:23:12.660
puede quedar abierta y

01:23:12.660 --> 01:23:14.120
ocuparte memoria

01:23:14.120 --> 01:23:16.360
esa conexion que no lo vas a usar

01:23:16.360 --> 01:23:18.080
pero quedo abierta entonces

01:23:18.080 --> 01:23:20.060
lo configuras con la conexion

01:23:20.060 --> 01:23:25.000
que vas a cerrar

01:23:25.000 --> 01:23:27.520
perfecto listo entonces

01:23:27.520 --> 01:23:29.320
agreguemos los siguientes componentes

01:23:29.320 --> 01:23:31.300
ahora vamos agregar aqui

01:23:31.300 --> 01:23:32.920
un tehy

01:23:32.920 --> 01:23:34.960
create table

01:23:47.320 --> 01:23:49.980
y acto seguido vamos agregar aqui

01:23:49.980 --> 01:23:51.320
un

01:23:52.320 --> 01:23:53.320
tehy row

01:23:55.640 --> 01:23:57.220
tehy row

01:23:58.380 --> 01:23:59.140
y

01:23:59.140 --> 01:24:00.220
se me escapo

01:24:00.220 --> 01:24:03.580
agrego un tehy row generator

01:24:07.780 --> 01:24:09.320
listo son los componentes

01:24:09.320 --> 01:24:11.220
que vamos a utilizar entonces

01:24:18.640 --> 01:24:19.820
conecta el

01:24:19.820 --> 01:24:21.620
tehy row generator

01:24:21.620 --> 01:24:23.400
ya te explico para que sirve eso

01:24:23.400 --> 01:24:25.420
con el tehy row

01:24:27.420 --> 01:24:29.440
vas a conectar el

01:24:29.440 --> 01:24:31.320
tehy create con un

01:24:31.320 --> 01:24:33.180
sub job diferente entonces click

01:24:33.180 --> 01:24:34.960
derecho disparador

01:24:34.960 --> 01:24:37.060
con sub job ok

01:24:37.060 --> 01:24:39.360
hacia el tehy row generator

01:24:40.080 --> 01:24:41.220
ahi tendria

01:24:41.220 --> 01:24:44.000
4 sub jobs el del tehy pre job

01:24:44.000 --> 01:24:45.580
el del tehy create

01:24:45.580 --> 01:24:47.380
table el del tehy row

01:24:47.380 --> 01:24:49.220
generator y el del tehy post job

01:24:51.140 --> 01:24:53.500
vamos la primera confesion del tehy create

01:24:53.500 --> 01:24:55.260
table entonces vamos a darle doble click

01:24:55.260 --> 01:24:57.600
listo doble click a ese tehy

01:24:57.600 --> 01:24:58.080
create

01:24:58.940 --> 01:25:01.220
voy a hacer una tallita sencilla entonces

01:25:01.220 --> 01:25:03.300
en primer lugar voy a decirle que

01:25:03.300 --> 01:25:05.380
ya tengo una conexion prestalecida use

01:25:05.380 --> 01:25:07.260
conexion existente y la escojo

01:25:08.140 --> 01:25:09.260
tehy connection

01:25:10.520 --> 01:25:11.480
nombre de

01:25:11.480 --> 01:25:13.480
tabla por favor entre comillas

01:25:13.480 --> 01:25:15.260
dobles coloquele clientes

01:25:16.120 --> 01:25:17.380
si estamos le coloque

01:25:17.380 --> 01:25:19.260
clientes ahi perfecto clientes

01:25:20.100 --> 01:25:21.380
en el action on

01:25:21.380 --> 01:25:23.520
table hay varias opciones

01:25:23.520 --> 01:25:25.620
entre las opciones que aparecen nada mas hay dos

01:25:25.620 --> 01:25:27.740
una que es crear tabla

01:25:27.740 --> 01:25:29.580
el problema de crear tabla es

01:25:29.580 --> 01:25:31.460
que si la tabla esta creada ya

01:25:31.460 --> 01:25:32.600
va a salir un error

01:25:33.700 --> 01:25:35.500
entonces si este job lo voy a

01:25:35.500 --> 01:25:37.520
ejecutar ene veces lo mejor

01:25:37.520 --> 01:25:39.620
es que crea tabla si no

01:25:39.620 --> 01:25:40.100
existe

01:25:41.400 --> 01:25:43.540
para evitar es el mismo problema

01:25:43.540 --> 01:25:45.520
que teniamos con el

01:25:46.100 --> 01:25:47.620
con el del cluster de

01:25:47.620 --> 01:25:49.840
hadoop cuando creamos el archivo

01:25:49.840 --> 01:25:51.680
en el output habia que sobre escribirlo

01:25:51.680 --> 01:25:53.060
si lo ejecutaba varias veces

01:25:53.960 --> 01:25:56.140
bien y este señor le permite

01:25:56.140 --> 01:25:57.140
a usted que hacer

01:25:57.680 --> 01:25:59.640
en el formato de tabla

01:25:59.640 --> 01:26:02.180
permite hacer varias cosas

01:26:02.180 --> 01:26:04.160
o varios formatos

01:26:04.160 --> 01:26:05.760
dele click por favor en los formatos

01:26:05.760 --> 01:26:07.720
que estan prestablecidos ahi

01:26:07.720 --> 01:26:09.680
y notara que hay diferentes formatos

01:26:10.380 --> 01:26:12.080
el primer formato que aparece

01:26:12.080 --> 01:26:13.920
es el de text file

01:26:13.920 --> 01:26:15.600
text file indicaria

01:26:15.600 --> 01:26:17.540
en que los datos se van a guardar

01:26:17.540 --> 01:26:19.820
en texto plano en el

01:26:21.820 --> 01:26:23.700
por ende al estar en el

01:26:23.700 --> 01:26:25.800
texto plano ahi

01:26:25.800 --> 01:26:27.880
pues yo creeria que la unica ventaja

01:26:27.880 --> 01:26:29.720
que vas a poder hacer ahi es que es

01:26:29.720 --> 01:26:31.780
legible tu puede hacer consultas

01:26:31.780 --> 01:26:33.900
ese ql ahi y

01:26:33.900 --> 01:26:35.520
vas a entender porque esta en plano

01:26:36.460 --> 01:26:37.860
pero el problema que vas

01:26:37.860 --> 01:26:40.300
a tener con un text file es que

01:26:40.300 --> 01:26:42.240
no hay ningun tipo de compresion

01:26:42.240 --> 01:26:43.860
entonces como no hay

01:26:43.860 --> 01:26:45.860
comprension vas a ocupar

01:26:45.860 --> 01:26:48.020
mas espacio el rendimiento va a ser

01:26:48.020 --> 01:26:50.100
mas lento y evidentemente vas

01:26:50.100 --> 01:26:51.860
a consumir mucho mas espacio porque ni siquiera

01:26:51.860 --> 01:26:53.300
haces comprension de los archivos

01:26:54.020 --> 01:26:56.020
los archivos no se guardan comprimidos

01:26:57.100 --> 01:26:58.200
listo ese text

01:26:58.200 --> 01:26:59.760
file yo

01:26:59.760 --> 01:27:01.980
diria que es posible usarlo siempre

01:27:01.980 --> 01:27:04.180
cuando tu tengas

01:27:04.180 --> 01:27:05.800
mucho espacio y donde

01:27:05.800 --> 01:27:07.860
quizas el rendimiento de las consultas

01:27:07.860 --> 01:27:10.240
no requiera

01:27:10.240 --> 01:27:11.680
unos tiempos de respuesta

01:27:11.680 --> 01:27:13.760
bastante bajos porque

01:27:13.760 --> 01:27:15.920
la ventaja va a tener es que el rendimiento va a ser

01:27:15.920 --> 01:27:17.000
muy lento

01:27:17.920 --> 01:27:19.960
despues sigue ahi le aparece uno

01:27:19.960 --> 01:27:22.340
que se llama sequence files

01:27:22.340 --> 01:27:24.320
ese sequence files

01:27:24.320 --> 01:27:25.920
es propio de jadut

01:27:26.540 --> 01:27:28.300
ahora este jadut

01:27:29.000 --> 01:27:30.080
va a guardar

01:27:30.080 --> 01:27:31.140
los datos comprimidos

01:27:31.140 --> 01:27:33.700
y recordaras un poquito el dia de ayer

01:27:33.700 --> 01:27:35.440
cuando guardamos

01:27:35.440 --> 01:27:37.440
un archivo comprimido

01:27:37.440 --> 01:27:40.220
que tu lo abrias y ese archivo estaba en binario

01:27:40.220 --> 01:27:42.260
no entendias lo que ibas a hacer

01:27:42.260 --> 01:27:44.060
ahora al guardarlo

01:27:44.060 --> 01:27:46.460
de esta manera jadut lo prepara

01:27:46.460 --> 01:27:47.760
para que se ejecute tambien

01:27:47.760 --> 01:27:49.220
en forma paralela

01:27:49.220 --> 01:27:51.620
pero creo que la ventaja

01:27:51.620 --> 01:27:54.060
que vas a tener es que es

01:27:54.760 --> 01:27:56.220
dependiente de jadut

01:27:56.220 --> 01:27:58.160
significa entonces en que para

01:27:58.160 --> 01:27:59.860
poder reutilizar esto en una infra

01:27:59.860 --> 01:28:00.720
infraestructura diferente

01:28:00.720 --> 01:28:03.800
tiene que estar casada con jadut

01:28:03.800 --> 01:28:06.140
pensar en cambiar de a un synapse

01:28:06.140 --> 01:28:07.840
a un mr

01:28:07.840 --> 01:28:09.640
aunque mr usa jadut

01:28:09.640 --> 01:28:12.060
pensar de pronto en un

01:28:12.060 --> 01:28:13.600
s3 por ejemplo

01:28:13.600 --> 01:28:16.500
ahi si vos tenes inconvenientes

01:28:16.500 --> 01:28:17.480
despues sigue

01:28:18.060 --> 01:28:20.320
dos en particular que dice rcfile

01:28:20.320 --> 01:28:22.000
y orcfile

01:28:22.000 --> 01:28:24.100
basicamente son los mismos formatos

01:28:24.100 --> 01:28:25.980
la diferencia es que orcfile

01:28:25.980 --> 01:28:28.280
es mucho mas

01:28:28.280 --> 01:28:29.720
eficiente que el

01:28:29.720 --> 01:28:32.160
rcfile te preguntaras si es

01:28:32.160 --> 01:28:33.860
mas eficiente porque nada mas dejan

01:28:33.860 --> 01:28:36.320
orcfile porque el problema que tienes

01:28:36.320 --> 01:28:38.060
es que hay una incompatibilidad

01:28:38.700 --> 01:28:40.060
con esos dos formatos

01:28:41.100 --> 01:28:42.180
entonces quizas hay

01:28:42.180 --> 01:28:44.080
sistemas que ya lo tienes con rcfile

01:28:44.720 --> 01:28:46.060
y quisieras mantenerlo

01:28:46.780 --> 01:28:48.340
entonces por eso se mantiene los dos

01:28:48.340 --> 01:28:50.420
ahora una característica

01:28:50.420 --> 01:28:52.060
que tiene rcfile

01:28:53.880 --> 01:28:54.840
que tiene

01:28:55.460 --> 01:28:56.660
uno que se llama

01:28:56.660 --> 01:28:58.140
y que creo que

01:28:58.140 --> 01:28:59.940
mis calculos no me fallan porque

01:28:59.940 --> 01:29:02.160
me parecio ver un job una vez que me reuní

01:29:02.160 --> 01:29:04.000
con alguien que trabajaba para ustedes

01:29:05.200 --> 01:29:06.480
es muy parecido

01:29:06.480 --> 01:29:08.120
a ese señor que se llama parket

01:29:08.120 --> 01:29:08.840
que esta ahi abajo

01:29:08.840 --> 01:29:11.700
el formato parket quizas lo has escuchado

01:29:11.700 --> 01:29:13.340
que aclaro ese formato

01:29:13.340 --> 01:29:15.180
la idea del formato no es nueva

01:29:16.060 --> 01:29:18.180
eso venia antes

01:29:18.180 --> 01:29:19.940
ese formato corresponde

01:29:19.940 --> 01:29:21.440
a un tipo de base de datos

01:29:21.440 --> 01:29:24.300
que se llama tipo columnar

01:29:24.300 --> 01:29:26.560
que favorece

01:29:26.560 --> 01:29:28.300
las operaciones

01:29:28.300 --> 01:29:29.840
que tu vayas a hacer

01:29:29.840 --> 01:29:32.540
eso por ejemplo en el caso

01:29:32.540 --> 01:29:34.240
de colombia

01:29:34.240 --> 01:29:35.760
yo te habia comentado el dia de ayer

01:29:35.760 --> 01:29:38.180
que hay una empresa

01:29:38.180 --> 01:29:39.920
que es un supermercado

01:29:39.920 --> 01:29:41.140
en que tiene varias

01:29:41.760 --> 01:29:42.620
sucursales por ciudad

01:29:43.760 --> 01:29:45.880
osea toda colombia seguramente en mexico

01:29:45.880 --> 01:29:47.760
debe haber alguna muy parecida

01:29:47.760 --> 01:29:49.760
aca se llama super tiendas olimpica

01:29:50.700 --> 01:29:52.040
entonces esos

01:29:52.040 --> 01:29:53.640
señores tienen demasiado

01:29:53.640 --> 01:29:55.840
dinero pero bastante dinero

01:29:55.840 --> 01:29:57.600
y ellos mantienen entonces

01:29:57.600 --> 01:29:58.740
varias

01:29:59.760 --> 01:30:01.500
en una misma ciudad puede haber varias

01:30:01.500 --> 01:30:03.300
sucursales aunque

01:30:03.300 --> 01:30:05.300
ellos tienen ahora una

01:30:06.180 --> 01:30:07.260
una pelea fuerte

01:30:07.260 --> 01:30:08.860
con una competencia dura

01:30:08.860 --> 01:30:11.340
que llaman aqui en colombia

01:30:11.340 --> 01:30:13.300
supermercados de bajo costo

01:30:14.400 --> 01:30:15.400
entonces hay

01:30:15.400 --> 01:30:17.120
una tendencia en que ya estan

01:30:17.120 --> 01:30:19.640
vendiendo mucho menos porque esos supermercados

01:30:19.640 --> 01:30:21.160
tambien la idea es

01:30:21.160 --> 01:30:23.700
ocupar toda colombia pero bueno depende de eso

01:30:23.700 --> 01:30:24.940
ese ors files

01:30:24.940 --> 01:30:26.840
es mas tipo columnar aqui

01:30:26.840 --> 01:30:29.220
coloco un ejemplo muy sencillo de pronto

01:30:29.220 --> 01:30:31.300
para que lo tengas en cuenta

01:30:31.300 --> 01:30:33.300
suponte que tu tengas aca

01:30:33.920 --> 01:30:35.400
sucursal

01:30:35.400 --> 01:30:37.300
columna sucursal

01:30:38.260 --> 01:30:39.380
tenga por si te

01:30:39.380 --> 01:30:41.640
algo ventas

01:30:41.640 --> 01:30:43.300
aca colocamos estas dos

01:30:43.300 --> 01:30:45.300
tengas aqui sucursal x

01:30:45.940 --> 01:30:47.320
ventas

01:30:47.320 --> 01:30:48.680
12 tal

01:30:49.300 --> 01:30:50.940
sucursal y

01:30:50.940 --> 01:30:53.740
ventas fueron 3 14

01:30:53.740 --> 01:30:54.780
z

01:30:55.300 --> 01:30:56.880
15

01:30:56.880 --> 01:30:59.440
o te lo voy a colocar

01:30:59.440 --> 01:31:01.500
para que sea mucho mas sencillo de entender

01:31:01.500 --> 01:31:03.200
voy a colocar todos

01:31:03.200 --> 01:31:04.540
en sucursal x

01:31:04.540 --> 01:31:06.460
para que me quede mejor el ejemplo

01:31:07.500 --> 01:31:09.180
si a ti te preguntaran

01:31:09.180 --> 01:31:11.060
te dijeran o

01:31:11.060 --> 01:31:13.320
te pidieran que sacaras

01:31:13.320 --> 01:31:15.100
el total de ventas que tienes

01:31:15.100 --> 01:31:17.100
al momento de la sucursal x

01:31:17.800 --> 01:31:19.300
tu podias hacerlo con

01:31:19.300 --> 01:31:21.340
ese ql o lo que uno llama con

01:31:21.340 --> 01:31:23.440
tecnologia tradicional seria

01:31:23.440 --> 01:31:25.440
un select listo puedes agrupar

01:31:26.380 --> 01:31:27.560
pero el agrupar

01:31:27.560 --> 01:31:28.420
es un sum

01:31:28.420 --> 01:31:31.300
de la columna ventas

01:31:31.300 --> 01:31:32.760
lo que pasa es que

01:31:32.760 --> 01:31:35.200
internamente lo que hace el sql

01:31:35.200 --> 01:31:37.060
es que tendria que sacar

01:31:37.060 --> 01:31:39.580
registro por registro e ir sumando

01:31:39.580 --> 01:31:41.440
yo podria mejorar

01:31:41.440 --> 01:31:43.360
esto por ejemplo haciendo

01:31:44.260 --> 01:31:44.980
particiones

01:31:45.560 --> 01:31:47.320
las particiones velo como si

01:31:47.320 --> 01:31:49.240
fuese una carpeta imaginate que yo

01:31:49.240 --> 01:31:51.480
crea una partición por sucursal

01:31:51.480 --> 01:31:53.660
imaginate que cada sucursal

01:31:53.660 --> 01:31:55.540
se guarda a nivel de una secundaria

01:31:55.540 --> 01:31:56.600
por una carpeta diferente

01:31:56.600 --> 01:31:58.720
cuando tu haces un group by por la

01:31:59.540 --> 01:32:01.200
sucursal ya el sabe que

01:32:01.200 --> 01:32:02.820
todo lo que esta en la carpeta por

01:32:02.820 --> 01:32:05.480
si necesita algo x son de la

01:32:05.480 --> 01:32:06.740
sucursal x

01:32:06.740 --> 01:32:09.160
es mas eficiente a la hora de hacer

01:32:09.160 --> 01:32:10.840
los calculos pero todavia

01:32:10.840 --> 01:32:12.740
me quedo en que me toca sacar

01:32:12.740 --> 01:32:15.660
registro por registro e ir sumando

01:32:15.660 --> 01:32:17.700
la idea esta de columnar

01:32:17.700 --> 01:32:19.460
que en realidad no es

01:32:19.460 --> 01:32:21.540
nueva es que no

01:32:22.540 --> 01:32:25.520
lo guarde en ese

01:32:25.520 --> 01:32:27.760
columna sucursal

01:32:27.760 --> 01:32:29.480
imaginate que esto lo guardo en un solo

01:32:29.480 --> 01:32:31.380
archivo sucursal

01:32:32.380 --> 01:32:32.940
sucursal

01:32:33.520 --> 01:32:35.740
y aqui guardo nada mas x

01:32:35.740 --> 01:32:37.720
y aca en ventas guardo

01:32:37.720 --> 01:32:39.280
los valores 12,

01:32:41.080 --> 01:32:41.640
14

01:32:41.640 --> 01:32:43.040
y 15

01:32:43.040 --> 01:32:45.540
que diferencia hay que cuando ahora tu vas a hacer

01:32:45.540 --> 01:32:47.220
la sumatoria de las ventas

01:32:47.220 --> 01:32:49.280
yo en realidad nada mas traigo este archivo

01:32:49.280 --> 01:32:51.520
que es la columna ventas y voy sumando

01:32:52.480 --> 01:32:53.440
efectivamente si quiera

01:32:53.440 --> 01:32:54.740
hacer un join habia que hacer

01:32:54.740 --> 01:32:57.180
un join de este archivo

01:32:57.180 --> 01:32:59.200
practicamente con quizas

01:32:59.200 --> 01:33:00.900
este archivo que seria otra columna

01:33:00.900 --> 01:33:02.980
cada columna se guarda en archivos diferentes

01:33:02.980 --> 01:33:05.560
pero la ventaja que tengo es que como solamente saco

01:33:05.560 --> 01:33:07.420
un solo registro

01:33:07.420 --> 01:33:09.420
de esa columna sumar eso

01:33:09.420 --> 01:33:11.900
y hacer operacion de matematica pues

01:33:11.900 --> 01:33:13.640
eso la diferencia

01:33:13.640 --> 01:33:15.440
con la tradicional pues

01:33:16.880 --> 01:33:18.080
es abismal

01:33:18.080 --> 01:33:19.860
a nivel de tiempo de respuestas

01:33:19.860 --> 01:33:21.320
entonces ese archivo

01:33:21.880 --> 01:33:23.320
orsl te guarda en formato

01:33:26.260 --> 01:33:26.740
formato

01:33:26.740 --> 01:33:27.260
aclaro

01:33:27.260 --> 01:33:29.500
del sequence files hacia el parquet

01:33:29.500 --> 01:33:31.440
perdon hacia el

01:33:31.440 --> 01:33:33.060
storas te guarda en binario

01:33:33.060 --> 01:33:35.560
parquet si no estoy mal te la guarda en

01:33:35.560 --> 01:33:38.360
ahora reviso para ver que ahora no recuerdo como te la guarda aqui en hide

01:33:39.100 --> 01:33:39.580
pero

01:33:39.580 --> 01:33:41.480
aclaro que parquet tambien se puede usar en otros motores

01:33:41.480 --> 01:33:43.400
no solamente de hide

01:33:43.400 --> 01:33:45.340
es un formato de archivo y que

01:33:45.340 --> 01:33:47.800
es usado por diferentes motores de baseados

01:33:47.800 --> 01:33:49.440
abro te lo guarda en binario

01:33:49.440 --> 01:33:50.680
estoy seguro

01:33:51.440 --> 01:33:52.800
input y output format

01:33:52.800 --> 01:33:55.360
ese que esta ahi al igual que

01:33:55.360 --> 01:33:57.260
el storas

01:33:57.260 --> 01:33:59.220
eso te lo coloca

01:33:59.220 --> 01:34:01.380
si tu quieres definir un

01:34:01.380 --> 01:34:03.580
formato especial de almacenamiento

01:34:03.580 --> 01:34:05.720
es si tu defines el formato

01:34:05.720 --> 01:34:07.800
no tomas un formato convencional

01:34:07.800 --> 01:34:09.520
yo particularmente nunca

01:34:09.520 --> 01:34:11.180
metido eso porque es desgastante

01:34:11.180 --> 01:34:13.360
normalmente ya estos formatos

01:34:13.360 --> 01:34:14.980
que estan ya estan ha sido

01:34:14.980 --> 01:34:16.320
testeado y funcionan bien

01:34:16.320 --> 01:34:18.440
y por ultimo tendrias el parquet

01:34:18.440 --> 01:34:20.120
que es el que te mencione que esta en colunar

01:34:20.120 --> 01:34:22.240
entonces tu tienes que definir como

01:34:22.240 --> 01:34:24.380
vas a guardar los datos en este caso en particular

01:34:24.380 --> 01:34:26.560
quiero que sea text file

01:34:27.860 --> 01:34:28.980
y esta base

01:34:28.980 --> 01:34:30.600
de datos esta tabla perdon no va a tener

01:34:30.600 --> 01:34:32.480
particiones no voy a crear particiones

01:34:32.480 --> 01:34:34.780
porque la cantidad de registros que voy a almacenar

01:34:34.780 --> 01:34:35.980
es elevadamente pequeño

01:34:35.980 --> 01:34:37.700
listo entonces

01:34:38.320 --> 01:34:39.520
eso es lo que hace ahi

01:34:39.520 --> 01:34:41.440
este row generator

01:34:41.440 --> 01:34:44.320
para que este te genera filas

01:34:45.460 --> 01:34:47.240
cuantas la que tu quieras

01:34:47.240 --> 01:34:48.280
para que sirve eso

01:34:48.280 --> 01:34:50.660
aparte de probar algunas cosas como la que voy a hacer ahora

01:34:50.660 --> 01:34:52.300
que voy a generar unos

01:34:52.300 --> 01:34:54.320
aleatoriamente voy a generar unos filas

01:34:54.320 --> 01:34:56.340
para hacerlas almacenadas en la tabla

01:34:57.420 --> 01:34:58.280
este row generator

01:34:58.280 --> 01:35:00.640
es muy bueno cuando quieres hacer pruebas de estres

01:35:01.700 --> 01:35:02.580
por ejemplo

01:35:02.580 --> 01:35:03.840
tu quieres un sistema

01:35:03.840 --> 01:35:06.300
y quisieras saber cual

01:35:06.300 --> 01:35:08.420
la eficiencia de ese sistema

01:35:08.980 --> 01:35:10.600
entonces tu haces una prueba de estres

01:35:10.600 --> 01:35:12.380
entonces empiezas a insertar una cantidad

01:35:12.380 --> 01:35:14.440
de registros y miras como se

01:35:14.440 --> 01:35:16.320
comporta con esos registros

01:35:16.320 --> 01:35:17.980
despues haces una proyeccion

01:35:17.980 --> 01:35:20.340
y esa proyeccion matematica te va a definir

01:35:20.340 --> 01:35:22.220
cuanta memoria, cuanto recursos

01:35:22.220 --> 01:35:24.320
necesitas minimo para atender a

01:35:24.320 --> 01:35:26.500
en usuarios entonces este row generator

01:35:26.500 --> 01:35:28.560
es muy bueno para hacer esas pruebas de estres

01:35:29.700 --> 01:35:30.260
vayase

01:35:30.260 --> 01:35:32.140
vamos a conferir asi el row generator

01:35:32.140 --> 01:35:33.920
por favor row generator

01:35:35.440 --> 01:35:36.480
y vamos a

01:35:36.480 --> 01:35:38.380
agregar dos columnas

01:35:40.680 --> 01:35:42.280
la primera columna

01:35:42.280 --> 01:35:43.720
la va a llamar por favor

01:35:45.360 --> 01:35:46.000
identificacion

01:35:46.000 --> 01:35:47.680
todo en minuscula

01:35:47.680 --> 01:35:49.620
y la segunda la va a llamar

01:35:51.440 --> 01:35:52.080
nombres

01:35:56.800 --> 01:35:58.720
el tipo de datos define

01:35:58.720 --> 01:36:00.640
lo string y por favor

01:36:00.640 --> 01:36:02.980
ahora en las funciones que estan aqui

01:36:02.980 --> 01:36:04.560
porque esto se viene aleatoriamente

01:36:04.560 --> 01:36:06.480
dele click en la primera funcion

01:36:06.480 --> 01:36:08.760
de el campo identificacion

01:36:08.760 --> 01:36:10.620
y va a buscar

01:36:12.040 --> 01:36:14.100
uno que se llama

01:36:14.100 --> 01:36:15.420
perdon que pena hay

01:36:16.480 --> 01:36:18.880
algo que guarde, que genere un código

01:36:18.880 --> 01:36:20.520
espere lo busco por aca

01:36:20.520 --> 01:36:22.660
que se me escapo el nombre

01:36:22.660 --> 01:36:24.820
de row generator

01:36:24.820 --> 01:36:26.700
momento por favor

01:36:26.700 --> 01:36:28.480
a listo busquelo

01:36:31.480 --> 01:36:32.480
en numeric

01:36:33.060 --> 01:36:34.380
ahi aparece numeric

01:36:34.380 --> 01:36:36.380
no se si le aparece numeric

01:36:38.880 --> 01:36:40.620
ah no aparece

01:36:43.760 --> 01:36:45.520
que raro que no aparezca

01:36:45.520 --> 01:36:47.640
perdon cambia el tipo de datos ya se porque no aparece

01:36:47.640 --> 01:36:50.020
el tipo de datos cambia al integer mejor

01:36:50.020 --> 01:36:52.160
para generar un consecutivo

01:36:52.160 --> 01:36:53.520
cambia al integer ese es el problema

01:36:54.380 --> 01:36:56.720
integer y aqui escoge

01:36:56.720 --> 01:36:57.860
uno que se llama

01:36:58.760 --> 01:36:59.920
numeric sequence

01:36:59.920 --> 01:37:01.940
que ya lo hemos usado anteriormente

01:37:01.940 --> 01:37:02.700
numeric sequence

01:37:03.320 --> 01:37:06.140
y eso te va a generar un aleatorio ahi

01:37:06.140 --> 01:37:12.860
1,2,3,4,5,6,7,8,9

01:37:16.100 --> 01:37:17.060
numeric sequence

01:37:17.720 --> 01:37:19.340
aquí he escogido que no era

01:37:19.940 --> 01:37:21.740
y aqui lo puedes confiar recuerda que esto lo hicimos

01:37:21.740 --> 01:37:23.840
en el dia de ayer cuando obtuvimos

01:37:23.840 --> 01:37:26.080
los tres primeros

01:37:26.080 --> 01:37:27.580
productos menos vendidos

01:37:27.580 --> 01:37:29.900
y en el nombre

01:37:29.900 --> 01:37:31.900
busquen uno que se llama

01:37:33.900 --> 01:37:35.100
talent data generator

01:37:35.900 --> 01:37:37.040
get first name

01:37:38.380 --> 01:37:41.440
ahi no se porque le di enter

01:37:42.780 --> 01:37:43.640
listo

01:37:43.640 --> 01:37:45.600
get first name

01:37:45.600 --> 01:37:47.220
quedaria mas o menos aqui

01:37:47.900 --> 01:37:50.220
el primero es integer

01:37:50.220 --> 01:37:51.320
numeric sequence

01:37:51.900 --> 01:37:54.320
y el segundo es de tipo string

01:37:54.320 --> 01:37:55.960
talent data generator

01:37:55.960 --> 01:37:56.840
get first name

01:37:56.840 --> 01:37:57.920
el valor numerico

01:37:57.920 --> 01:38:00.720
perdon no lo escuche el cual integer

01:38:00.720 --> 01:38:01.620
el primero

01:38:02.440 --> 01:38:03.600
es integer si

01:38:04.200 --> 01:38:07.080
no el numeric de 1,9

01:38:07.080 --> 01:38:08.860
no deje lo que lo

01:38:08.860 --> 01:38:10.820
ahi va de uno en uno pero el

01:38:10.820 --> 01:38:12.980
a partir de note lo siguiente

01:38:12.980 --> 01:38:14.820
que aqui le va a pasar numero de filas

01:38:14.820 --> 01:38:16.860
que va a generar esa 100 no la genera

01:38:16.860 --> 01:38:18.620
porque demora mucho insertando entonces vamos

01:38:18.620 --> 01:38:21.220
a generar solamente 20 y nada mas

01:38:21.220 --> 01:38:22.760
cambia ese number row para que genere

01:38:22.760 --> 01:38:24.980
20 listo

01:38:24.980 --> 01:38:26.240
ok aqui

01:38:27.260 --> 01:38:28.460
y entonces

01:38:28.980 --> 01:38:30.660
se me escapo algo en el

01:38:30.660 --> 01:38:32.980
create table que no defini el esquema

01:38:33.540 --> 01:38:35.060
osea las columnas que voy a

01:38:35.060 --> 01:38:36.860
definir entonces

01:38:36.860 --> 01:38:39.140
vaya nuevamente al create table

01:38:39.140 --> 01:38:40.980
editar esquema

01:38:41.660 --> 01:38:42.980
y agregue dos columnas

01:38:43.700 --> 01:38:44.820
una columna que se llama

01:38:45.760 --> 01:38:46.360
identificador

01:38:48.000 --> 01:38:48.980
de tipo entero

01:38:55.140 --> 01:38:57.160
y la otra columna se llama

01:38:57.160 --> 01:38:59.020
nombre de tipo string

01:39:01.900 --> 01:39:03.260
hay muchisimas gracias

01:39:03.260 --> 01:39:05.320
porque si despues no no era compatible

01:39:05.320 --> 01:39:06.980
integer muchas gracias

01:39:07.740 --> 01:39:08.980
se me habia escapado

01:39:12.380 --> 01:39:13.740
perfecto aqui esta

01:39:13.740 --> 01:39:15.380
data generator

01:39:17.180 --> 01:39:19.060
aca la llamo identificacion pero bueno

01:39:19.060 --> 01:39:20.960
no importa haga lo siguiente

01:39:20.960 --> 01:39:22.580
ahora ya para finalizar este ejemplo

01:39:22.580 --> 01:39:24.420
vallase aqui a

01:39:24.420 --> 01:39:27.020
vamos a cambiar el nombre de esta fila

01:39:27.020 --> 01:39:28.580
porque me interesa este nombre muy importante

01:39:29.720 --> 01:39:30.860
vamos entonces

01:39:30.860 --> 01:39:32.680
a cambiar esta conexion de

01:39:32.680 --> 01:39:34.580
tegro generator al tehai la vamos a llamar

01:39:39.360 --> 01:39:40.000
clientes

01:39:43.000 --> 01:39:44.280
listo entonces

01:39:44.280 --> 01:39:46.960
accedemos al tehai row

01:39:48.700 --> 01:39:50.600
le decimos que vamos a usar

01:39:50.600 --> 01:39:52.860
una conexion existente

01:39:53.740 --> 01:39:55.020
tehai connection

01:39:55.020 --> 01:39:56.640
y este

01:39:56.640 --> 01:39:58.120
tehai row

01:39:59.060 --> 01:39:59.700
funciona

01:40:01.160 --> 01:40:02.600
especificando una sentencia

01:40:02.600 --> 01:40:04.540
SQL hay dos formas

01:40:04.540 --> 01:40:06.400
de insertar datos en high

01:40:06.400 --> 01:40:08.620
una es esa forma y la otra

01:40:08.620 --> 01:40:10.840
es a traves de un cargue de un archivo

01:40:10.840 --> 01:40:12.720
en este caso voy a hacer entonces

01:40:12.720 --> 01:40:15.080
la primera forma que es

01:40:15.080 --> 01:40:16.340
insertandola con una

01:40:16.340 --> 01:40:18.500
sentencia SQL entonces

01:40:18.500 --> 01:40:20.340
aqui donde dice query por favor

01:40:20.340 --> 01:40:22.540
vamos a definir la sentencia SQL

01:40:22.540 --> 01:40:24.580
esto es sumamente

01:40:25.420 --> 01:40:26.540
engorroso la dejo

01:40:26.540 --> 01:40:29.220
por si acaso de pronto te toca hacerlo alguna vez

01:40:29.220 --> 01:40:30.500
pero quizas la otra forma

01:40:30.500 --> 01:40:32.220
es mucho mas sencilla

01:40:32.220 --> 01:40:34.480
para esto voy a hacer lo siguiente

01:40:35.680 --> 01:40:36.560
ahi donde

01:40:36.560 --> 01:40:38.720
esta el query te va a quedar de esta forma

01:40:38.720 --> 01:40:40.500
voy a copiarla aca

01:40:41.240 --> 01:40:42.520
te va a quedar comilla

01:40:42.520 --> 01:40:44.880
doble, comilla doble

01:40:44.880 --> 01:40:46.600
dentro de esas comillas doble vamos a

01:40:46.600 --> 01:40:48.640
definir la sentencia SQL con la cual

01:40:48.640 --> 01:40:50.440
yo quiero insertar valores

01:40:50.440 --> 01:40:51.400
en este caso

01:40:51.400 --> 01:40:54.040
insert int

01:40:55.820 --> 01:40:56.500
empleados

01:40:56.500 --> 01:40:58.820
abro parentesis

01:40:58.820 --> 01:41:00.600
y ahi termino la sentencia

01:41:01.660 --> 01:41:03.060
inicial del SQL

01:41:03.060 --> 01:41:04.800
quedaria insert int

01:41:05.760 --> 01:41:06.440
empleados

01:41:06.440 --> 01:41:08.880
perdon no se llama empleados se llama clientes que pena

01:41:08.880 --> 01:41:10.600
la tala la llame fue clientes

01:41:11.780 --> 01:41:12.600
no empleados

01:41:13.140 --> 01:41:14.540
la de empleados fue la anterior

01:41:14.540 --> 01:41:16.620
no porque ahora voy a pegar en

01:41:16.620 --> 01:41:18.380
en tiempo de ejecución los valores

01:41:19.120 --> 01:41:21.040
ah si perdon perdon

01:41:21.040 --> 01:41:22.440
que pena gracias

01:41:30.760 --> 01:41:32.560
aqui le voy a dar aqui

01:41:32.560 --> 01:41:33.600
un mas

01:41:33.600 --> 01:41:35.400
para concatenar el valor

01:41:35.400 --> 01:41:38.660
y aqui es muy importante como llamaste

01:41:38.660 --> 01:41:40.440
la conexion del

01:41:40.440 --> 01:41:42.520
T-Rogenerator hacia el T-Hiro

01:41:42.520 --> 01:41:45.040
con el mismo nombre y con esto sensible

01:41:45.040 --> 01:41:46.520
yo lo llame clientes

01:41:47.080 --> 01:41:48.960
quedaria entonces

01:41:49.920 --> 01:41:50.600
clientes

01:41:50.600 --> 01:41:52.560
punto nombre de la variable

01:41:52.560 --> 01:41:54.560
con que viene del T-Rogenerator

01:41:54.560 --> 01:41:56.420
yo la llame inicialmente

01:41:56.420 --> 01:41:58.900
la primera la llame identificación

01:41:58.900 --> 01:42:00.900
entonces para hacer referencia

01:42:00.900 --> 01:42:02.760
me quedaria de esta

01:42:02.760 --> 01:42:03.760
forma

01:42:06.000 --> 01:42:07.360
punto identificación

01:42:07.360 --> 01:42:08.520
y de esa manera

01:42:09.240 --> 01:42:10.920
cada vez que

01:42:10.920 --> 01:42:12.760
entre una fila voy a tomar por modo

01:42:12.760 --> 01:42:15.040
de ejecución el valor ese

01:42:16.160 --> 01:42:16.920
mas

01:42:16.920 --> 01:42:18.920
abro comillas dobles

01:42:20.040 --> 01:42:20.700
coma

01:42:21.460 --> 01:42:24.340
y aqui me toca abrir una comilla simple

01:42:24.340 --> 01:42:25.860
para que pueda quedar

01:42:26.680 --> 01:42:26.920
mas

01:42:28.860 --> 01:42:28.920
clientes

01:42:29.620 --> 01:42:31.260
punto nombres

01:42:31.260 --> 01:42:32.920
mas abro comilla

01:42:34.080 --> 01:42:35.300
comilla simple

01:42:35.300 --> 01:42:37.300
y termina asi

01:42:37.300 --> 01:42:39.320
vea que esto es sumamente engorroso

01:42:39.320 --> 01:42:41.320
pero quizas alguna vez

01:42:41.320 --> 01:42:42.940
pueda superar algún problema con eso

01:42:42.940 --> 01:42:45.740
ahi estoy creando una

01:42:45.740 --> 01:42:47.320
sentencia dinamica

01:42:50.280 --> 01:42:51.740
vamos con la comilla simple

01:42:51.740 --> 01:42:53.020
porque como es textual

01:42:53.020 --> 01:42:54.720
es texto el bar chart

01:42:54.720 --> 01:42:57.440
entonces hay que necesariamente mandarle comillas simple

01:42:57.440 --> 01:42:59.320
voy a pegarla aca

01:43:00.500 --> 01:43:01.260
asi que

01:43:01.260 --> 01:43:03.400
a ver que aquí hay todo

01:43:03.400 --> 01:43:05.320
la pego aqui

01:43:07.300 --> 01:43:08.380
cuando termines

01:43:08.380 --> 01:43:09.820
por favor manda a ejecutar

01:43:09.820 --> 01:43:10.960
el job

01:43:12.380 --> 01:43:13.520
confirma que el job

01:43:13.520 --> 01:43:15.460
que vayas a ejecutar es el que

01:43:15.460 --> 01:43:17.480
el que necesitas

01:43:17.480 --> 01:43:19.300
voy a ir de aqui a ver si no tengo

01:43:19.300 --> 01:43:20.800
algún error de

01:43:21.320 --> 01:43:23.420
bueno ahi empezo

01:43:23.420 --> 01:43:25.320
ya creo la tabla esta intentando

01:43:25.320 --> 01:43:27.320
insertar esta demorando

01:43:28.680 --> 01:43:29.600
si claro

01:43:29.600 --> 01:43:31.300
ahi esta demorando para insertar

01:43:31.300 --> 01:43:32.620
pero ya inserto la primera fila

01:43:32.620 --> 01:43:34.540
ahi esta que esta aqui

01:43:34.540 --> 01:43:35.900
si desea la coloco

01:43:36.620 --> 01:43:38.620
en la carpeta compartida

01:43:40.760 --> 01:43:42.120
Beatriz creo que salio error

01:43:42.120 --> 01:43:43.600
este para ver Beatriz

01:43:43.600 --> 01:43:47.100
para ver que error tiene en la ejecucion me parecio que vi un error

01:43:47.100 --> 01:43:48.960
dice el valor

01:43:48.960 --> 01:43:50.360
del parametro

01:43:50.360 --> 01:43:52.640
no existe ah listo dele cancel

01:43:52.640 --> 01:43:54.640
ahi un momento vayase al teja y close

01:43:54.640 --> 01:43:56.240
que es donde aparece el error

01:43:56.240 --> 01:43:57.720
y se le olvido

01:43:58.620 --> 01:44:00.520
se le olvido colocar

01:44:00.520 --> 01:44:02.340
la conexion que va a cerrar

01:44:02.340 --> 01:44:04.600
entonces cierre ahi

01:44:04.600 --> 01:44:06.620
aca en el component list le da click en el combo

01:44:06.620 --> 01:44:07.200
boss

01:44:07.940 --> 01:44:10.140
y escoge la conexion de high que va a cerrar

01:44:10.960 --> 01:44:12.480
listo intenta ejecutar nuevamente

01:44:12.480 --> 01:44:12.960
por favor

01:44:14.880 --> 01:44:16.560
ah porque tiene poca demoria

01:44:16.560 --> 01:44:18.340
como tenemos el continuo la maquina virtual

01:44:18.340 --> 01:44:20.220
y ahora estamos ejecutando con

01:44:21.960 --> 01:44:22.580
listo por eso

01:44:22.580 --> 01:44:23.180
que esta demorando

01:44:23.180 --> 01:44:26.160
empezo la conexion listo

01:44:26.160 --> 01:44:28.080
empezo a insertar va a ver si sale algun error

01:44:28.080 --> 01:44:30.700
recuerde que por lo mismo recurso que tenemos

01:44:30.700 --> 01:44:32.060
esta demorando para insertar

01:44:32.060 --> 01:44:33.380
espere para ver si pasa el dos

01:44:33.380 --> 01:44:35.940
listo ahi inserto el primero dejelo ahi ejecutando por favor

01:44:35.940 --> 01:44:37.520
quien no le funciono

01:44:38.080 --> 01:44:39.780
listo quien no le funciono

01:44:39.780 --> 01:44:42.060
vamos a ver voy contigo Fernanda

01:44:42.060 --> 01:44:43.500
que sale un error ahi vamos a ver

01:44:43.500 --> 01:44:46.080
es un error de compilacion

01:44:47.980 --> 01:44:48.520
ok

01:44:48.520 --> 01:44:50.180
vayase al code

01:44:50.180 --> 01:44:52.140
para ver que le muestre exactamente

01:44:52.140 --> 01:44:54.080
donde esta en el code

01:44:55.280 --> 01:44:56.100
ah vayase

01:44:56.100 --> 01:44:58.260
vayase al designer code al codigo

01:44:58.260 --> 01:45:00.120
ah si si listo

01:45:00.120 --> 01:45:02.100
ah todavia no cerra el lado verdad hay que serle

01:45:03.160 --> 01:45:03.600
dele

01:45:03.600 --> 01:45:06.160
click en code vamos a ver ahi esta el punto rojo

01:45:06.160 --> 01:45:08.260
que esta marcando dele click en el punto rojo

01:45:08.260 --> 01:45:10.060
exactamente el lado derecho

01:45:11.380 --> 01:45:12.320
ah dice que esa

01:45:12.320 --> 01:45:14.000
variable no existe como le llamo

01:45:14.000 --> 01:45:16.100
usted nombre o nombres

01:45:17.620 --> 01:45:18.060
entonces

01:45:18.060 --> 01:45:19.600
entra al terror generator

01:45:19.600 --> 01:45:21.760
vayase al designer nuevamente

01:45:21.760 --> 01:45:26.080
vayase al terror generator ese que esta ahi

01:45:26.960 --> 01:45:27.480
nombres

01:45:30.420 --> 01:45:30.940
listo

01:45:30.940 --> 01:45:32.080
dele en T

01:45:32.080 --> 01:45:33.940
y cambie nombre

01:45:33.940 --> 01:45:35.940
por nombres

01:45:35.940 --> 01:45:40.240
vuelve a ejecutar nuevamente a ver ah dele cancel ahi

01:45:40.240 --> 01:45:42.280
vayase al T high close que esta marcando

01:45:42.280 --> 01:45:44.160
error ahi doble click escoge ahi

01:45:44.160 --> 01:45:46.320
la conexion que va cerrada que no la configuro

01:45:46.320 --> 01:45:48.240
y vuelve a ejecutar nuevamente

01:45:48.240 --> 01:45:50.160
a ver ahora si listo ya creo esta

01:45:50.160 --> 01:45:51.780
insertando espere un momento a ver porque esta

01:45:51.780 --> 01:45:53.640
demorando mucho hay pocos recursos

01:45:53.640 --> 01:45:55.300
no deberia

01:45:55.300 --> 01:45:57.740
porque vamos a ver no no deberia vamos a

01:45:57.740 --> 01:45:59.900
ver un momento voy a revisar

01:45:59.900 --> 01:46:01.640
un momento el caso

01:46:02.320 --> 01:46:04.040
el mio para ver voy a consultar

01:46:04.040 --> 01:46:05.740
como esta mi tabla mi base de datos

01:46:05.740 --> 01:46:07.980
a ver

01:46:08.740 --> 01:46:11.720
entonces voy por aca

01:46:12.700 --> 01:46:14.140
voy a darle

01:46:14.140 --> 01:46:16.800
estoy en que en AXA no se si

01:46:16.800 --> 01:46:17.640
sele casterisco

01:46:20.100 --> 01:46:20.620
frompientes

01:46:22.740 --> 01:46:23.460
no no

01:46:23.460 --> 01:46:24.880
no dejo de insertarlo en blanco me muestras

01:46:24.880 --> 01:46:27.020
su yo para ver un momento

01:46:27.020 --> 01:46:28.840
quien fue jorge

01:46:28.840 --> 01:46:30.280
el que me dijo

01:46:30.740 --> 01:46:32.420
alejano voy contigo entonces alejano

01:46:32.420 --> 01:46:34.800
vamos a ver que paso

01:46:38.440 --> 01:46:38.840
ah

01:46:38.840 --> 01:46:40.600
te falta ahi lo que hace que te falta el

01:46:40.600 --> 01:46:42.240
punto y coma dale punto y coma ahi en favor

01:46:42.240 --> 01:46:44.760
agregale el punto y coma porque no lo

01:46:44.760 --> 01:46:46.760
no no no punto y coma porque ahi

01:46:46.760 --> 01:46:48.680
concatiendo ahi listo dale flecha

01:46:48.680 --> 01:46:48.920
ahi arriba

01:46:50.480 --> 01:46:52.060
coloco nuevamente el select lo que hace es que

01:46:52.060 --> 01:46:53.740
ahi te concateno los dos select

01:46:54.560 --> 01:46:56.980
nuevamente select casterisco entonces

01:46:56.980 --> 01:46:58.680
ahi con punto y coma exactamente enter

01:46:58.680 --> 01:47:00.460
ahi si esta manate solamente

01:47:01.440 --> 01:47:02.940
perfecto listo esa es una forma

01:47:02.940 --> 01:47:04.240
vamos entonces a la segunda

01:47:05.620 --> 01:47:06.820
eh vamos a hacer entonces

01:47:06.820 --> 01:47:08.760
el break de los

01:47:08.760 --> 01:47:10.180
20 minuticos o las 12 30

01:47:11.260 --> 01:47:12.940
hacemos los 20 minutos y estamos entonces

01:47:12.940 --> 01:47:14.680
en 20 minuticos ok

01:47:18.220 --> 01:47:18.660
listo

01:47:20.800 --> 01:47:22.200
ok

01:47:51.940 --> 01:47:53.340
ok

01:48:05.060 --> 01:48:06.460
ok

01:48:06.460 --> 01:48:08.720
ok

01:48:16.740 --> 01:48:18.140
ok

01:48:18.140 --> 01:48:18.160
ok

01:48:18.160 --> 01:48:18.180
ok

01:48:35.840 --> 01:48:38.000
Thank you.

01:49:00.760 --> 01:49:03.740
Hello again. We are back, are we connected?

01:49:06.220 --> 01:49:10.320
Yes, we are connected, or if there is audio, can you hear me?

01:49:10.460 --> 01:49:11.840
All right.

01:49:12.760 --> 01:49:13.880
Perfect, thank you very much.

01:49:13.880 --> 01:49:27.860
All right, then we create with the tool such that the only thing that does not allow or does not have a component that allows you to create database, although it is possible that you can, through an SQL pattern,

01:49:28.220 --> 01:49:35.220
I show it to you, I mentioned it on the first day, but if you want, for example, to create with the tool,

01:49:35.220 --> 01:49:38.620
sorry, I'm with Alejandro's, it's not mine, now it's mine.

01:49:40.160 --> 01:49:48.160
We can make an SQL pattern that generates the database, but hey, we won't have time to do that part.

01:49:48.640 --> 01:49:51.500
Let's look at the table, how would we do to look at the table?

01:49:52.360 --> 01:49:59.340
First of all, take into account that some of us had already created the employee table and we have created a new table.

01:49:59.340 --> 01:50:03.560
So, to be able to view the new table, we are going to do the same thing again.

01:50:04.420 --> 01:50:07.340
We select the connection to Hive.

01:50:09.320 --> 01:50:11.800
It gives you extract scheme.

01:50:14.100 --> 01:50:14.680
Next.

01:50:15.920 --> 01:50:17.800
Expand, please, there is Axa.

01:50:18.560 --> 01:50:21.740
And there you see new customers, which is the one you have.

01:50:22.020 --> 01:50:23.760
Please choose it. Next.

01:50:25.120 --> 01:50:25.720
Finish.

01:50:25.720 --> 01:50:29.160
And two tables should appear to you.

01:50:30.820 --> 01:50:33.080
To another, only one table will appear.

01:50:33.100 --> 01:50:38.420
How do we do now to process those data, for example, that I want to read with the tool?

01:50:39.340 --> 01:50:46.520
To use then tables that are defined and perform operations of some job, do the following, please.

01:50:46.640 --> 01:50:49.480
We are going to create a new job in Hive.

01:50:50.180 --> 01:50:52.840
Create job. Call it job.

01:50:54.340 --> 01:50:55.080
Read.

01:50:55.420 --> 01:51:00.180
Underscore data. Underscore table. Underscore Hive.

01:51:00.700 --> 01:51:02.080
Here finish.

01:51:06.460 --> 01:51:08.060
And very simply.

01:51:10.720 --> 01:51:13.860
It is located in the Axa Hive connection metadata.

01:51:14.820 --> 01:51:16.060
Expand the tables.

01:51:17.720 --> 01:51:20.500
Take the table that you are going to read, which in this case is customers.

01:51:21.420 --> 01:51:24.000
And release it in the canvas.

01:51:24.680 --> 01:51:28.380
By default, an ELT High Input appears.

01:51:29.040 --> 01:51:32.400
Wait a moment, if this man leaves me this component, I do not remember.

01:51:41.300 --> 01:51:41.980
Perfect.

01:51:42.680 --> 01:51:43.460
One moment, done.

01:51:44.040 --> 01:51:45.980
I'm going here to Telok Row.

01:51:51.360 --> 01:51:54.720
And it links with Telok Row, one moment here.

01:51:55.020 --> 01:51:56.820
No, it's with...

01:51:56.820 --> 01:51:58.800
Wait a moment, I do not know if I used the... Oh, I'm going, one moment.

01:51:58.900 --> 01:52:01.560
If I used the right component.

01:52:03.400 --> 01:52:04.000
No.

01:52:05.840 --> 01:52:06.700
I think...

01:52:06.700 --> 01:52:09.920
Let me confirm, because I have that ELT.

01:52:10.580 --> 01:52:13.940
And I think the component is not...

01:52:13.940 --> 01:52:15.560
ELT Input.

01:52:16.080 --> 01:52:17.940
But T-High Input.

01:52:19.040 --> 01:52:21.580
Let me check it for a moment, Telok Row.

01:52:22.220 --> 01:52:24.660
I already got to where this moment associated the problem.

01:52:26.260 --> 01:52:27.800
I'm going here to the main.

01:52:28.460 --> 01:52:31.520
I think it also works for me, I do not remember this part here.

01:52:31.560 --> 01:52:32.900
Let's see.

01:52:36.180 --> 01:52:36.680
Yes.

01:52:37.980 --> 01:52:41.680
I also test it with the other one, which is ELT Input.

01:52:41.680 --> 01:52:42.940
I think it gives me the same.

01:52:44.240 --> 01:52:45.020
Telok Row.

01:52:48.560 --> 01:52:50.320
I link with...

01:52:50.320 --> 01:52:54.280
No, it has to be with a shooter, then.

01:52:54.860 --> 01:52:57.140
No, it will not accept that component, I do not remember what it is for.

01:52:57.340 --> 01:52:58.240
The component, of course.

01:52:58.800 --> 01:53:00.520
This is one of the cases that failed, then.

01:53:00.640 --> 01:53:02.320
The tool to select is the most appropriate.

01:53:02.820 --> 01:53:05.400
The component is T-High Input, the one that will select.

01:53:06.040 --> 01:53:07.760
And then it connects it to a Telok Row.

01:53:08.600 --> 01:53:10.980
Who was telling me that I did not create the table?

01:53:11.140 --> 01:53:12.300
Who was it?

01:53:12.420 --> 01:53:13.200
I remember.

01:53:13.780 --> 01:53:14.960
Brenda, I'm going there, then.

01:53:14.960 --> 01:53:16.360
Well, let's see.

01:53:16.600 --> 01:53:17.380
Ah, but it's blocked.

01:53:18.060 --> 01:53:18.720
No, I'm seeing.

01:53:19.040 --> 01:53:20.700
How many rows did you send to insert?

01:53:21.020 --> 01:53:21.500
Twenty.

01:53:23.300 --> 01:53:23.780
Or...

01:53:23.780 --> 01:53:24.780
Wait, that takes time.

01:53:25.880 --> 01:53:27.240
Go to the left side.

01:53:28.440 --> 01:53:30.540
Where it says AXA High of the metadata.

01:53:31.420 --> 01:53:32.240
Right click.

01:53:32.780 --> 01:53:34.300
Give it to extract the scheme for a moment.

01:53:34.420 --> 01:53:35.460
Give it Next, please.

01:53:35.560 --> 01:53:37.160
Expand AXA.

01:53:38.120 --> 01:53:39.220
And there it has clients.

01:53:39.300 --> 01:53:40.040
I think that's it.

01:53:41.120 --> 01:53:42.540
Then select, please.

01:53:42.920 --> 01:53:44.000
Give it Next.

01:53:49.140 --> 01:53:50.180
And it ends with Finish.

01:53:51.240 --> 01:53:52.740
That is the table that had to be created.

01:53:52.920 --> 01:53:55.060
To confirm, go to your T-High Create Table.

01:53:55.160 --> 01:53:56.140
One moment.

01:53:56.320 --> 01:53:57.740
To T-High Create Table.

01:53:58.100 --> 01:54:00.740
And let's see the name of the table that created T-High Create Table.

01:54:01.000 --> 01:54:01.620
That one that is there.

01:54:01.620 --> 01:54:02.380
In the middle.

01:54:03.040 --> 01:54:03.940
In the canvas.

01:54:04.740 --> 01:54:05.360
That one that is there.

01:54:05.420 --> 01:54:06.120
Client, perfect.

01:54:06.240 --> 01:54:09.080
Now, do the following to confirm if the data was stored correctly.

01:54:10.160 --> 01:54:11.880
Go then to Hive.

01:54:11.940 --> 01:54:13.320
And it will create a new job there in Hive.

01:54:13.540 --> 01:54:14.160
Which is to read data.

01:54:14.240 --> 01:54:15.080
There it has it.

01:54:16.180 --> 01:54:16.480
Ready.

01:54:17.040 --> 01:54:18.460
Then take clients.

01:54:19.120 --> 01:54:20.720
And release them in the canvas.

01:54:22.060 --> 01:54:24.720
But change there to T-High Input.

01:54:24.880 --> 01:54:25.800
Add a Tello Grow.

01:54:26.040 --> 01:54:27.560
And connect clients.

01:54:29.080 --> 01:54:30.040
With Tello Grow.

01:54:30.360 --> 01:54:32.340
And proceed to execute then the job.

01:54:33.580 --> 01:54:33.940
Ready.

01:54:34.000 --> 01:54:35.240
Proceed to execute it.

01:54:35.520 --> 01:54:35.820
Here it is.

01:54:35.960 --> 01:54:36.220
There it goes.

01:54:36.320 --> 01:54:37.540
If there it has the data.

01:54:38.480 --> 01:54:40.100
What makes me weird is.

01:54:40.120 --> 01:54:41.320
There you sent 20, right?

01:54:41.340 --> 01:54:41.720
20 data.

01:54:41.920 --> 01:54:42.100
There it is.

01:54:42.100 --> 01:54:42.840
What is out of order.

01:54:43.020 --> 01:54:43.780
Yes, there are 20.

01:54:44.380 --> 01:54:45.400
Go up to read a little more.

01:54:45.540 --> 01:54:46.740
Yes, go up.

01:54:47.080 --> 01:54:47.860
Yes, there are 20.

01:54:47.880 --> 01:54:48.180
There it is.

01:54:48.360 --> 01:54:48.680
Yes.

01:54:48.820 --> 01:54:52.000
What happens is that I had not updated the connection.

01:54:52.160 --> 01:54:53.860
To see the table that was created.

01:54:53.960 --> 01:54:54.500
That's why I didn't.

01:54:54.940 --> 01:54:55.600
Yes, I had created it.

01:54:55.640 --> 01:54:56.560
But it was necessary to update.

01:55:00.340 --> 01:55:01.060
Well, let's go.

01:55:01.860 --> 01:55:02.220
Yes.

01:55:02.240 --> 01:55:04.860
Because by default the tool does not update.

01:55:05.020 --> 01:55:05.980
The changes that there are.

01:55:05.980 --> 01:55:06.540
It does not update.

01:55:07.280 --> 01:55:09.220
Of the new sockets that are in the data.

01:55:10.480 --> 01:55:11.580
You have to update.

01:55:12.540 --> 01:55:14.960
Well, let's create now.

01:55:14.960 --> 01:55:16.920
And let's see the other way.

01:55:16.920 --> 01:55:19.440
Of how we would do to insert data.

01:55:19.440 --> 01:55:20.600
In a table.

01:55:21.440 --> 01:55:22.240
For that, then.

01:55:22.240 --> 01:55:23.780
Let's create another job.

01:55:24.200 --> 01:55:25.620
Please, if we are talking about hype.

01:55:26.480 --> 01:55:27.640
And this man.

01:55:27.640 --> 01:55:28.980
I'm going to call him.

01:55:31.300 --> 01:55:31.960
Job.

01:55:33.340 --> 01:55:34.600
Under score sales.

01:55:34.600 --> 01:55:35.240
Sales.

01:55:35.440 --> 01:55:36.000
Products.

01:55:37.360 --> 01:55:38.480
By genre.

01:55:39.160 --> 01:55:39.460
Sales.

01:55:39.580 --> 01:55:39.860
Products.

01:55:39.980 --> 01:55:40.360
By genre.

01:55:40.640 --> 01:55:41.400
Let's go then.

01:55:41.400 --> 01:55:43.180
To mix a bit of.

01:55:43.180 --> 01:55:43.760
Big data.

01:55:43.800 --> 01:55:44.220
Of progress.

01:55:44.560 --> 01:55:45.720
With storage in hype.

01:55:47.300 --> 01:55:48.840
At the end it gives you finish.

01:55:50.120 --> 01:55:50.220
Then.

01:55:54.300 --> 01:55:54.880
Of hype.

01:55:54.920 --> 01:55:56.180
You can copy.

01:55:56.360 --> 01:55:57.000
In the previous hype.

01:55:57.340 --> 01:55:58.820
Copy the tepryok that serves me.

01:55:59.040 --> 01:56:01.600
The tepryok.

01:56:02.980 --> 01:56:03.540
Copy.

01:56:05.420 --> 01:56:06.600
Paste it here.

01:56:09.200 --> 01:56:10.920
And look for a.

01:56:10.920 --> 01:56:12.540
In the.

01:56:12.700 --> 01:56:13.020
The hadut.

01:56:13.020 --> 01:56:14.420
Look for a.

01:56:14.420 --> 01:56:14.760
TH.

01:56:15.900 --> 01:56:16.920
HDFS connection.

01:56:17.220 --> 01:56:18.780
To paste it there too.

01:56:19.780 --> 01:56:20.600
And reuse it.

01:56:23.820 --> 01:56:25.700
This HDFS connection.

01:56:26.260 --> 01:56:26.620
OK.

01:56:26.980 --> 01:56:27.400
Copy.

01:56:28.580 --> 01:56:30.660
And paste it here.

01:56:36.340 --> 01:56:36.900
Perfect.

01:56:37.020 --> 01:56:37.980
And I connect it.

01:56:37.980 --> 01:56:38.860
In the same tepryok.

01:56:38.940 --> 01:56:39.880
With the tehy connection.

01:56:43.540 --> 01:56:44.660
I copy the tepos.

01:56:44.720 --> 01:56:46.080
I close the connection.

01:56:46.360 --> 01:56:46.780
Of the hype.

01:56:47.160 --> 01:56:48.420
That you have it here.

01:56:50.000 --> 01:56:50.560
Ready.

01:56:51.720 --> 01:56:53.040
This would be the original structure.

01:56:54.740 --> 01:56:55.100
Then.

01:56:56.760 --> 01:56:57.320
Copy.

01:57:00.580 --> 01:57:02.440
And I'm going to paste it here.

01:57:02.620 --> 01:57:03.180
In.

01:57:07.560 --> 01:57:08.120
Perfect.

01:57:11.320 --> 01:57:12.260
This one here.

01:57:12.580 --> 01:57:13.460
And this one here.

01:57:15.120 --> 01:57:16.600
Those are the elements that I need now.

01:57:19.140 --> 01:57:19.680
The what?

01:57:20.700 --> 01:57:21.580
The tehy connection.

01:57:22.420 --> 01:57:22.940
Yes.

01:57:22.980 --> 01:57:23.920
I had had that.

01:57:23.940 --> 01:57:24.540
If you don't have it.

01:57:24.800 --> 01:57:25.340
Add it please.

01:57:26.280 --> 01:57:27.840
We had done that in the previous one.

01:57:28.000 --> 01:57:28.700
Add a tepos.

01:57:28.840 --> 01:57:29.760
And a tehyclose.

01:57:30.160 --> 01:57:31.560
Remember that tehyclose.

01:57:31.560 --> 01:57:33.880
Will show the connections that the job has.

01:57:33.880 --> 01:57:35.140
And you have to specify here.

01:57:35.160 --> 01:57:36.360
The connection that is going to close.

01:57:36.680 --> 01:57:37.720
Otherwise it will be a compilation error.

01:57:38.060 --> 01:57:38.580
That is not the past.

01:57:38.640 --> 01:57:40.020
It is the same because it takes the configuration.

01:57:40.960 --> 01:57:43.040
Maybe you ask yourself.

01:57:43.040 --> 01:57:44.200
You start to ask yourself.

01:57:44.380 --> 01:57:45.880
There is no way that I.

01:57:47.960 --> 01:57:50.360
Don't always have to copy and paste this.

01:57:50.460 --> 01:57:50.960
The same.

01:57:52.140 --> 01:57:55.180
Well, in the version of the Talent Studio.

01:57:55.480 --> 01:57:56.020
The commercial.

01:57:56.860 --> 01:57:59.160
They have what is called a Joplex.

01:57:59.900 --> 01:58:02.380
A Joplex is a set of components.

01:58:02.380 --> 01:58:03.840
That is involved in a job.

01:58:03.960 --> 01:58:05.620
So I paste that Joplex.

01:58:05.740 --> 01:58:06.920
And I could have a Joplex.

01:58:07.080 --> 01:58:09.080
That I have the pre-job and the tepos-job.

01:58:10.720 --> 01:58:11.600
So there.

01:58:11.800 --> 01:58:13.900
The only thing I do is paste the job.

01:58:13.960 --> 01:58:15.740
And I don't have to copy and paste.

01:58:16.280 --> 01:58:17.920
Otherwise I already have everything in one part.

01:58:17.980 --> 01:58:19.580
And I can reuse it many times.

01:58:20.420 --> 01:58:23.040
I hope I have time to show you a little bit of that part.

01:58:25.800 --> 01:58:26.680
Well then.

01:58:26.700 --> 01:58:28.200
I need the following components.

01:58:28.420 --> 01:58:30.760
I need an HDFS input.

01:58:35.140 --> 01:58:35.380
Input.

01:58:40.280 --> 01:58:43.140
I need a new component.

01:58:43.580 --> 01:58:45.260
Here a T-Aggregator Row.

01:58:49.600 --> 01:58:51.300
I'm going to do the following.

01:58:51.520 --> 01:58:53.380
Here a T-High Create Table.

01:58:57.380 --> 01:58:57.760
No lie.

01:58:57.980 --> 01:58:58.680
It makes more sense.

01:58:58.680 --> 01:58:59.700
The T-High.

01:58:59.700 --> 01:59:01.560
Makes more sense here in the T-Pre-Job.

01:59:01.720 --> 01:59:02.500
The Create Table.

01:59:02.720 --> 01:59:03.780
It makes much more sense.

01:59:03.860 --> 01:59:04.780
I'm going to put it down.

01:59:06.060 --> 01:59:07.200
I'm going to connect it here.

01:59:07.240 --> 01:59:08.240
On Component OK.

01:59:09.480 --> 01:59:10.720
It makes more sense.

01:59:10.740 --> 01:59:12.000
In the T-Pre-Job.

01:59:13.340 --> 01:59:14.820
And I'm going to finish here.

01:59:14.860 --> 01:59:16.460
To upload these data.

01:59:17.380 --> 01:59:19.180
An HDFS Output.

01:59:20.500 --> 01:59:22.240
And I'm going to finish with a component.

01:59:22.280 --> 01:59:23.540
That is called T-High Low At.

01:59:28.680 --> 01:59:30.680
It would be the components that I'm going to have here.

01:59:32.620 --> 01:59:33.700
HDFS Input.

01:59:33.820 --> 01:59:34.760
T-Aggregator Row.

01:59:35.040 --> 01:59:36.740
T-HDFS Output.

01:59:37.460 --> 01:59:37.860
Perfect.

01:59:38.040 --> 01:59:38.440
Ready.

01:59:41.820 --> 01:59:42.380
Well.

01:59:42.640 --> 01:59:44.120
Now then.

01:59:45.260 --> 01:59:47.700
We are going to configure the T-High Create Table first.

01:59:50.260 --> 01:59:52.220
We are going to use an existing connection.

01:59:53.700 --> 01:59:55.500
And the name of the table.

01:59:55.500 --> 01:59:56.540
We are going to call it.

01:59:58.680 --> 02:00:02.480
We are going to call it.

02:00:02.520 --> 02:00:03.900
Under score by.

02:00:04.360 --> 02:00:05.780
Under score genre.

02:00:06.560 --> 02:00:06.960
Sales.

02:00:07.200 --> 02:00:08.380
Under score by.

02:00:08.640 --> 02:00:09.460
Under score genre.

02:00:10.280 --> 02:00:11.020
In the Action Table.

02:00:11.020 --> 02:00:12.240
I'm going to tell you then.

02:00:12.240 --> 02:00:13.760
That create the table if it doesn't exist.

02:00:13.860 --> 02:00:15.020
If you are going to get a little high.

02:00:15.060 --> 02:00:16.360
You have to investigate a little.

02:00:16.460 --> 02:00:17.540
The partitions if necessary.

02:00:17.700 --> 02:00:18.160
Create partitions.

02:00:18.220 --> 02:00:20.200
But the interesting thing is that the tool allows you to configure that.

02:00:20.500 --> 02:00:21.480
And the.

02:00:21.800 --> 02:00:22.240
Scheme.

02:00:22.340 --> 02:00:24.180
I'm going to take it from what they were down here.

02:00:24.380 --> 02:00:24.820
So.

02:00:25.180 --> 02:00:26.180
So as not to do it twice.

02:00:27.280 --> 02:00:28.580
I configure it down here.

02:00:28.700 --> 02:00:30.060
And then I export it.

02:00:30.060 --> 02:00:31.920
And I import it up here.

02:00:33.160 --> 02:00:33.980
I'm going to do the following now.

02:00:34.160 --> 02:00:34.680
Let's go.

02:00:34.740 --> 02:00:35.680
And we connect.

02:00:36.140 --> 02:00:37.320
To the HDImpute.

02:00:37.380 --> 02:00:38.800
We connect it to the T-Aggregator Row.

02:00:40.240 --> 02:00:41.220
And the T-Aggregator Row.

02:00:41.260 --> 02:00:43.180
We connect it to the HDFS Output.

02:00:48.600 --> 02:00:49.060
Perfect.

02:00:49.320 --> 02:00:49.460
There.

02:00:49.560 --> 02:00:49.940
Double click.

02:00:50.120 --> 02:00:50.720
In the top part.

02:00:50.960 --> 02:00:51.940
Double click.

02:00:52.220 --> 02:00:52.940
Up there.

02:00:53.540 --> 02:00:54.360
In the header.

02:00:54.860 --> 02:00:55.480
I'll tell you where.

02:00:57.160 --> 02:00:58.180
For the moment.

02:01:00.180 --> 02:01:01.840
If you double click here in the header.

02:01:02.080 --> 02:01:03.480
He's going to put you.

02:01:03.600 --> 02:01:04.640
Give him another double click.

02:01:05.400 --> 02:01:06.340
I'm not going to leave.

02:01:07.280 --> 02:01:08.180
That you lost the.

02:01:09.640 --> 02:01:11.040
You lost this one here.

02:01:11.540 --> 02:01:12.180
This is the one that remains.

02:01:12.860 --> 02:01:13.500
Double click.

02:01:14.080 --> 02:01:14.120
Again.

02:01:15.480 --> 02:01:15.820
Up here.

02:01:17.940 --> 02:01:18.480
Ready.

02:01:18.840 --> 02:01:19.380
So.

02:01:20.220 --> 02:01:22.040
Let's go to the HDImpute.

02:01:22.220 --> 02:01:23.080
To configure it.

02:01:23.080 --> 02:01:25.200
We say to use existing connection.

02:01:27.860 --> 02:01:29.060
HDFS Connection.

02:01:29.340 --> 02:01:30.500
Take as a reference.

02:01:30.820 --> 02:01:32.180
The file that is called.

02:01:34.380 --> 02:01:36.060
Let's connect here.

02:01:38.500 --> 02:01:39.620
That is called.

02:01:41.360 --> 02:01:41.960
Supermarket.

02:01:42.820 --> 02:01:43.420
Sales.

02:01:46.220 --> 02:01:48.080
And I'm going to check that supermarket.

02:01:48.300 --> 02:01:49.860
If it's separated by what.

02:01:50.000 --> 02:01:50.700
If by comma.

02:01:50.980 --> 02:01:51.820
Or by dot com.

02:01:51.840 --> 02:01:53.400
But if by comma.

02:01:54.360 --> 02:01:55.280
By dot com.

02:01:55.280 --> 02:01:55.840
I'm going to check.

02:01:56.020 --> 02:01:57.020
To see what I forgot.

02:02:07.580 --> 02:02:08.440
Separated by comma.

02:02:08.540 --> 02:02:09.320
And they have header.

02:02:09.480 --> 02:02:09.740
Ok.

02:02:10.860 --> 02:02:10.920
So.

02:02:11.520 --> 02:02:12.720
I tell you in the input.

02:02:12.760 --> 02:02:13.300
I tell you.

02:02:14.200 --> 02:02:14.260
Well.

02:02:14.560 --> 02:02:15.300
Text file.

02:02:15.760 --> 02:02:16.200
Separator.

02:02:17.440 --> 02:02:17.880
Is.

02:02:18.940 --> 02:02:19.380
Comma.

02:02:19.860 --> 02:02:21.420
And the header is.

02:02:21.640 --> 02:02:21.900
One.

02:02:22.020 --> 02:02:22.240
Then.

02:02:22.560 --> 02:02:23.540
I'm going to look for the scheme.

02:02:23.540 --> 02:02:24.780
That surely I have it there.

02:02:25.800 --> 02:02:26.440
Temporal.

02:02:26.480 --> 02:02:27.460
I'm going to import it.

02:02:27.560 --> 02:02:28.200
Temporal.

02:02:28.460 --> 02:02:29.460
It must have a supermarket.

02:02:29.560 --> 02:02:30.800
Out there.

02:02:31.280 --> 02:02:32.540
I think we had read it already.

02:02:34.340 --> 02:02:35.620
This supermarket.

02:02:35.940 --> 02:02:36.580
XML.

02:02:45.560 --> 02:02:45.960
Perfect.

02:02:46.160 --> 02:02:46.360
Ready.

02:02:48.040 --> 02:02:50.120
Let's go to the aggregator row.

02:02:50.760 --> 02:02:51.300
Well.

02:02:51.480 --> 02:02:54.000
Since I want the sales of products by genre.

02:02:55.000 --> 02:02:55.200
Then.

02:02:55.520 --> 02:02:57.020
Let's go to the aggregator row.

02:02:57.420 --> 02:02:58.500
Let's edit the scheme.

02:03:00.180 --> 02:03:02.160
And let's go then.

02:03:02.420 --> 02:03:04.020
Gender to the right side.

02:03:04.500 --> 02:03:05.320
And product line.

02:03:09.180 --> 02:03:10.400
I do this in this way.

02:03:10.400 --> 02:03:11.840
It is to keep the names.

02:03:12.560 --> 02:03:13.020
So.

02:03:13.040 --> 02:03:15.060
It makes it easier for me to take the names.

02:03:15.100 --> 02:03:16.060
And the same type.

02:03:16.460 --> 02:03:17.600
Than adding the new ones.

02:03:17.600 --> 02:03:18.100
But.

02:03:18.180 --> 02:03:18.800
If I want.

02:03:18.800 --> 02:03:20.280
I can add it here with the plus.

02:03:20.800 --> 02:03:21.500
And let's go there.

02:03:21.640 --> 02:03:22.600
Total also.

02:03:23.160 --> 02:03:23.880
For this side.

02:03:25.320 --> 02:03:26.540
I do it just for that.

02:03:26.560 --> 02:03:27.880
To keep the names.

02:03:28.680 --> 02:03:30.280
The same names of output in the file.

02:03:31.360 --> 02:03:32.620
It makes it much easier for me.

02:03:33.920 --> 02:03:34.420
Ok.

02:03:36.440 --> 02:03:37.900
Then we group by.

02:03:38.520 --> 02:03:38.660
Gender.

02:03:39.540 --> 02:03:40.720
And by product line.

02:03:41.680 --> 02:03:41.700
Yes.

02:03:41.700 --> 02:03:42.140
Yes.

02:03:42.320 --> 02:03:42.360
Of course.

02:03:43.400 --> 02:03:43.960
Yes.

02:03:44.220 --> 02:03:44.640
Of course.

02:03:47.000 --> 02:03:48.120
Fernanda Lisbe.

02:03:48.120 --> 02:03:49.020
I already connected.

02:03:49.200 --> 02:03:50.200
Let's see.

02:03:50.680 --> 02:03:52.460
Go to.

02:03:52.580 --> 02:03:54.540
Edit the HFC input scheme.

02:03:54.700 --> 02:03:55.100
If you are kind.

02:03:55.260 --> 02:03:55.660
There where it is.

02:03:56.940 --> 02:03:58.120
Go to import.

02:03:59.440 --> 02:04:00.400
And surely.

02:04:00.720 --> 02:04:01.500
In the temporary folder.

02:04:01.640 --> 02:04:02.740
It has to have.

02:04:02.740 --> 02:04:04.920
An XML of supermarket.

02:04:05.040 --> 02:04:05.880
The last one it has.

02:04:06.000 --> 02:04:06.400
Perfect.

02:04:06.820 --> 02:04:07.300
Ok.

02:04:08.960 --> 02:04:10.700
In the separator then.

02:04:10.800 --> 02:04:11.420
Of field.

02:04:12.140 --> 02:04:13.000
It is a comma.

02:04:13.100 --> 02:04:13.780
Not point and comma.

02:04:13.940 --> 02:04:14.740
And the header.

02:04:14.740 --> 02:04:16.580
It is one.

02:04:16.780 --> 02:04:17.920
Because it has a header.

02:04:19.460 --> 02:04:19.740
Perfect.

02:04:19.880 --> 02:04:20.780
Let's go then.

02:04:20.880 --> 02:04:21.880
To the t-aggregator room.

02:04:22.020 --> 02:04:23.560
And delete the scheme.

02:04:25.100 --> 02:04:25.800
Go to the right side.

02:04:26.000 --> 02:04:27.600
Because it makes it easier for me to configure.

02:04:27.680 --> 02:04:29.040
To keep the same names.

02:04:29.440 --> 02:04:30.520
Go to gender.

02:04:31.460 --> 02:04:32.240
Product line.

02:04:32.660 --> 02:04:33.300
To the right.

02:04:33.460 --> 02:04:33.760
Ready.

02:04:33.960 --> 02:04:34.840
Product line.

02:04:36.280 --> 02:04:36.920
And total.

02:04:36.920 --> 02:04:37.840
Which is a little lower.

02:04:38.100 --> 02:04:38.520
Ok.

02:04:38.600 --> 02:04:39.040
Ready.

02:04:39.460 --> 02:04:40.740
Now let's go to the following.

02:04:40.800 --> 02:04:42.160
Let's define the operation.

02:04:42.240 --> 02:04:43.660
The operation as it is total.

02:04:43.660 --> 02:04:46.120
Give it a click here.

02:04:46.140 --> 02:04:47.460
And the total change appears.

02:04:47.820 --> 02:04:49.100
And the function.

02:04:50.700 --> 02:04:51.300
Is sum.

02:04:54.660 --> 02:04:55.600
Ignore the nulls.

02:04:55.660 --> 02:04:57.260
Because I don't remember if there are nulls.

02:04:57.460 --> 02:04:58.020
Yes.

02:04:58.780 --> 02:04:59.100
Please.

02:04:59.340 --> 02:05:00.260
Ignore the nulls.

02:05:00.400 --> 02:05:01.880
And there you don't have to do anything else.

02:05:02.820 --> 02:05:04.640
Let's go to the HF output.

02:05:07.100 --> 02:05:09.460
Let's say that we are going to use an existing connection.

02:05:11.180 --> 02:05:13.380
Let's go here to HF connection.

02:05:15.060 --> 02:05:15.900
And ready.

02:05:16.000 --> 02:05:17.900
And we are going to choose a file.

02:05:17.940 --> 02:05:19.560
To later change the name.

02:05:22.260 --> 02:05:24.220
So I'm going to call it.

02:05:25.000 --> 02:05:25.640
Sales.

02:05:28.120 --> 02:05:28.900
Under score.

02:05:29.440 --> 02:05:29.620
By.

02:05:29.640 --> 02:05:30.320
Under score.

02:05:30.320 --> 02:05:30.760
Gender.

02:05:33.380 --> 02:05:35.700
Well, here I am progressing with Big Data.

02:05:35.820 --> 02:05:36.640
In which.

02:05:36.940 --> 02:05:37.580
Process.

02:05:38.380 --> 02:05:39.700
The data that I have.

02:05:39.860 --> 02:05:40.940
And I send it an output file.

02:05:41.440 --> 02:05:43.560
To the Hadoop Cluster.

02:05:43.900 --> 02:05:45.580
Initially what I did was that.

02:05:47.040 --> 02:05:48.440
Remember that the Create Table.

02:05:48.460 --> 02:05:50.520
We didn't manage to configure the.

02:05:50.960 --> 02:05:51.540
The.

02:05:51.680 --> 02:05:51.980
Scheme.

02:05:52.500 --> 02:05:54.720
Because the scheme depends on the structure of the output file.

02:05:55.060 --> 02:05:55.640
So.

02:05:55.960 --> 02:05:57.420
Since I have the structure of the output file.

02:05:57.560 --> 02:05:58.880
I'm going to edit the scheme in the.

02:05:58.880 --> 02:05:59.420
In the output.

02:06:01.520 --> 02:06:04.120
And export the scheme you have there.

02:06:06.420 --> 02:06:07.400
Call it.

02:06:07.680 --> 02:06:08.280
Sales.

02:06:08.660 --> 02:06:09.360
Under score.

02:06:09.660 --> 02:06:09.680
By.

02:06:09.860 --> 02:06:10.460
Gender.

02:06:10.800 --> 02:06:11.360
XML.

02:06:12.060 --> 02:06:13.080
You always have to use the XML.

02:06:13.240 --> 02:06:14.120
Because then you don't read it.

02:06:17.720 --> 02:06:18.320
Ok.

02:06:18.900 --> 02:06:20.020
I export it initially.

02:06:21.220 --> 02:06:23.500
And now I'm going to import it in the Create Table.

02:06:23.540 --> 02:06:24.980
So the table will remain.

02:06:25.600 --> 02:06:27.300
With the same scheme.

02:06:27.680 --> 02:06:28.280
Import.

02:06:29.860 --> 02:06:31.120
And sales by gender.

02:06:31.360 --> 02:06:32.380
And there I have the scheme.

02:06:33.360 --> 02:06:35.160
What I don't like about how it turned out.

02:06:35.240 --> 02:06:36.980
Was that it has the first in capital letters.

02:06:37.100 --> 02:06:37.300
I don't know.

02:06:38.260 --> 02:06:39.080
As usual.

02:06:39.240 --> 02:06:39.980
I'm going to leave it like that in the middle.

02:06:40.880 --> 02:06:41.280
Well.

02:06:41.300 --> 02:06:42.560
Let's continue then.

02:06:42.920 --> 02:06:43.920
Let's go here.

02:06:44.000 --> 02:06:44.860
Where it says.

02:06:44.960 --> 02:06:45.200
Well.

02:06:45.300 --> 02:06:46.440
We are going to.

02:06:46.440 --> 02:06:47.840
That the subjobs are different.

02:06:48.560 --> 02:06:49.860
I'm going to connect then.

02:06:49.860 --> 02:06:51.720
The subjobs of the HDF input.

02:06:51.740 --> 02:06:53.680
With a subjobs of the Teja and Loa.

02:06:54.260 --> 02:06:54.720
Remember.

02:06:54.720 --> 02:06:56.180
To do the subjobs.

02:06:56.180 --> 02:06:58.620
The first component of the subjobs.

02:06:58.840 --> 02:07:01.180
Connects it with the first component of the second subjobs.

02:07:03.320 --> 02:07:04.360
Keep in mind.

02:07:04.880 --> 02:07:05.620
That here.

02:07:05.620 --> 02:07:07.100
I don't know if I put it.

02:07:07.660 --> 02:07:08.980
Because by default.

02:07:09.260 --> 02:07:10.400
The high index file.

02:07:10.760 --> 02:07:11.080
Wait comma.

02:07:11.320 --> 02:07:12.660
It escaped me.

02:07:12.880 --> 02:07:13.480
So.

02:07:13.760 --> 02:07:15.060
In the HDF opus.

02:07:15.220 --> 02:07:15.700
Please.

02:07:15.760 --> 02:07:16.640
Place comma.

02:07:16.640 --> 02:07:17.380
As separator.

02:07:19.340 --> 02:07:20.420
And it escaped me.

02:07:20.480 --> 02:07:21.800
That includes headers.

02:07:24.600 --> 02:07:25.100
Perfect.

02:07:25.320 --> 02:07:25.560
It's there.

02:07:25.700 --> 02:07:26.440
We are almost there.

02:07:27.280 --> 02:07:27.920
Perfect.

02:07:28.380 --> 02:07:28.860
So.

02:07:29.140 --> 02:07:29.780
Let's configure.

02:07:29.880 --> 02:07:30.520
The T.

02:07:30.520 --> 02:07:30.800
In the output.

02:07:31.000 --> 02:07:31.480
Of comma.

02:07:31.820 --> 02:07:31.900
Perfect.

02:07:32.140 --> 02:07:32.480
No problem.

02:07:33.220 --> 02:07:33.480
Well.

02:07:33.700 --> 02:07:33.860
Here.

02:07:34.100 --> 02:07:34.580
As suddenly.

02:07:34.660 --> 02:07:35.060
In the opus.

02:07:35.080 --> 02:07:36.220
We can execute it several times.

02:07:36.380 --> 02:07:36.800
Please.

02:07:36.900 --> 02:07:37.680
Place it there.

02:07:37.760 --> 02:07:38.520
Over white.

02:07:40.420 --> 02:07:40.560
Here.

02:07:40.700 --> 02:07:41.280
As a curious thing.

02:07:41.480 --> 02:07:42.160
If the guy.

02:07:42.220 --> 02:07:42.660
In the opus.

02:07:42.700 --> 02:07:43.160
If the guy.

02:07:43.240 --> 02:07:43.940
Had chosen.

02:07:43.940 --> 02:07:45.060
As sequence files.

02:07:45.980 --> 02:07:46.320
The table.

02:07:46.380 --> 02:07:47.400
It has to be of sequence type.

02:07:47.540 --> 02:07:47.780
Because.

02:07:47.780 --> 02:07:48.060
Otherwise.

02:07:48.300 --> 02:07:49.600
They are not compatible.

02:07:50.160 --> 02:07:51.360
Let's go to Teja and Loa.

02:07:53.940 --> 02:07:54.920
Let's tell it.

02:07:55.100 --> 02:07:55.900
Use existing connection.

02:07:59.340 --> 02:07:59.760
Perfect.

02:07:59.880 --> 02:08:00.060
There.

02:08:00.840 --> 02:08:01.780
This Teja and Loa.

02:08:01.860 --> 02:08:03.260
It has two ways to load.

02:08:03.880 --> 02:08:05.260
If you go in the Loa action.

02:08:05.440 --> 02:08:06.220
The Loa appears.

02:08:06.340 --> 02:08:07.300
And if the second option.

02:08:07.320 --> 02:08:08.260
Choose insert.

02:08:08.940 --> 02:08:09.380
Practically.

02:08:09.400 --> 02:08:09.820
It comes out.

02:08:09.840 --> 02:08:10.640
The same menu.

02:08:10.720 --> 02:08:11.460
That I had now.

02:08:11.460 --> 02:08:12.560
With the Teja and Loa.

02:08:13.440 --> 02:08:13.880
So.

02:08:14.060 --> 02:08:14.600
I'm interested in.

02:08:14.600 --> 02:08:15.980
How to do it with the Loa.

02:08:16.580 --> 02:08:17.180
This Loa.

02:08:17.220 --> 02:08:18.280
It has to be much faster.

02:08:18.400 --> 02:08:19.420
Because what it's going to load.

02:08:19.440 --> 02:08:20.140
Is a file.

02:08:20.700 --> 02:08:21.140
Directly.

02:08:21.320 --> 02:08:22.540
It's not going to be a SQL sentence.

02:08:22.880 --> 02:08:23.200
If not.

02:08:23.200 --> 02:08:23.920
It's going to be a bulk.

02:08:24.140 --> 02:08:24.860
To load.

02:08:25.180 --> 02:08:25.620
The records.

02:08:25.700 --> 02:08:26.340
To the table.

02:08:27.200 --> 02:08:27.580
So.

02:08:27.640 --> 02:08:28.160
Let's go here.

02:08:28.160 --> 02:08:29.240
The PAD.

02:08:29.500 --> 02:08:30.560
The PAD.

02:08:30.560 --> 02:08:31.360
Look at the following.

02:08:31.580 --> 02:08:32.060
Down here.

02:08:32.140 --> 02:08:33.440
There's a local.

02:08:34.020 --> 02:08:36.200
If I click this local.

02:08:36.560 --> 02:08:37.920
It's going to look for the file.

02:08:37.920 --> 02:08:39.340
That references here.

02:08:39.520 --> 02:08:40.280
In the location.

02:08:40.440 --> 02:08:40.760
That is on this side.

02:08:41.060 --> 02:08:42.700
That is to say.

02:08:42.700 --> 02:08:43.300
That if the local.

02:08:43.300 --> 02:08:44.180
I leave it here.

02:08:44.860 --> 02:08:45.360
In reality.

02:08:45.380 --> 02:08:46.520
It would be looking.

02:08:46.760 --> 02:08:47.320
The local.

02:08:47.480 --> 02:08:48.000
The file.

02:08:48.080 --> 02:08:48.660
That I place here.

02:08:48.660 --> 02:08:49.320
In the container.

02:08:50.840 --> 02:08:52.540
But I want the file.

02:08:53.040 --> 02:08:54.480
To be in Hadoop.

02:08:54.580 --> 02:08:54.980
So.

02:08:55.020 --> 02:08:56.340
I'm going to click this local.

02:08:56.340 --> 02:08:56.900
That is here.

02:08:56.900 --> 02:08:58.640
I am going to copy.

02:08:58.800 --> 02:08:59.640
The file.

02:08:59.640 --> 02:09:01.800
The route would be.

02:09:03.280 --> 02:09:04.220
This output.

02:09:04.500 --> 02:09:04.820
Is here.

02:09:05.020 --> 02:09:06.200
I'm going to copy the route.

02:09:06.420 --> 02:09:06.860
Output.

02:09:06.980 --> 02:09:07.860
That will be the file.

02:09:07.980 --> 02:09:09.000
For the Lua.

02:09:09.180 --> 02:09:09.620
Copy.

02:09:10.520 --> 02:09:11.800
And I paste it.

02:09:11.860 --> 02:09:12.600
In the Tejai.

02:09:12.720 --> 02:09:12.880
Lua.

02:09:14.420 --> 02:09:15.340
In the Failpad.

02:09:15.940 --> 02:09:16.560
In the Failpad.

02:09:16.780 --> 02:09:18.840
Of the Tejai.

02:09:18.960 --> 02:09:19.120
Lua.

02:09:19.780 --> 02:09:20.640
And the table.

02:09:20.780 --> 02:09:21.340
Let's call it.

02:09:22.140 --> 02:09:23.160
I'm going to copy the name.

02:09:23.280 --> 02:09:23.460
For.

02:09:23.460 --> 02:09:35.220
no problem, I copy the name and paste it here, just in case I have to save it in case of doubt,

02:09:35.260 --> 02:09:43.520
who takes it? Ok, ok, well, I'm going to try, I think everything is fine here, I'm going to try to

02:09:43.520 --> 02:09:48.840
run to see if there is no error, this should create the table and load the data that is

02:09:48.840 --> 02:09:58.600
in the output output file, I'm going to give it a run here to see what comes out,

02:10:00.240 --> 02:10:08.660
it is starting the process of bin data with my reduce, it created and supposedly there is no error,

02:10:09.180 --> 02:10:13.940
I'm going to see if it created the table, there are some that came out in error, not me, I'm going to

02:10:13.940 --> 02:10:22.920
see my first review here with me, I'm going to see, I'm going to extract them again,

02:10:22.940 --> 02:10:30.580
here it is, I think the table is sold, perfect, I'm going to consult it quickly here,

02:10:30.780 --> 02:10:36.820
if there is no error, there is no error, select asterisk from sales

02:10:40.720 --> 02:10:42.420
or under the core genero

02:10:50.800 --> 02:10:58.300
and the problem is that it inserted it to me but in a single column, wait a moment to see,

02:10:58.300 --> 02:11:07.300
I have run the errors, I have an error, I think with the, with the comma, let it load,

02:11:13.800 --> 02:11:18.320
ah no, what happens is that yes, yes, here I had a problem because the escape character of the

02:11:18.320 --> 02:11:23.540
table by default was in point and comma, change the escape character of the table please to

02:11:24.100 --> 02:11:30.480
escape this part and there was the problem, I'm going to try to run it again

02:11:31.660 --> 02:11:37.700
and there should be a problem with the separation characters, I thought that by default it was

02:11:37.700 --> 02:11:43.580
comma and in reality it is point and comma, I see that there are some that have errors, I'm going to

02:11:46.800 --> 02:11:50.480
ready, apparently ready, let's see

02:11:56.380 --> 02:12:01.340
no, I got the same error, ah, no, here is a problem, the problem I have is that, let's see,

02:12:01.880 --> 02:12:06.420
what is the problem I have now, the problem is that the table is already created, I will not

02:12:06.880 --> 02:12:13.600
create it again and the table has by default, I had, sorry, by default the separator point and comma,

02:12:14.080 --> 02:12:18.900
so it is easier for me because it is already created, I will not create it again, it is easier for me to

02:12:18.900 --> 02:12:25.260
change here in the output so that it is point and comma, the separator, I thought the separator was

02:12:25.260 --> 02:12:32.860
by default comma, then point and comma, let's see now yes, run again,

02:12:35.600 --> 02:12:39.260
I see that you have an error there, I do not know if you have overwrite in the output,

02:12:41.670 --> 02:12:51.310
you check to see if you have it overwrite and I try to run again and now yes, if the problem was

02:12:51.310 --> 02:12:56.630
that it was already created then the separator is point and comma, let's see, here it did not work,

02:12:58.130 --> 02:13:04.750
notice that the response time was superior, well, what did I do, I don't know, what a shame,

02:13:04.750 --> 02:13:11.930
I interrupted you, I'm going with you for a moment then, well, let's see what it says there, let's see,

02:13:12.390 --> 02:13:21.750
in the third line where it says Java file no phone, it did not find the file, it seems to me that

02:13:21.750 --> 02:13:30.270
go to hfimp if the configuration is wrong, of course, the problem is that our next one,

02:13:30.270 --> 02:13:38.390
I would be looking for that file in the root of the Hadoop closet, exactly, go there,

02:13:40.270 --> 02:13:47.750
if you want, but wait a moment, that is for the output or input, a moment, a moment,

02:13:50.130 --> 02:13:54.630
I left, I left, let me connect in an iterative way to be able to, a moment please,

02:13:55.370 --> 02:13:59.630
close the window for a moment that I left, a moment, close the window please,

02:14:01.210 --> 02:14:07.470
no, that input has to be, it is supermarket, then go back to the input, there you have it,

02:14:07.470 --> 02:14:15.750
sorry, go back to the, there, perfect, go to AXA and there it is supermarket,

02:14:16.390 --> 02:14:24.630
exactly, it is the entrance, ok, the output, please, the output, yes, that output is fine,

02:14:25.750 --> 02:14:30.430
let's go to Teja and Loa to see, to confirm, and it's fine, run to see.

02:14:33.550 --> 02:14:43.450
Well, there came out a small warning, I don't know, it already exists, well, it's over, if you want,

02:14:43.630 --> 02:14:48.090
do the following, let's do all the following to confirm the data then,

02:14:48.210 --> 02:14:53.710
that I did it over there, it was through the console, ready, let's do then, read data,

02:14:55.230 --> 02:14:59.630
ok, those who have already finished, please add a new job to read the table,

02:14:59.910 --> 02:15:06.370
remember that you have to refresh the scheme and then make an input and a telorow, ready,

02:15:06.590 --> 02:15:16.110
let's see, let's go to the output, please, to the output, ready, that's fine, let's go to

02:15:17.790 --> 02:15:22.030
Teja and Loa, remove the local, please, remove the local because it was looking for it there,

02:15:22.930 --> 02:15:28.790
the local chulito, because it was looking for it by default in the container and it is not there,

02:15:29.110 --> 02:15:36.290
try to execute again to see, ready, it's over, then, let's leave it there still,

02:15:36.330 --> 02:15:41.150
that we are going to do, we are going to consult it with the tool, to consult it with the tool,

02:15:41.150 --> 02:15:46.550
first please update the connection to the data base of Hi, how would it be,

02:15:46.670 --> 02:15:59.470
I go here to the Axa and update, extract scheme, next, next, Axa here, and here it was already,

02:15:59.490 --> 02:16:01.870
we selected 20 geniuses, if you don't have it, please select it,

02:16:05.610 --> 02:16:16.190
ah ok, let's see, let's see, it can't recognize, table not phone,

02:16:16.290 --> 02:16:23.510
I think it says, a little more to the right, yes, exactly, table not phone, sales by genre,

02:16:23.670 --> 02:16:31.690
ready, go to the Teja and create table for a moment, Teja and create table, copy in case that

02:16:31.690 --> 02:16:38.490
name comes from there, directly copy it, and go to Teja and load it, paste it there then,

02:16:41.970 --> 02:16:43.250
try to execute again to see,

02:16:51.630 --> 02:16:57.010
no there it says, table not phone, sales...

02:16:59.570 --> 02:17:04.590
ah, there, there, there, there, surely that is it, go to Teja and create table,

02:17:18.810 --> 02:17:29.070
And now, try to execute, yes, what happened, I tried to create the table, but I couldn't because the scheme was missing, and when I referenced it, it was created.

02:17:31.550 --> 02:17:43.130
Perfect, now do the next job, a new job in Hive that we are going to read, yes please, do that, it will be able to extract the scheme.

02:17:43.130 --> 02:18:09.070
Next, expand there AXA, choose sales in Hive, I already created it like this, next, finish, ready, then I was already in that part, create now a new job called job, read, job, read, sales table, nothing else, sales under core Hive.

02:18:13.750 --> 02:18:16.050
Here, finish, ready.

02:18:18.170 --> 02:18:32.250
Well, and then how do you do the sales table, I release it on this side, and remember that this is high input, I place a teloc row, and I send it to run.

02:18:33.530 --> 02:18:51.050
Well, here it is clear that with the tables created, and after you progress with Big Data, then what comes is pure SQL, or any component that you want to use, here it is, perfect, there it is.

02:18:51.530 --> 02:18:53.390
If anyone does not show data, please let me know.

02:18:53.390 --> 02:18:59.750
I do not know Alejandro, is there an error? Oh no, but it is Alejandro, sorry, what comes, what does it tell me?

02:19:00.050 --> 02:19:11.210
There was an error, right? Oh no, let's read the truth, then let's go to the next one, first delete this high input 2, because now, good question, because then it escaped me.

02:19:11.650 --> 02:19:18.650
First, update the connection you have of the Hive metadata, how will it be? Go to the connection and let it extract the scheme.

02:19:18.650 --> 02:19:28.730
Extract the scheme, I'll do it, extract the scheme, next, expand there please, next, finish.

02:19:30.450 --> 02:19:37.390
Now, take the sales table, expand tables, there are tables, there it expands, and sales, release it here.

02:19:38.090 --> 02:19:49.730
Now, the way you were going to do it, it also came out, but this is faster, why? Because here it already has the table, it already has the configuration of the connection to the Hive, you do not have to do it again.

02:19:50.450 --> 02:19:54.390
Give it a telocrow, give it a telocrow and send it to run, then.

02:19:55.610 --> 02:20:01.390
It executed the previous one, you have to run reading data, Alejandro, are you running the previous one?

02:20:03.670 --> 02:20:09.550
Open now, read data, no, it executed the previous one too, close and open again, close that job.

02:20:10.990 --> 02:20:16.950
Save it, sorry, save it, the engineer applies, close and open again, yes, perfect.

02:20:17.970 --> 02:20:30.790
Ready, then, we made the example with Hadut, we made the example with Hive, and actually the example with Sparse, this Open Source tool does not allow me,

02:20:30.790 --> 02:20:35.750
but I'm going to show you how easy it is to configure it in the proprietary tool.

02:20:36.810 --> 02:20:45.190
In the proprietary tool, I'm going to close here for a moment, notice that the jobs you have have the same structure,

02:20:45.570 --> 02:20:51.570
Basic Room, Debut, Advanced, Target and Memory Room, right? The jobs you create.

02:20:51.570 --> 02:20:57.210
Now, the jobs you create in the talent big data proprietary tool, this one that is here,

02:20:57.590 --> 02:21:00.570
is now going to add a new component called Spark Configuration.

02:21:02.490 --> 02:21:08.750
That Spark Configuration tool in the first place, you define where your Spark cluster is going to be.

02:21:08.830 --> 02:21:15.570
You configure it initially, and now, wait a moment, there you configure it.

02:21:17.670 --> 02:21:23.410
Spark Job Moment Talent, ready, I left it to see if I can find the graphic.

02:21:25.050 --> 02:21:27.350
Let's see, I can't find a graphic.

02:21:28.030 --> 02:21:34.650
Well, there you configure it, sorry, this configuration is done immediately in the job,

02:21:35.430 --> 02:21:43.150
and what the job is going to do is generate all the code you have, the job of the components that you have or that you have configured,

02:21:43.150 --> 02:21:48.890
it sends them and converts them to Spark in Java, and sends them to run against the cluster that you have defined.

02:21:49.430 --> 02:21:52.130
Why the example doesn't say it? Because the tool doesn't allow it.

02:21:52.650 --> 02:21:56.930
But in this case, it is connecting to the AWS MR cluster,

02:21:58.450 --> 02:22:03.070
and what it does then is that, here, let's see if it shows me the job.

02:22:04.750 --> 02:22:09.450
It shows here the job, this job that is here, which is a conventional job,

02:22:09.450 --> 02:22:13.770
the only thing we are going to configure is that you are going to tell it that you are going to run it against Spark,

02:22:14.270 --> 02:22:17.950
and it will automatically generate the code and it will make the Spark there.

02:22:18.570 --> 02:22:21.030
That's the only thing it does, it's actually very simple.

02:22:21.670 --> 02:22:27.970
You don't worry about Spark, you don't worry about the job, you do it conventionally, and in the end it sends it to run against the cluster.

02:22:29.610 --> 02:22:33.050
That would be the way to run it against Spark,

02:22:33.250 --> 02:22:35.870
I apologize because the tool doesn't allow me that part.

02:22:36.870 --> 02:22:45.290
And as a new thing, within the latest versions that the talent has,

02:22:45.670 --> 02:22:48.210
it already comes with components from machine learning.

02:22:49.370 --> 02:22:52.670
So it also saves you a lot from the machine learning processes,

02:22:53.510 --> 02:22:55.590
that normally one gets married with Python,

02:22:56.310 --> 02:23:01.030
but normally one gets married with Python because the Python ecosystem for machine learning

02:23:01.030 --> 02:23:04.850
is much larger than the machine learning ecosystem that it has for Java.

02:23:04.850 --> 02:23:06.310
It doesn't mean that Java doesn't exist,

02:23:06.730 --> 02:23:11.130
but rather that the machine learning in Python is much more mature.

02:23:11.790 --> 02:23:15.250
So to not start projects from scratch, one does it with Python.

02:23:15.710 --> 02:23:18.690
But now here, in the new versions that the talent has,

02:23:19.110 --> 02:23:22.770
it adds some components to be able to analyze the data

02:23:22.770 --> 02:23:27.570
through machine learning and with this it generates some descriptive or predictive patterns,

02:23:27.810 --> 02:23:28.750
as you want to do it.

02:23:29.310 --> 02:23:30.770
So it saves you a lot of work.

02:23:30.770 --> 02:23:34.610
These components, I don't understand much about the license part,

02:23:34.770 --> 02:23:37.490
I understand that the licenses are handled in the following way.

02:23:37.850 --> 02:23:40.670
You buy the talent studio and in the end,

02:23:40.770 --> 02:23:43.650
if you want a big data component or a machine learning component,

02:23:44.150 --> 02:23:46.270
what you do in the end is that you pay talent

02:23:46.270 --> 02:23:50.110
to enable you in the repository to update the components

02:23:50.770 --> 02:23:51.870
in the license that you have.

02:23:52.810 --> 02:23:54.530
That is the way it is working.

02:23:55.870 --> 02:24:00.850
Allow me to give an example with the version of the talent studio

02:24:00.850 --> 02:24:05.510
that does not come with support to enable that talent studio.

02:24:06.390 --> 02:24:09.850
But I'm going to show you some different things.

02:24:10.050 --> 02:24:12.230
Allow me, now I'm going to share my screen.

02:24:12.950 --> 02:24:16.570
The one of my team, not the one of...

02:24:16.570 --> 02:24:18.630
Give me a moment, this is my team.

02:24:19.510 --> 02:24:20.570
Wait a moment, ready.

02:24:20.830 --> 02:24:24.270
Here I have downloads, here I have talent studio.

02:24:24.530 --> 02:24:28.870
Ready, this is the commercial version.

02:24:29.230 --> 02:24:32.770
Wait for me, it's opening on the other screen.

02:24:34.410 --> 02:24:35.970
I'll try to pass it here.

02:24:36.630 --> 02:24:37.190
Look at the following.

02:24:37.390 --> 02:24:39.170
First, what does it start to change?

02:24:39.390 --> 02:24:43.830
It starts to change in the way the tool works.

02:24:44.370 --> 02:24:47.650
In the sense that the Open Source tool is designed more for personal projects,

02:24:48.310 --> 02:24:51.110
but I could adapt it to group projects.

02:24:51.750 --> 02:24:53.790
And this is designed for group projects.

02:24:54.730 --> 02:24:59.630
So, in the Open Source tool, I connected to a local repository,

02:24:59.990 --> 02:25:05.590
that is, a folder where I was going to save the different jobs that I was doing.

02:25:06.390 --> 02:25:11.730
In this case, what this tool is going to do is that you have to define a project.

02:25:12.510 --> 02:25:15.750
By defining a project, you associate that project with a Git repository.

02:25:17.150 --> 02:25:19.430
I don't know, they got married with Git.

02:25:19.470 --> 02:25:24.730
Although Git is the most used repository by Git developers,

02:25:25.750 --> 02:25:30.990
but I don't know if that was the decision of why they got Git at the end.

02:25:31.630 --> 02:25:37.130
But you connect to a repository in which all the people who belong to the same project

02:25:37.130 --> 02:25:38.850
are going to connect to the same repository.

02:25:39.510 --> 02:25:42.790
And that is the way they are going to share the different jobs.

02:25:42.790 --> 02:25:45.590
Here, by default, this part is already configured.

02:25:46.070 --> 02:25:47.410
So, here I say Open.

02:25:48.390 --> 02:25:51.210
Here, this version is a hybrid version,

02:25:51.670 --> 02:25:58.670
in which the authentication is done by the TMS that is in the cloud.

02:25:59.770 --> 02:26:00.430
How do I do it?

02:26:00.870 --> 02:26:02.910
In the TMS, I generate a key,

02:26:03.630 --> 02:26:06.730
and that key is the one that will allow me to log in

02:26:06.730 --> 02:26:12.030
and be able to open this talent studio application at a local level.

02:26:13.050 --> 02:26:15.890
So, how do you manage the versions of this man?

02:26:16.270 --> 02:26:18.170
Well, he manages the following way.

02:26:18.970 --> 02:26:20.470
Suppose you have a project.

02:26:20.910 --> 02:26:23.090
Well, here it tells me to wait five days.

02:26:24.890 --> 02:26:27.850
I'm going to create ... here are two types of jobs now.

02:26:28.090 --> 02:26:31.130
A job that is called Standard and a Job Lab.

02:26:32.150 --> 02:26:34.170
I'm not showing it here,

02:26:35.030 --> 02:26:37.750
for the question they asked me about how they manage the versions.

02:26:38.910 --> 02:26:42.770
Although I have it ...

02:26:42.770 --> 02:26:46.090
I could configure something in the TMS and we can wait there.

02:26:46.530 --> 02:26:50.050
So, I have little left for the evaluation that has to be done in the end.

02:26:50.150 --> 02:26:54.130
Here are two types of, let's say, jobs.

02:26:54.450 --> 02:26:56.330
One called Job and one Job Lab.

02:26:57.090 --> 02:27:01.090
The Job Lab is to reuse components within a job.

02:27:01.610 --> 02:27:03.150
So, imagine where I can use a Job Lab.

02:27:03.230 --> 02:27:07.730
Note that I, in most of the jobs I did,

02:27:08.370 --> 02:27:10.770
I always copied a pre-job and a post-job.

02:27:11.270 --> 02:27:13.570
With some basic elements that I was going to need.

02:27:14.290 --> 02:27:16.190
So, I can make a Job Lab.

02:27:16.690 --> 02:27:19.610
That Job Lab already has the pre-job and the post-job.

02:27:19.910 --> 02:27:23.710
And what I do is add that Job Lab to a particular job.

02:27:23.890 --> 02:27:27.330
In short, that Job Lab is what makes it have a set of components

02:27:27.330 --> 02:27:28.870
that prevents me from copying and pasting.

02:27:29.110 --> 02:27:31.030
It automatically makes me copy and paste.

02:27:31.670 --> 02:27:33.430
That's what a Job Lab does in the end.

02:27:34.790 --> 02:27:37.370
In the Standard, look at the interface, it is very similar.

02:27:37.370 --> 02:27:40.670
Well, graphically, there is a little thing at the level of shape,

02:27:40.690 --> 02:27:42.050
but in the end it is the same.

02:27:42.790 --> 02:27:43.990
Let's go here to Standard.

02:27:44.090 --> 02:27:45.910
Let's create a Standard Job.

02:27:45.930 --> 02:27:49.030
Let's put Job Example here.

02:27:49.290 --> 02:27:51.070
It opens the same bed.

02:27:51.490 --> 02:27:52.810
Very, very similar.

02:27:53.890 --> 02:27:54.290
Ready.

02:27:54.630 --> 02:27:56.950
So, I'm going to create a metadata here.

02:27:58.230 --> 02:27:59.030
I hope it's good.

02:27:59.110 --> 02:28:00.890
I already have a metadata here.

02:28:01.370 --> 02:28:02.690
I'm going to take this metadata.

02:28:02.710 --> 02:28:04.050
I'm going to take this Free file.

02:28:04.970 --> 02:28:06.630
The same idea.

02:28:07.170 --> 02:28:10.050
I'm going to add the Teglo Roam here.

02:28:10.450 --> 02:28:13.030
And here I connect this man with this man here.

02:28:14.270 --> 02:28:14.910
So, ok.

02:28:15.630 --> 02:28:18.990
The metadata concept will be achieved in all the talents, first of all.

02:28:19.090 --> 02:28:23.830
Note that here there may be some metadata elements

02:28:23.830 --> 02:28:27.550
that are in the OpenBit data that are not here.

02:28:27.650 --> 02:28:29.890
For example, here Hadoop does not appear.

02:28:30.310 --> 02:28:30.710
Not there.

02:28:31.270 --> 02:28:32.170
Why does Hadoop not appear?

02:28:34.050 --> 02:28:36.210
The Hadoop file that I have does not have that part.

02:28:36.230 --> 02:28:40.150
But when you buy or acquire the Big Data license,

02:28:40.570 --> 02:28:41.230
Hadoop will appear.

02:28:41.950 --> 02:28:43.490
There are the different elements that are there.

02:28:44.770 --> 02:28:47.230
In other words, in the end it is a matter of license

02:28:47.230 --> 02:28:48.910
that they want to sell their product.

02:28:50.230 --> 02:28:52.210
The question is that you buy more in a few words.

02:28:53.350 --> 02:28:54.630
So, look at the following.

02:28:54.790 --> 02:28:57.870
Imagine that here I have this Job that is here.

02:28:57.890 --> 02:28:58.410
Ready.

02:28:59.750 --> 02:29:00.030
Ready.

02:29:00.050 --> 02:29:01.450
That Job is there.

02:29:04.050 --> 02:29:05.050
I'm going to try something new.

02:29:05.910 --> 02:29:08.490
And I'm going to create what one calls a ...

02:29:08.490 --> 02:29:09.010
What is it called?

02:29:09.630 --> 02:29:13.050
When I make a partition in a KIP, I make a different branch.

02:29:14.010 --> 02:29:15.150
I make a fork in the branch.

02:29:15.190 --> 02:29:15.970
I make a fork in the branch.

02:29:16.910 --> 02:29:19.430
But I'm going to try something, but I don't want to leave the original.

02:29:19.430 --> 02:29:22.170
So I go to this man and I tell him here.

02:29:22.650 --> 02:29:23.950
Open in another version.

02:29:25.230 --> 02:29:25.550
I don't know.

02:29:25.790 --> 02:29:27.030
Here I open in another version.

02:29:27.130 --> 02:29:28.830
And here is where he charges life.

02:29:29.090 --> 02:29:30.370
The version that I'm going to read.

02:29:30.710 --> 02:29:31.590
The Major, the Minor.

02:29:34.390 --> 02:29:36.110
I'm going to create a new version.

02:29:36.730 --> 02:29:38.050
I'm going to upload the new version to 0.2.

02:29:39.330 --> 02:29:40.550
I'm going to finish it here.

02:29:41.730 --> 02:29:44.290
And I start where I left in version 1.

02:29:44.590 --> 02:29:46.550
So this version we are going to assume that I see the change.

02:29:47.730 --> 02:29:49.130
And I'm going to put a theme here.

02:29:49.670 --> 02:29:50.810
I put the theme here.

02:29:51.110 --> 02:29:52.470
I connect the theme here.

02:29:53.430 --> 02:29:54.170
Out here.

02:29:54.310 --> 02:29:54.810
Ready.

02:29:55.590 --> 02:29:56.990
I'm going to give it here and save.

02:29:57.590 --> 02:29:58.850
And now I have two versions.

02:29:59.010 --> 02:30:02.470
So if I'm wrong, I can open another version.

02:30:02.470 --> 02:30:05.310
And I can open the one here.

02:30:05.590 --> 02:30:06.990
See how the one was left here.

02:30:07.750 --> 02:30:10.390
And I can open the last one, which would be the two.

02:30:10.390 --> 02:30:12.090
And I can start from the version that I want.

02:30:12.830 --> 02:30:14.570
So that's how the versions work.

02:30:14.710 --> 02:30:17.310
So these versions are saved locally.

02:30:18.430 --> 02:30:23.310
But in the end you would have to do a push.

02:30:24.350 --> 02:30:26.110
A push that you do it automatically here.

02:30:26.150 --> 02:30:28.270
You put the comic manager here.

02:30:28.330 --> 02:30:30.110
The comic message, sorry.

02:30:30.330 --> 02:30:31.150
Here is the push.

02:30:32.510 --> 02:30:36.350
And you can see that when the other members of your group do a pull.

02:30:37.190 --> 02:30:40.070
Then what that's going to do is take the last changes you have.

02:30:41.850 --> 02:30:44.890
Obviously all the good practices that you have at the level of software development are applied.

02:30:45.430 --> 02:30:48.170
And all the good practices that you have at the level of using version control.

02:30:49.290 --> 02:30:50.770
But here it makes it very simple.

02:30:50.870 --> 02:30:53.130
Here, for example, the documentation that is generating you.

02:30:53.850 --> 02:30:56.950
For example, I tell you here add documentation.

02:30:57.050 --> 02:30:58.930
I think here, sorry.

02:30:59.590 --> 02:31:01.770
I go here and I tell you.

02:31:04.130 --> 02:31:06.690
Generate html doc, html documentation.

02:31:07.010 --> 02:31:07.770
Finish here.

02:31:08.850 --> 02:31:10.430
I think here it generates it in the folder.

02:31:11.270 --> 02:31:19.350
And this in the end what it does is that it generates some files that describe me.

02:31:20.710 --> 02:31:21.670
Well, I'm going to do it here.

02:31:21.750 --> 02:31:24.470
Generate all the documentation jobs for a moment there.

02:31:24.850 --> 02:31:25.730
It depends on what you want.

02:31:25.910 --> 02:31:26.750
I mean, what functionality?

02:31:27.930 --> 02:31:30.710
No, by default that comes already.

02:31:30.710 --> 02:31:32.610
No, that comes integrated into the talent studio.

02:31:32.710 --> 02:31:33.690
In any version it is.

02:31:34.170 --> 02:31:41.330
The only thing you have is that it is your power to define and clear the repository where your jobs are going to be.

02:31:41.730 --> 02:31:45.190
Particularly because I am married to Github.

02:31:45.270 --> 02:31:46.830
Remember that Github is a Git service.

02:31:47.850 --> 02:31:54.050
So I get married to Github and there I create my projects and I relate them when I'm working in a group.

02:31:55.650 --> 02:31:58.570
That's what generates all the documentation for this part.

02:31:59.390 --> 02:32:00.790
I don't know if.

02:32:01.210 --> 02:32:02.710
Yes, tell me.

02:32:03.070 --> 02:32:05.410
No, it can be public and private.

02:32:05.910 --> 02:32:07.490
Just let me show you something here.

02:32:09.090 --> 02:32:10.430
Where do you do that?

02:32:10.890 --> 02:32:18.550
Remember that I told you that in the end the tool is integrated, it is hybrid and it is going to integrate with a tool that is the TMC.

02:32:19.190 --> 02:32:26.610
In the TMC you are going to create as many projects as you have to develop.

02:32:26.610 --> 02:32:32.610
So every time you have a new project, in the end what you do is that it comes here.

02:32:34.350 --> 02:32:35.610
I'll show you a little bit of the TMC.

02:32:38.670 --> 02:32:44.350
This was what we used for the talent data preparation and for the talent data storage.

02:32:44.470 --> 02:32:47.050
But we didn't see the talent more console.

02:32:47.770 --> 02:32:49.050
Wait a moment and load.

02:32:51.590 --> 02:32:54.070
On the other hand, while it loads here.

02:32:54.070 --> 02:33:01.130
Here, if the good practice at the level of documentation is important because you have a tool called the talent data catalog.

02:33:02.210 --> 02:33:13.270
So in the future, to do the part of the life cycle or lineage of the data, that is, where the data starts, what changes there are.

02:33:13.270 --> 02:33:18.630
Normally, I could do it with the power center and several ETVs.

02:33:18.850 --> 02:33:25.590
But since I bought the talent studio, I would believe that it will integrate its talent studio, the jobs, with the talent data catalog.

02:33:26.470 --> 02:33:30.310
Therefore, it is vital that the names of variables are well documented.

02:33:31.310 --> 02:33:38.890
Any documentation that can, in such a way that when someone recovers it already in the talent data catalog, they can understand what is done internally in the job.

02:33:39.910 --> 02:33:42.730
Well, here I go to the TMC, which is this one here.

02:33:45.050 --> 02:33:46.210
Let's see in project.

02:33:47.570 --> 02:33:50.630
Here I have the test project that I am connected and I tell you the following.

02:33:50.990 --> 02:33:54.510
This is the URL of the repository that I am connected.

02:33:55.410 --> 02:34:03.670
So what you're going to do here is just create a new project with the name you want and here you put the URL of the repository here.

02:34:03.890 --> 02:34:05.610
And that's it, there you integrate them.

02:34:06.230 --> 02:34:09.810
And it does all the part of the push, the pull, it will do it automatically.

02:34:09.810 --> 02:34:13.910
Well, that's it, here are many more things, but hey, it doesn't give time.

02:34:14.510 --> 02:34:22.490
I don't know if we start with the part of the case study.

02:34:22.610 --> 02:34:24.290
Here I have two case studies actually.

02:34:24.850 --> 02:34:29.190
There is a case study that it seems to me that it will not give us time, we have practically two hours.

02:34:30.430 --> 02:34:35.330
I think I'll wait a moment and show you the case study that I have prepared.

02:34:35.390 --> 02:34:38.670
I think we have to change it because it will not give us time.

02:34:39.810 --> 02:34:42.810
Either we cut it or I make one that is simpler.

02:34:44.970 --> 02:34:51.130
Wait a moment and I show you the case study here.

02:34:58.610 --> 02:35:01.510
I think I lost the case study.

02:35:05.010 --> 02:35:11.890
I don't know if it's going to take a little break of 10 minutes, but it will allow me to tell you that my son is calling me.

02:35:12.690 --> 02:35:17.910
It is not customary for him to call me, so I want to answer him in case he says something that is happening.

02:35:18.730 --> 02:35:20.510
Give me 10 minutes and I'll come then.

02:35:30.010 --> 02:35:31.410
Well, I'm back.

02:35:32.290 --> 02:35:33.330
Fortunately, everything is fine.

02:35:33.590 --> 02:35:37.250
I don't know if you are listening, are you there?

02:35:37.910 --> 02:35:41.490
Yes, well, that's it.

02:35:41.490 --> 02:35:46.150
I had shared a case study to solve here in class, but it didn't give time.

02:35:47.010 --> 02:35:48.510
This is the case study.

02:35:48.610 --> 02:35:56.150
Let's read it for a moment to see if you suddenly think it is very complicated because of the time we have, practically only two hours.

02:35:57.850 --> 02:35:58.850
It says the following.

02:35:59.210 --> 02:36:04.110
The idea is to simulate a kind of data lake, the simulation.

02:36:04.850 --> 02:36:06.130
So the description says the following.

02:36:06.130 --> 02:36:15.490
As the explosive growth of music streaming platforms such as Spotify and Apple Music, a large volume of data is generated associated with the profiles of bands and musical artists.

02:36:16.230 --> 02:36:22.750
Millions of users interact daily with these profiles, waiting for recommendations and searches in real time.

02:36:23.930 --> 02:36:35.130
To support this demand, the Rockbrand music platform wanted to migrate the 12 billion profiles of artists currently managing a system of scalable and high-performance rigdata.

02:36:35.130 --> 02:36:42.850
To carry out initial tests in its traditional infrastructure, the response times for simple consultations took more than three hours.

02:36:43.030 --> 02:36:45.690
Remember that in the end, Big Data is for cost analysis.

02:36:46.730 --> 02:36:50.910
I wanted to give you this time at a maximum of five minutes to go to the users.

02:36:52.030 --> 02:37:01.730
So here is placed an initial configuration in which you are going to create a folder in the Big Data tab called case study and we are going to create three subfolders.

02:37:01.730 --> 02:37:07.790
This is a folder that resembles the three folders that you handle in the data lake.

02:37:08.430 --> 02:37:12.070
Which is the bronze folder, silver and gold.

02:37:13.430 --> 02:37:19.150
Here it gives you recommendations to use peyotips for each of the jobs you are going to do.

02:37:20.190 --> 02:37:22.530
And here are the different activities that you have to do.

02:37:22.630 --> 02:37:26.690
First you are going to download the S3 files.

02:37:27.290 --> 02:37:37.470
Then you are going to put them in the Hadoop's host, and then you are going to resolve these jobs that are here.

02:37:38.010 --> 02:37:39.750
For the time being, it will not give you time.

02:37:39.870 --> 02:37:45.170
There are eight activities that are here and this one is prepared for practically four hours and we have half.

02:37:45.930 --> 02:37:52.870
So I propose for a matter of time, let's make one that is much simpler than the one I was going to do in class.

02:37:52.870 --> 02:37:59.110
If you want, we do it together and present it, I don't know if we all do it and present it then at the end.

02:37:59.110 --> 02:38:02.210
Or if you want, we do it separately, I don't know what you think.

02:38:03.010 --> 02:38:06.870
It is this, it is to make a data warehouse in Hype.

02:38:08.110 --> 02:38:09.830
This is much easier to solve.

02:38:11.570 --> 02:38:15.390
What do you prefer? Do we all do it or do we do it individually?

02:38:15.530 --> 02:38:17.330
Let's do it together then, perfect.

02:38:18.750 --> 02:38:21.930
For this, please, what are we going to do?

02:38:21.930 --> 02:38:27.590
Create a document there in the browser, I don't know if it's Word.

02:38:27.970 --> 02:38:33.110
Open Word there so that it resolves each point and at each point it will add a screen.

02:38:35.170 --> 02:38:36.930
Ok, that's the first thing you have to do.

02:38:39.910 --> 02:38:45.250
No, the tone machine has Ophi installed, then in its machine, in its machine and there it is taking screens.

02:38:46.730 --> 02:38:51.430
Ready, I'm going to locate myself in the machine, ready, click, close all.

02:38:53.530 --> 02:38:54.650
So there.

02:38:55.850 --> 02:39:00.170
Well, the first thing then is we are going to create a folder.

02:39:02.970 --> 02:39:06.990
Ok, I think we already have that file, please confirm it if we don't do it.

02:39:07.310 --> 02:39:18.410
Confirm within the files that you have in the Hadoop Cluster if you have one called team underscore nba.

02:39:19.450 --> 02:39:21.330
Ok, this one that is here, I am going to confirm to see.

02:39:22.190 --> 02:39:23.930
Confirm please if you have it.

02:39:27.510 --> 02:39:35.390
If you don't have it then to avoid, I'm going to do the following, I'm going to upload that file in the shared folder.

02:39:37.550 --> 02:39:40.890
Wait a moment please, I download it because it is in AWS.

02:39:41.970 --> 02:39:44.510
So that you have it and we can start from there, AWS.

02:39:44.710 --> 02:39:48.550
I'm going to upload it, it's called team nba, this one that is here.

02:39:48.550 --> 02:39:49.410
Look, verify.

02:39:50.730 --> 02:39:52.770
It doesn't have it, so let's do the following.

02:39:53.870 --> 02:39:57.470
I'm going to download, I'm going to look for it for a moment and I'm going to upload it.

02:39:57.990 --> 02:39:59.750
Well, I just put it in the shared folder.

02:40:01.590 --> 02:40:04.130
And who was it that didn't have it? Fernanda was it? Was it Fernanda?

02:40:06.150 --> 02:40:06.630
Ok.

02:40:08.190 --> 02:40:11.230
So download it initially from the shared folder.

02:40:11.870 --> 02:40:13.550
The others please wait for me for a moment.

02:40:15.770 --> 02:40:20.370
Wait a moment please while I review here.

02:40:20.850 --> 02:40:23.630
Perfect, I hope I lost the share window.

02:40:26.830 --> 02:40:28.870
Wait a moment please.

02:40:30.350 --> 02:40:31.130
Ok, now yes.

02:40:32.210 --> 02:40:35.310
Ok, did you download it? Yes, you downloaded it.

02:40:35.610 --> 02:40:42.370
Perfect, let's see if you have it in the browser, in the Hadoop cluster in the interface.

02:40:43.370 --> 02:40:45.690
And look, there is a button that says upload file.

02:40:45.790 --> 02:40:49.250
Look if you can upload it so you don't have to do the job.

02:40:50.590 --> 02:40:51.370
Do it manually then.

02:40:52.190 --> 02:40:54.310
Confirm if you uploaded it please.

02:40:55.870 --> 02:40:58.450
Ok, perfect, I uploaded it right? Perfect, I did it.

02:40:58.910 --> 02:41:00.790
Now yes, let's go then.

02:41:01.850 --> 02:41:04.370
First point, let's create a subfolder called case studio.

02:41:08.050 --> 02:41:11.370
Case studio, folder case underscore studio.

02:41:12.550 --> 02:41:17.930
And remember that at the level of what is data warehouse, a set of dimensions is defined.

02:41:18.770 --> 02:41:20.850
The dimensions can be in stars or in a snowflake.

02:41:21.030 --> 02:41:27.450
We are going to make then two dimensions, practically two tables in stars.

02:41:28.610 --> 02:41:31.090
For this, they will be mounted in HAI.

02:41:31.550 --> 02:41:34.630
We are going to create then the first case studio.

02:41:35.010 --> 02:41:36.510
If anyone stays please tell me.

02:41:36.510 --> 02:41:44.510
The first case studio, we are going to call it job dimension players.

02:41:47.190 --> 02:41:48.990
Job dimension players.

02:41:52.010 --> 02:41:54.670
And copy the pre-job and post-job.

02:41:54.710 --> 02:42:01.170
The last job we did, we had the one from HAI with which we created the table, the last table.

02:42:01.290 --> 02:42:03.710
I'm going to look for it here, which would be this one.

02:42:03.710 --> 02:42:08.870
Job sell product by genre.

02:42:09.510 --> 02:42:11.750
And we copy the pre-job and post-job.

02:42:12.570 --> 02:42:14.710
We copy this pre-job.

02:42:23.290 --> 02:42:26.550
And we copy this post-job.

02:42:30.550 --> 02:42:31.870
Copy list.

02:42:42.330 --> 02:42:47.910
It is asked that the names of the dimensions are in Spanish.

02:42:48.430 --> 02:42:56.090
Then in the THAI create table, we are going to create a table called players.

02:43:00.650 --> 02:43:01.310
Players.

02:43:02.030 --> 02:43:05.130
And add a scheme of three columns.

02:43:05.390 --> 02:43:07.090
Erase the columns that I had previously there.

02:43:09.030 --> 02:43:11.390
And add the following columns.

02:43:12.050 --> 02:43:15.250
We are going to put a full type identifier.

02:43:19.410 --> 02:43:20.070
Identifier.

02:43:21.730 --> 02:43:24.130
We are going to put a full type.

02:43:28.790 --> 02:43:32.090
Add a new column called name.

02:43:35.030 --> 02:43:39.030
This is going to be of type stream.

02:43:39.470 --> 02:43:43.410
Add a new column called team.

02:43:45.890 --> 02:43:48.290
And it will be of type stream, only three.

02:43:48.650 --> 02:43:50.150
We are going to give it here, ok.

02:43:56.970 --> 02:43:58.850
And then we go to the following.

02:43:59.130 --> 02:44:02.490
Now yes, we are going to make an HDFS input.

02:44:09.570 --> 02:44:12.070
We are going to make a theme that we are going to map.

02:44:16.190 --> 02:44:18.950
The table has all the data of team.

02:44:19.590 --> 02:44:20.130
I have not explained.

02:44:20.250 --> 02:44:23.930
Team has the data of all the games that the NBA has in a year.

02:44:23.930 --> 02:44:26.010
I do not remember if it was 2010 or 2012.

02:44:26.730 --> 02:44:29.310
You will notice that the players will appear NBS.

02:44:30.830 --> 02:44:32.710
Then we are going to do the following.

02:44:32.770 --> 02:44:35.130
We are going to add here a new component that we had not seen.

02:44:35.130 --> 02:44:38.930
That is called Tejunic row to eliminate duplicates.

02:44:40.690 --> 02:44:42.930
Teunic with Q at the end, row.

02:44:43.150 --> 02:44:47.310
This is going to send to an HDFS output.

02:44:50.350 --> 02:44:59.530
And it ends up in a HLWA to load the data in the table.

02:45:01.090 --> 02:45:02.330
Well, let's connect then.

02:45:02.810 --> 02:45:05.530
Connect the HF input to the theme.

02:45:06.970 --> 02:45:08.310
The theme to Tejunic.

02:45:11.730 --> 02:45:15.310
Tejunic, here call it names.

02:45:17.310 --> 02:45:20.270
The output between the theme and Tejunic names.

02:45:22.550 --> 02:45:23.290
Unique players.

02:45:24.690 --> 02:45:34.690
And unites Tejunic with the THF output of the Unique.

02:45:34.690 --> 02:45:38.710
In other words, those who filtered the repeated ones will not add it again.

02:45:38.710 --> 02:45:40.170
Perfect.

02:45:40.170 --> 02:45:47.710
I hope then some are configuring.

02:45:53.210 --> 02:45:54.610
Perfect, ready.

02:45:54.790 --> 02:46:01.770
Then we are going to initially configure the HDFS input.

02:46:02.010 --> 02:46:03.650
This one that is here.

02:46:03.650 --> 02:46:08.090
As the HF input, what I want to read is the file called team NDA.

02:46:09.850 --> 02:46:11.950
Then please do the following.

02:46:12.090 --> 02:46:15.690
We are going to copy a metadata so that the scheme is easier for us.

02:46:16.490 --> 02:46:29.530
If there are some that have already downloaded it, those who do not have it downloaded, go to their Hadood cluster and please download the file to get the metadata.

02:46:31.230 --> 02:46:31.770
Download.

02:46:32.510 --> 02:46:41.830
And then it will create a metadata of the Limiter file that points towards that file that we download now.

02:46:46.010 --> 02:46:48.770
I'm going to call it here team NDA.

02:46:49.890 --> 02:46:53.090
And I think it's separated by a comma, I don't know, I have to check to see.

02:46:53.790 --> 02:46:54.770
I'm going to look for it.

02:46:55.690 --> 02:46:57.150
It must be in downloads.

02:46:59.510 --> 02:47:01.070
There it is, team NDA.

02:47:01.770 --> 02:47:04.830
Separated by comma, indeed.

02:47:05.370 --> 02:47:06.770
Separated by comma.

02:47:08.170 --> 02:47:08.550
And refresh.

02:47:09.850 --> 02:47:11.550
Team NDA.

02:47:12.610 --> 02:47:13.690
Team NDA.

02:47:17.550 --> 02:47:19.670
Yes, indeed, you have to mark it in header.

02:47:24.070 --> 02:47:25.390
And there it is.

02:47:26.250 --> 02:47:27.470
Perfect, ready.

02:47:27.470 --> 02:47:32.050
So then give it here if you tell me who is talking to me.

02:47:33.390 --> 02:47:35.310
Alejandro, I'm going to go to you, Alejandro.

02:47:36.210 --> 02:47:37.750
I don't know if you have it there.

02:47:38.070 --> 02:47:39.930
Wait a moment please.

02:47:40.830 --> 02:47:41.610
Yes, download it please.

02:47:41.810 --> 02:47:42.770
Click there.

02:47:43.150 --> 02:47:44.070
It just has the name there.

02:47:44.310 --> 02:47:44.930
Give it download.

02:47:45.090 --> 02:47:46.070
There it is on the left.

02:47:46.650 --> 02:47:47.970
There is the download.

02:47:48.650 --> 02:47:50.270
And now if you point it towards it.

02:47:53.090 --> 02:47:54.210
Brenda, I'm with you.

02:47:54.290 --> 02:47:55.850
Brenda, yes.

02:47:56.690 --> 02:47:57.410
Let's see.

02:47:57.810 --> 02:47:59.170
Give it there in refresh for a moment.

02:47:59.370 --> 02:47:59.790
No, it's weird.

02:47:59.930 --> 02:48:00.710
Give it back for a moment.

02:48:00.830 --> 02:48:01.230
Give it back.

02:48:03.690 --> 02:48:04.410
Give it...

02:48:04.410 --> 02:48:05.690
Give it next again to see.

02:48:05.750 --> 02:48:06.790
Wait a moment to see there.

02:48:08.610 --> 02:48:09.750
This is really weird here.

02:48:09.890 --> 02:48:10.250
What happened?

02:48:10.750 --> 02:48:11.630
Give it cancel for a moment.

02:48:11.730 --> 02:48:12.770
I'll open it again.

02:48:12.830 --> 02:48:13.990
To see if it was that.

02:48:13.990 --> 02:48:15.310
Give it to team NDA.

02:48:15.690 --> 02:48:16.290
Browse there.

02:48:16.450 --> 02:48:16.810
Ready.

02:48:16.950 --> 02:48:18.650
Give it to team NDA again.

02:48:19.130 --> 02:48:19.590
One moment, one moment.

02:48:19.730 --> 02:48:20.610
It's that the file is empty.

02:48:20.650 --> 02:48:21.370
It's in zero.

02:48:21.670 --> 02:48:22.730
Because it's in zero.

02:48:23.890 --> 02:48:25.110
Go to the folder for a moment.

02:48:25.110 --> 02:48:28.290
You were the one who downloaded the shared folder, right?

02:48:28.530 --> 02:48:30.110
Go to the shared folder for a moment to see.

02:48:30.390 --> 02:48:31.430
If you want.

02:48:32.870 --> 02:48:34.110
Go to the shared folder.

02:48:34.770 --> 02:48:35.970
If it was that I uploaded it wrong.

02:48:36.450 --> 02:48:38.250
Give it download up there.

02:48:40.690 --> 02:48:43.430
And I check to see if it has data in the shared folder.

02:48:44.350 --> 02:48:44.790
Because...

02:48:44.790 --> 02:48:45.690
Yes, it has data.

02:48:45.790 --> 02:48:47.010
This is to see how it looks now.

02:48:47.070 --> 02:48:47.510
Ready.

02:48:48.010 --> 02:48:48.790
Open it to see.

02:48:48.810 --> 02:48:49.630
Go to downloads.

02:48:51.590 --> 02:48:52.650
Now if it has data.

02:48:53.110 --> 02:48:54.090
You have to upload it again.

02:48:55.130 --> 02:48:56.610
But let's go here in the team.

02:48:57.190 --> 02:48:58.150
Let's go again.

02:48:58.910 --> 02:48:59.530
Yes, because it is in zero.

02:48:59.550 --> 02:49:00.310
You have to realize that it is in zero.

02:49:00.330 --> 02:49:01.090
Now upload it.

02:49:02.610 --> 02:49:04.190
Then take team again.

02:49:04.210 --> 02:49:04.930
Team NDA.

02:49:05.150 --> 02:49:05.530
Now yes.

02:49:05.730 --> 02:49:06.070
Next.

02:49:06.170 --> 02:49:07.710
Separated by comma and they have header.

02:49:07.950 --> 02:49:08.910
Give it to Sexhear.

02:49:09.090 --> 02:49:09.750
If you want to put...

02:49:09.750 --> 02:49:10.130
Perfect.

02:49:10.350 --> 02:49:10.930
Give it to refresh.

02:49:11.030 --> 02:49:12.290
There is a small problem.

02:49:13.130 --> 02:49:16.110
What happens is that I add one more when I click it.

02:49:16.290 --> 02:49:16.490
Give it.

02:49:16.530 --> 02:49:17.170
Put one there.

02:49:17.870 --> 02:49:18.050
One.

02:49:18.090 --> 02:49:19.070
Put the number one there.

02:49:19.210 --> 02:49:20.150
Give it again to refresh.

02:49:20.630 --> 02:49:21.310
Now yes.

02:49:22.090 --> 02:49:22.690
Give it to Next.

02:49:22.690 --> 02:49:23.630
Please.

02:49:25.710 --> 02:49:26.950
We give it to Next here.

02:49:27.350 --> 02:49:27.690
And right here.

02:49:28.630 --> 02:49:30.450
We take the button that allows us to export.

02:49:30.490 --> 02:49:32.030
Which is the penultimate.

02:49:32.910 --> 02:49:35.090
And I'm going to put it as usual.

02:49:35.350 --> 02:49:35.830
Temporal.

02:49:35.830 --> 02:49:38.010
And I'm going to call it Team NDA.

02:49:38.430 --> 02:49:38.910
XML.

02:49:39.230 --> 02:49:39.710
NDA.

02:49:40.590 --> 02:49:41.070
XML.

02:49:44.470 --> 02:49:44.850
Finish.

02:49:45.290 --> 02:49:45.350
Release.

02:49:46.470 --> 02:49:46.950
Release.

02:49:48.690 --> 02:49:50.330
Yes, because they have header that is empty.

02:49:53.330 --> 02:49:54.250
Well.

02:49:55.110 --> 02:49:57.950
Now we are going to confirm the HDFS input.

02:49:59.330 --> 02:49:59.650
First.

02:49:59.750 --> 02:50:01.770
Import the Team NDA scheme.

02:50:05.490 --> 02:50:06.690
It already has it.

02:50:07.990 --> 02:50:08.730
OK.

02:50:11.050 --> 02:50:11.430
Second.

02:50:11.710 --> 02:50:14.010
Configure the existing connection.

02:50:18.270 --> 02:50:19.190
Third.

02:50:19.430 --> 02:50:22.170
Refer to the Team NDA file.

02:50:22.170 --> 02:50:23.570
Where it says file name.

02:50:23.690 --> 02:50:25.790
Search it please.

02:50:25.990 --> 02:50:26.610
Team NDA.

02:50:26.950 --> 02:50:27.170
There it is.

02:50:29.230 --> 02:50:31.390
And the field separator is comma.

02:50:32.450 --> 02:50:33.910
And the header is one.

02:50:34.370 --> 02:50:34.730
Remember.

02:50:34.790 --> 02:50:35.790
I repeat again.

02:50:36.210 --> 02:50:38.230
Import the Team NDA scheme.

02:50:38.830 --> 02:50:39.150
Second.

02:50:39.210 --> 02:50:40.210
Refer to the file.

02:50:40.350 --> 02:50:41.570
AXA Team NDA.

02:50:42.410 --> 02:50:42.950
Third.

02:50:43.030 --> 02:50:45.410
Change field separator from point comma to comma.

02:50:46.150 --> 02:50:47.370
And tell it that it has a header.

02:50:47.630 --> 02:50:48.370
Act then.

02:50:48.810 --> 02:50:49.250
Ready.

02:50:49.530 --> 02:50:50.650
As we are going to configure the theme.

02:50:52.170 --> 02:50:55.430
And in the theme what we are going to do is simply make a column filter.

02:50:56.490 --> 02:50:58.530
To the theme you are going to take the following fields.

02:51:00.110 --> 02:51:00.650
One moment.

02:51:00.950 --> 02:51:02.490
I'm going to take.

02:51:02.690 --> 02:51:03.170
Player ID.

02:51:04.770 --> 02:51:06.630
I release it on this side.

02:51:06.890 --> 02:51:09.170
And when I release the first one I can expand here.

02:51:12.050 --> 02:51:13.270
It takes player.

02:51:14.170 --> 02:51:15.670
And it takes team.

02:51:16.910 --> 02:51:18.310
Three fields that are practically united.

02:51:18.950 --> 02:51:22.510
Now in the case of Haydn does not handle primary keys.

02:51:23.690 --> 02:51:26.670
Conventionally as we handle them in.

02:51:27.330 --> 02:51:27.890
Transitional database.

02:51:28.690 --> 02:51:32.570
But here when you are going to export to a database.

02:51:32.570 --> 02:51:33.550
For example.

02:51:33.770 --> 02:51:35.190
If you put here key.

02:51:36.250 --> 02:51:38.450
And you are going to send it to a database connection.

02:51:38.510 --> 02:51:41.130
It is what is going to try is that it is going to create this as a primary key.

02:51:41.270 --> 02:51:44.810
But Haydn does not handle primary keys.

02:51:45.290 --> 02:51:46.310
We call here OK.

02:51:48.250 --> 02:51:48.770
Yes.

02:51:50.470 --> 02:51:52.390
And we go here where it says Tejunik row.

02:51:52.450 --> 02:51:54.190
And we are going to configure the Tejunik row.

02:51:55.190 --> 02:51:59.030
Here it configures those elements in which it wants to eliminate the duplicates.

02:51:59.710 --> 02:52:02.370
It turns out that the player ID is going to appear n times.

02:52:04.010 --> 02:52:05.050
So there.

02:52:05.130 --> 02:52:07.310
It is mentioned in the section that says key attribute.

02:52:07.570 --> 02:52:11.050
The other section does not apply because player ID is whole.

02:52:11.710 --> 02:52:15.590
And the sensitive case only applies if it is textual.

02:52:16.410 --> 02:52:17.830
So what would I do there?

02:52:18.350 --> 02:52:21.850
I would only let the player ID pass once.

02:52:22.150 --> 02:52:23.690
If it is repeated, it does not let them pass.

02:52:23.770 --> 02:52:24.130
Ready.

02:52:24.330 --> 02:52:25.170
There is nothing else here.

02:52:25.550 --> 02:52:27.770
I'm going to the HDFS output.

02:52:29.290 --> 02:52:30.750
I say existing connection.

02:52:31.590 --> 02:52:33.950
Here I make sure that the field separator is point and comma.

02:52:34.050 --> 02:52:36.070
Because the one on the table is point and comma.

02:52:37.470 --> 02:52:39.950
I'm going to check the one on the table that is point and comma.

02:52:40.550 --> 02:52:41.670
Yes. Point and comma.

02:52:41.890 --> 02:52:42.410
Of the table.

02:52:43.210 --> 02:52:44.330
We are not going to change there.

02:52:44.330 --> 02:52:48.730
Here what I am going to reference is the name that I want to give practically.

02:52:49.090 --> 02:52:50.990
So I choose the connection first.

02:52:52.310 --> 02:52:53.330
And then I'm going to choose.

02:52:55.330 --> 02:52:57.390
Any name to then change it.

02:52:57.450 --> 02:52:58.090
So.

02:53:00.250 --> 02:53:01.890
And here I'm going to change it.

02:53:02.970 --> 02:53:05.810
And I'm going to call players CSV.

02:53:09.790 --> 02:53:11.070
players.csv

02:53:13.350 --> 02:53:15.030
Yes, just in case we make a mistake.

02:53:15.270 --> 02:53:15.750
Yes, it is true.

02:53:15.910 --> 02:53:16.350
Let's go here.

02:53:16.570 --> 02:53:17.590
Just in case we make a mistake.

02:53:17.770 --> 02:53:19.690
Please copy the name of the file.

02:53:19.790 --> 02:53:21.710
The output to paste it here in the Teja and Loat.

02:53:22.970 --> 02:53:25.130
I'm going to copy it here.

02:53:25.830 --> 02:53:27.850
I'm going to include header.

02:53:28.130 --> 02:53:28.770
Thank you.

02:53:31.090 --> 02:53:32.370
Say include header.

02:53:32.610 --> 02:53:32.950
Perfect.

02:53:33.310 --> 02:53:34.610
So include header.

02:53:34.870 --> 02:53:36.470
We are missing Teja and Loat.

02:53:38.650 --> 02:53:40.890
Teja and Loat says use existing connection.

02:53:41.470 --> 02:53:42.890
I choose the connection.

02:53:43.670 --> 02:53:46.910
The path will paste the output output of the HDFS.

02:53:47.670 --> 02:53:51.370
And the name of the table must coincide with the name that you create in the Teja and Loat.

02:53:51.370 --> 02:53:52.470
I think I call it players.

02:53:53.250 --> 02:53:54.010
Yes, players.

02:53:56.350 --> 02:53:57.810
And I paste it here.

02:54:00.690 --> 02:54:04.310
Please then join the two sub jobs with a sub job.

02:54:04.550 --> 02:54:04.670
Ok.

02:54:06.470 --> 02:54:09.410
And we have to test if the first point does not work then.

02:54:09.590 --> 02:54:10.650
Perfect, ready.

02:54:10.950 --> 02:54:12.690
Well, let's try it then.

02:54:12.950 --> 02:54:15.210
Make sure that the one who reads it does the job.

02:54:16.170 --> 02:54:18.390
I'm going to run to say there is no error.

02:54:26.910 --> 02:54:29.370
The output of the HDFS.

02:54:29.490 --> 02:54:32.930
Well, now you have to check that this table has data.

02:54:32.950 --> 02:54:33.790
You have two ways.

02:54:34.310 --> 02:54:35.730
Or you do a job.

02:54:36.490 --> 02:54:38.090
That lists the table.

02:54:38.310 --> 02:54:39.270
You know how it would be.

02:54:39.290 --> 02:54:42.470
It would be to update the connection.

02:54:44.910 --> 02:54:46.170
Extract and do a job.

02:54:46.290 --> 02:54:47.330
This is one way.

02:54:48.650 --> 02:54:55.850
The other way is to do the SELE directly in the console of the container.

02:54:56.490 --> 02:54:57.090
So, look at the following.

02:54:57.170 --> 02:54:59.130
If it worked well, take a screen.

02:54:59.810 --> 02:55:00.230
Place.

02:55:00.710 --> 02:55:01.270
First point.

02:55:01.890 --> 02:55:05.110
This screen and the screen that solves that has data.

02:55:05.110 --> 02:55:07.170
Or you do it with a SQL query.

02:55:07.590 --> 02:55:08.790
Or you do it with a job.

02:55:09.050 --> 02:55:11.210
If it is with a SQL query, you go here to the container.

02:55:11.290 --> 02:55:13.110
And you are going to put SELECT.

02:55:14.070 --> 02:55:14.550
Asterix.

02:55:14.670 --> 02:55:15.850
From players.

02:55:20.530 --> 02:55:21.430
And you take the screen.

02:55:21.790 --> 02:55:22.390
That's it.

02:55:22.390 --> 02:55:22.570
Ready.

02:55:22.810 --> 02:55:23.290
Here.

02:55:24.490 --> 02:55:24.770
Ok.

02:55:24.890 --> 02:55:25.570
Perfect then.

02:55:25.690 --> 02:55:27.050
I'm going with Beatriz.

02:55:27.510 --> 02:55:28.810
And the others already know what to do.

02:55:28.850 --> 02:55:29.610
I'm going with you Beatriz.

02:55:29.670 --> 02:55:30.170
One moment.

02:55:31.350 --> 02:55:33.510
While the others are placing.

02:55:33.610 --> 02:55:34.130
Let's see.

02:55:36.070 --> 02:55:36.550
Ok.

02:55:37.230 --> 02:55:37.930
Give it.

02:55:38.070 --> 02:55:39.590
Cancel the same moment.

02:55:40.550 --> 02:55:41.290
Go down please.

02:55:41.610 --> 02:55:42.230
Go down a little.

02:55:42.690 --> 02:55:43.790
Go down the job.

02:55:44.570 --> 02:55:44.890
Ok.

02:55:45.050 --> 02:55:46.170
In that text there is CLOS.

02:55:46.350 --> 02:55:46.930
Give it a moment.

02:55:47.790 --> 02:55:48.030
No.

02:55:48.470 --> 02:55:49.590
It is in the text HILOA.

02:55:49.750 --> 02:55:51.270
Give it a click in the text HILOA please.

02:55:52.870 --> 02:55:54.990
There he forgot to choose the existing connection.

02:55:55.570 --> 02:55:56.570
There is the empty combo.

02:55:56.670 --> 02:55:57.590
It says COMPON ELEASE.

02:55:57.610 --> 02:55:58.830
It did not choose the connection.

02:55:59.810 --> 02:56:00.290
Ok.

02:56:00.390 --> 02:56:00.730
Perfect.

02:56:01.030 --> 02:56:02.630
Now try to run to see.

02:56:06.750 --> 02:56:07.310
Ok.

02:56:07.310 --> 02:56:09.530
It is starting to run.

02:56:10.110 --> 02:56:10.550
Remember.

02:56:10.590 --> 02:56:11.490
It would be two screens.

02:56:11.810 --> 02:56:12.690
The job screen.

02:56:13.350 --> 02:56:15.270
And the screen that shows you the data.

02:56:15.810 --> 02:56:16.970
Of the table you created.

02:56:17.750 --> 02:56:18.770
The table or the consult.

02:56:20.790 --> 02:56:21.350
Ready.

02:56:23.270 --> 02:56:23.470
Ok.

02:56:23.610 --> 02:56:23.830
I'm going.

02:56:24.070 --> 02:56:24.290
Give me a moment.

02:56:25.110 --> 02:56:26.330
To finish with Beatriz.

02:56:26.570 --> 02:56:26.950
Beatriz.

02:56:27.330 --> 02:56:27.830
You already.

02:56:27.830 --> 02:56:29.610
Give him a cell from players there.

02:56:29.650 --> 02:56:30.530
Please you.

02:56:31.970 --> 02:56:32.950
Who spoke to me?

02:56:32.950 --> 02:56:33.270
Alejandro.

02:56:33.430 --> 02:56:33.910
Yes.

02:56:34.730 --> 02:56:36.030
I'm going with you for example.

02:56:36.610 --> 02:56:38.030
Select Asterix from players.

02:56:38.350 --> 02:56:38.770
Point comma.

02:56:39.890 --> 02:56:40.090
Ok.

02:56:40.150 --> 02:56:40.770
There it would have.

02:56:40.990 --> 02:56:43.130
I'm going with Alejandro then.

02:56:44.730 --> 02:56:45.910
Take the two screens please.

02:56:46.110 --> 02:56:46.510
In the file.

02:56:48.450 --> 02:56:49.030
Ok.

02:56:49.250 --> 02:56:52.530
Go then again to your job.

02:56:52.790 --> 02:56:53.930
Show me your output.

02:56:55.190 --> 02:56:55.770
Perfect.

02:56:56.410 --> 02:56:58.230
There you have it.

02:56:58.230 --> 02:56:58.370
Ok.

02:57:00.210 --> 02:57:01.370
In case of doubt.

02:57:01.570 --> 02:57:01.790
Go.

02:57:01.790 --> 02:57:03.610
Edit the SDC scheme.

02:57:03.930 --> 02:57:05.570
Put a moment there.

02:57:05.770 --> 02:57:06.150
Perfect.

02:57:06.630 --> 02:57:06.650
Ok.

02:57:06.830 --> 02:57:07.390
In case of doubt.

02:57:07.490 --> 02:57:09.750
Go to your.

02:57:09.750 --> 02:57:10.810
To your.

02:57:10.810 --> 02:57:11.090
Cluster.

02:57:11.330 --> 02:57:11.990
Leave your moment.

02:57:12.770 --> 02:57:15.170
And it should have the player file there.

02:57:15.370 --> 02:57:16.010
Go to browser.

02:57:16.190 --> 02:57:17.750
Give it F5 there.

02:57:18.670 --> 02:57:19.670
It's not there.

02:57:20.490 --> 02:57:21.010
I don't think so.

02:57:22.130 --> 02:57:23.190
Let's go again.

02:57:23.250 --> 02:57:23.790
Let's go again.

02:57:24.390 --> 02:57:25.510
Let's go again.

02:57:25.910 --> 02:57:26.290
To the.

02:57:27.450 --> 02:57:27.970
To the.

02:57:28.010 --> 02:57:28.270
To the job.

02:57:30.470 --> 02:57:30.930
That's weird.

02:57:30.930 --> 02:57:33.930
Go to the high loader for a moment.

02:57:37.430 --> 02:57:39.610
This is weird because the file doesn't have it.

02:57:39.690 --> 02:57:41.010
Go again to see what happens.

02:57:41.710 --> 02:57:42.930
Show me again.

02:57:44.170 --> 02:57:45.050
The cluster.

02:57:45.870 --> 02:57:47.370
In the browser.

02:57:49.050 --> 02:57:49.590
To see the moment.

02:57:49.710 --> 02:57:51.230
I didn't see the player file.

02:57:51.490 --> 02:57:52.270
It was there.

02:57:52.930 --> 02:57:53.850
You don't know it's not there.

02:57:55.290 --> 02:57:55.830
Ok.

02:57:55.830 --> 02:57:57.130
Go again to the job.

02:57:59.730 --> 02:58:00.210
Ok.

02:58:00.210 --> 02:58:00.850
That's it.

02:58:01.050 --> 02:58:01.590
Show me again.

02:58:01.810 --> 02:58:02.790
Show me again the job.

02:58:03.150 --> 02:58:03.510
Ok.

02:58:03.630 --> 02:58:06.430
Now check the table for a moment please.

02:58:06.870 --> 02:58:08.470
But you're going to have the nulls initially.

02:58:08.690 --> 02:58:10.310
And then you're going to have.

02:58:11.230 --> 02:58:11.450
Ok.

02:58:11.710 --> 02:58:12.710
Click there.

02:58:13.790 --> 02:58:14.070
Go.

02:58:14.770 --> 02:58:15.210
Execute.

02:58:17.690 --> 02:58:18.130
Yes.

02:58:18.510 --> 02:58:18.950
That.

02:58:20.710 --> 02:58:21.850
The null is still sending it.

02:58:22.410 --> 02:58:23.190
How so?

02:58:23.690 --> 02:58:24.050
Show me.

02:58:24.110 --> 02:58:24.350
Now yes.

02:58:24.350 --> 02:58:25.150
Show me the file.

02:58:25.170 --> 02:58:27.650
How did it stay in Hadoop for a moment.

02:58:27.670 --> 02:58:28.270
In the browser.

02:58:28.270 --> 02:58:28.890
It's at zero.

02:58:29.050 --> 02:58:29.610
Do you see it?

02:58:29.690 --> 02:58:30.330
It's at zero.

02:58:30.450 --> 02:58:31.030
It's at zero.

02:58:31.150 --> 02:58:31.890
Do you see it?

02:58:32.870 --> 02:58:34.230
Let's go again to the job.

02:58:35.750 --> 02:58:37.590
Go to the Unicron for a moment.

02:58:37.830 --> 02:58:38.390
Ah, I know.

02:58:38.650 --> 02:58:41.130
Go to the Tmac for a moment.

02:58:41.190 --> 02:58:42.090
There's the problem.

02:58:42.990 --> 02:58:43.430
Ah, no, no.

02:58:43.430 --> 02:58:43.890
There's the problem.

02:58:44.250 --> 02:58:44.690
There it is.

02:58:44.890 --> 02:58:45.450
Player ID.

02:58:46.070 --> 02:58:46.470
Ok.

02:58:46.870 --> 02:58:47.650
No, it's fine.

02:58:48.830 --> 02:58:49.350
Wait a moment.

02:58:50.410 --> 02:58:51.350
No, no, no.

02:58:51.350 --> 02:58:51.650
No, no.

02:58:51.650 --> 02:58:51.970
There's a problem.

02:58:53.250 --> 02:58:54.030
It seems like.

02:58:54.610 --> 02:58:55.470
Yes, do it again.

02:58:55.630 --> 02:58:56.470
No, it's fine.

02:58:56.590 --> 02:58:57.210
It's fine.

02:58:57.950 --> 02:58:58.430
Ok.

02:58:58.670 --> 02:58:59.210
Give it ok there.

02:59:01.730 --> 02:59:03.330
Go again to the Unicron.

02:59:04.530 --> 02:59:06.150
Go to edit scheme for a moment.

02:59:06.410 --> 02:59:07.190
Edit scheme.

02:59:08.530 --> 02:59:08.830
There it is.

02:59:09.030 --> 02:59:09.890
Give it ok.

02:59:10.430 --> 02:59:13.210
And go to the THF output at the end.

02:59:15.770 --> 02:59:16.490
Oh, it's interesting.

02:59:17.290 --> 02:59:19.430
Let me tell you that I didn't see the error.

02:59:20.150 --> 02:59:20.790
Let me tell you.

02:59:20.950 --> 02:59:21.250
I'll see.

02:59:23.910 --> 02:59:25.090
Well, let's do the following.

02:59:25.090 --> 02:59:27.470
In this case, I'm going to test this theme here.

02:59:27.470 --> 02:59:29.870
And I'm going to put a telor row here.

02:59:30.190 --> 02:59:30.490
And tell me.

02:59:30.590 --> 02:59:31.890
Can't you see it or don't you have it?

02:59:33.150 --> 02:59:33.730
Ah, then.

02:59:34.930 --> 02:59:36.110
Connect and give it.

02:59:36.670 --> 02:59:37.250
Use AXA.

02:59:39.330 --> 02:59:39.770
In.

02:59:39.890 --> 02:59:40.470
In Dock.

02:59:40.730 --> 02:59:42.590
It seems to be connected to the default database.

02:59:43.070 --> 02:59:44.290
Use AXA.com.

02:59:44.430 --> 02:59:45.230
And then it gives.

02:59:45.530 --> 02:59:46.170
Be a player to see.

02:59:46.190 --> 02:59:47.170
Let's execute it for a moment, Alejandro.

02:59:47.210 --> 02:59:48.050
If you don't wait a moment.

02:59:48.910 --> 02:59:50.210
Let's execute it for now.

02:59:51.110 --> 02:59:52.010
Let's execute it.

02:59:52.010 --> 02:59:52.290
Let's see.

02:59:52.290 --> 02:59:54.670
Why are you saving that data?

02:59:57.150 --> 02:59:58.730
Well, the data goes there.

02:59:59.010 --> 02:59:59.750
I mean, there are the data.

03:00:00.430 --> 03:00:02.270
I mean, no.

03:00:03.250 --> 03:00:04.230
If it happens.

03:00:04.350 --> 03:00:05.290
I'm going back here.

03:00:07.590 --> 03:00:08.690
What was it?

03:00:08.930 --> 03:00:09.790
It was here.

03:00:10.050 --> 03:00:10.390
Here.

03:00:10.650 --> 03:00:11.310
F5.

03:00:12.930 --> 03:00:14.070
But he deleted it, right?

03:00:16.870 --> 03:00:18.170
Tabla la crea.

03:00:18.890 --> 03:00:20.670
But this man.

03:00:20.690 --> 03:00:21.530
It seems.

03:00:23.770 --> 03:00:24.270
Players.

03:00:24.970 --> 03:00:25.470
F5.

03:00:25.710 --> 03:00:26.130
Alright.

03:00:27.370 --> 03:00:28.710
Well, let's do the engineering.

03:00:29.010 --> 03:00:29.450
Let's see.

03:00:29.570 --> 03:00:31.710
Let's remove this component from here.

03:00:35.030 --> 03:00:37.050
I'm going to remove this other one.

03:00:37.090 --> 03:00:39.610
And I'm going to check that at this point there is data.

03:00:40.050 --> 03:00:41.830
I mean, because suddenly the line can arrive.

03:00:41.850 --> 03:00:43.690
But with the empty data that is null.

03:00:47.750 --> 03:00:48.970
Data is coming there.

03:00:49.650 --> 03:00:50.150
So.

03:00:50.830 --> 03:00:53.650
I'm going to create another HDFSUP here.

03:00:53.750 --> 03:00:56.850
In case suddenly there was some configuration that we did not see.

03:00:57.010 --> 03:00:58.930
Let's connect it here.

03:00:59.430 --> 03:00:59.550
Ready.

03:01:01.430 --> 03:01:02.650
Use existing connection.

03:01:03.690 --> 03:01:04.730
I already copied the name.

03:01:05.870 --> 03:01:07.110
And this would be ready.

03:01:07.330 --> 03:01:09.550
Axa, players, text file.

03:01:10.150 --> 03:01:11.570
Edit scheme here.

03:01:12.510 --> 03:01:14.370
This scheme here.

03:01:17.850 --> 03:01:18.750
Point and comma.

03:01:18.750 --> 03:01:20.370
Because this man was with a point.

03:01:20.630 --> 03:01:21.670
But you say comma.

03:01:21.870 --> 03:01:22.350
It fits.

03:01:26.110 --> 03:01:27.890
As you have comma here.

03:01:29.090 --> 03:01:30.350
And the table is created.

03:01:31.730 --> 03:01:32.750
You have to put here.

03:01:33.510 --> 03:01:34.550
Comma then.

03:01:34.950 --> 03:01:35.590
Include header.

03:01:35.790 --> 03:01:35.950
Ready.

03:01:36.770 --> 03:01:37.390
Let's see now.

03:01:38.810 --> 03:01:40.570
The other would be to delete the table.

03:01:41.190 --> 03:01:42.590
To be able to change the separator.

03:01:42.650 --> 03:01:43.890
But since the table is created.

03:01:44.550 --> 03:01:45.990
The separator by default was comma.

03:01:49.170 --> 03:01:50.070
Let's see here.

03:01:50.290 --> 03:01:50.750
It would be here.

03:01:52.210 --> 03:01:53.030
Indeed, Alejandro.

03:01:53.830 --> 03:01:56.650
It was the small comma that we had there.

03:01:57.190 --> 03:01:57.410
Ready.

03:01:57.490 --> 03:01:58.230
Let's take the screens.

03:01:58.330 --> 03:01:59.210
I think it was.

03:01:59.550 --> 03:01:59.750
Thank you.

03:01:59.870 --> 03:02:01.070
Sometimes you do not realize those errors.

03:02:01.390 --> 03:02:02.910
Sometimes you do not realize.

03:02:03.010 --> 03:02:03.890
I made a mistake in mine.

03:02:04.450 --> 03:02:05.230
Who told me?

03:02:05.270 --> 03:02:05.510
Beatriz.

03:02:05.550 --> 03:02:06.450
Was it you who did not ...

03:02:06.450 --> 03:02:07.370
It was perfect.

03:02:08.490 --> 03:02:10.130
Then the first point is ready.

03:02:11.330 --> 03:02:11.750
Who was that?

03:02:12.590 --> 03:02:13.130
Ah, Fer.

03:02:13.350 --> 03:02:14.250
This is the one I said.

03:02:14.430 --> 03:02:15.290
I thought it was.

03:02:16.310 --> 03:02:16.950
Ready, let's see.

03:02:20.030 --> 03:02:22.010
Ah, no, it is not connected.

03:02:22.470 --> 03:02:22.750
Let's see.

03:02:24.710 --> 03:02:26.650
Go to your container, please.

03:02:26.750 --> 03:02:28.230
If you are going to talk to the stock docker.

03:02:28.390 --> 03:02:28.910
There is the one.

03:02:29.830 --> 03:02:30.410
The little whale.

03:02:32.210 --> 03:02:33.110
Give hype there.

03:02:33.670 --> 03:02:34.110
Hi.

03:02:35.890 --> 03:02:37.230
And you have to reconnect.

03:02:37.370 --> 03:02:39.610
Let me see if you have something up there.

03:02:40.410 --> 03:02:41.890
I do not know if they have the connection up there.

03:02:41.930 --> 03:02:43.650
That one that connects with slash.

03:02:43.910 --> 03:02:45.150
Or you copy it.

03:02:46.510 --> 03:02:48.230
In the folder to share.

03:02:48.230 --> 03:02:48.870
Share.

03:02:49.210 --> 03:02:50.230
Go there, ready.

03:02:50.850 --> 03:02:51.370
Expression.

03:02:51.650 --> 03:02:53.630
Enter expressions.

03:02:54.190 --> 03:02:55.890
Copy the connection you have there.

03:02:56.910 --> 03:02:57.110
That one.

03:02:57.110 --> 03:02:58.290
Paste it, please, there.

03:02:59.330 --> 03:03:00.030
No, with right click.

03:03:00.130 --> 03:03:00.770
No, right click.

03:03:00.830 --> 03:03:01.750
You have to right click.

03:03:01.910 --> 03:03:02.370
Ready.

03:03:02.570 --> 03:03:03.230
Paste there.

03:03:03.770 --> 03:03:04.290
Enter.

03:03:05.730 --> 03:03:06.070
Ready.

03:03:06.450 --> 03:03:06.670
Enter.

03:03:07.890 --> 03:03:08.330
Enter.

03:03:09.910 --> 03:03:10.510
Give it.

03:03:10.710 --> 03:03:11.070
Use.

03:03:11.210 --> 03:03:13.450
Use space AXA that would be the database.

03:03:13.790 --> 03:03:15.110
Space AXA dot com.

03:03:15.250 --> 03:03:15.830
And go straight.

03:03:15.830 --> 03:03:18.370
So give it select asterisk from players to see.

03:03:18.490 --> 03:03:18.970
It separates.

03:03:19.890 --> 03:03:20.490
And sometimes.

03:03:21.550 --> 03:03:22.030
From.

03:03:23.210 --> 03:03:23.690
Players.

03:03:25.650 --> 03:03:26.170
Dot com.

03:03:26.410 --> 03:03:26.630
There it is.

03:03:26.750 --> 03:03:28.050
What it does is that you were.

03:03:28.370 --> 03:03:30.850
Is that I think you were connected to the default database.

03:03:30.950 --> 03:03:32.370
So it was not there.

03:03:32.630 --> 03:03:33.070
So.

03:03:33.910 --> 03:03:34.290
Screen.

03:03:34.470 --> 03:03:34.990
Two screens.

03:03:35.190 --> 03:03:36.650
The first screen is that of the job.

03:03:36.730 --> 03:03:38.930
And the second screen is that it checks that the data is there.

03:03:40.350 --> 03:03:40.970
We continue.

03:03:42.010 --> 03:03:42.490
Ok.

03:03:44.070 --> 03:03:44.610
Well.

03:03:44.610 --> 03:03:45.290
Let's see.

03:03:47.890 --> 03:03:48.450
Who spoke to me?

03:03:48.530 --> 03:03:48.890
Brenda.

03:03:49.370 --> 03:03:50.130
Or Beatriz.

03:03:50.650 --> 03:03:50.870
Brenda.

03:03:51.470 --> 03:03:52.510
Let's see how Brenda moves.

03:03:53.950 --> 03:03:54.730
Let me.

03:03:54.810 --> 03:03:55.950
And we solve the problem.

03:03:56.730 --> 03:03:58.490
Please go to.

03:03:59.290 --> 03:03:59.990
To the job.

03:04:02.110 --> 03:04:02.390
Now.

03:04:02.830 --> 03:04:04.510
As it happened to us with Alejandro.

03:04:04.590 --> 03:04:06.350
Go to the Teja and Create Table moment.

03:04:06.830 --> 03:04:08.450
To check Teja and Create Table.

03:04:08.530 --> 03:04:11.210
Look to see the separator that has in the Teja and Create Table.

03:04:11.510 --> 03:04:11.710
Double click.

03:04:12.570 --> 03:04:14.330
And there it is separated by comma.

03:04:14.570 --> 03:04:15.530
Effectively what it has.

03:04:15.750 --> 03:04:17.690
And then as it separated by comma.

03:04:18.110 --> 03:04:20.110
Go to the HDFS output.

03:04:20.610 --> 03:04:22.610
And I would have to separate that file by comma.

03:04:23.510 --> 03:04:24.850
So that's where it says field separator.

03:04:24.930 --> 03:04:27.390
I would have to see comma because it had to be the same as the table.

03:04:28.150 --> 03:04:29.050
Please execute it again.

03:04:34.450 --> 03:04:34.930
Perfect.

03:04:35.150 --> 03:04:37.030
Go now to the Docker desktop.

03:04:38.470 --> 03:04:39.210
Give it.

03:04:39.790 --> 03:04:40.150
Up.

03:04:40.350 --> 03:04:40.810
Enter.

03:04:40.810 --> 03:04:44.390
There are going to appear the nulls but in the end they should appear with data.

03:04:44.490 --> 03:04:44.810
There are the nulls.

03:04:45.350 --> 03:04:45.750
Done.

03:04:47.390 --> 03:04:49.870
No, in the Word document.

03:04:50.390 --> 03:04:52.990
Place a first point and place the two images.

03:04:53.030 --> 03:04:55.050
So that in the end they send it to me by email that file.

03:04:56.330 --> 03:04:59.230
Well then we go with the next point.

03:04:59.250 --> 03:05:01.010
Which is to create the dimension time.

03:05:02.290 --> 03:05:03.550
So for this.

03:05:03.970 --> 03:05:06.810
I leave this open to copy the pre-yog and the post-yog.

03:05:07.910 --> 03:05:12.190
So I'm going back to Casio Studio and I'm going to create a new job here.

03:05:12.290 --> 03:05:13.430
I'm going to call it Job.

03:05:14.350 --> 03:05:14.770
Perfect.

03:05:17.050 --> 03:05:17.730
Alejandro.

03:05:18.810 --> 03:05:18.890
Done.

03:05:19.990 --> 03:05:20.670
Beatriz.

03:05:21.610 --> 03:05:22.510
I'm going to my screen now.

03:05:24.970 --> 03:05:30.110
Well then I'm going from here to Casio Studio and I'm going to create a new job.

03:05:30.990 --> 03:05:31.550
Job.

03:05:32.090 --> 03:05:32.770
Underscore.

03:05:33.650 --> 03:05:34.330
Dimension.

03:05:35.450 --> 03:05:36.130
Underscore.

03:05:36.130 --> 03:05:37.350
Time.

03:05:38.170 --> 03:05:38.550
Done.

03:05:38.690 --> 03:05:41.130
I'm going to copy the pre-yog.

03:05:42.850 --> 03:05:45.650
With all the elements it has.

03:05:49.750 --> 03:05:52.130
And I'm going to copy the post-yog.

03:05:59.830 --> 03:06:02.830
And finally I copy the input.

03:06:02.890 --> 03:06:04.090
Let the input be the same.

03:06:04.090 --> 03:06:05.310
Input.

03:06:07.850 --> 03:06:09.410
Copy here.

03:06:10.730 --> 03:06:11.130
And here.

03:06:15.430 --> 03:06:15.910
Perfect.

03:06:16.130 --> 03:06:16.410
Done.

03:06:17.770 --> 03:06:19.190
Yes, I need the input.

03:06:19.550 --> 03:06:21.230
Wait a moment and I'll review the file.

03:06:21.550 --> 03:06:22.070
This team.

03:06:24.030 --> 03:06:25.890
Let's see how it is.

03:06:28.210 --> 03:06:28.990
Freelance.

03:06:29.270 --> 03:06:30.050
Clients.

03:06:31.530 --> 03:06:32.430
Team, team, team.

03:06:32.490 --> 03:06:33.050
Moment.

03:06:34.510 --> 03:06:35.770
Wait a moment please.

03:06:37.250 --> 03:06:38.890
Yes, I need the input.

03:06:39.090 --> 03:06:40.050
I'm going to help here.

03:06:41.970 --> 03:06:44.410
A T-Map.

03:06:44.950 --> 03:06:46.050
A T-Unit.

03:06:49.330 --> 03:06:49.590
Ok.

03:06:49.930 --> 03:06:51.410
A T-Unit.

03:06:53.870 --> 03:06:56.310
And I finish with another.

03:06:56.610 --> 03:06:56.670
Lie.

03:06:56.870 --> 03:06:57.490
Better here.

03:06:57.610 --> 03:06:58.810
A T-Unit back here.

03:06:58.930 --> 03:06:59.830
More efficient here.

03:06:59.850 --> 03:07:00.470
A T-Map.

03:07:00.750 --> 03:07:01.370
Nothing else.

03:07:01.370 --> 03:07:03.750
And a T-HDF-C output.

03:07:05.350 --> 03:07:06.350
And a T-High-Low.

03:07:15.870 --> 03:07:17.690
It would be in those components.

03:07:18.110 --> 03:07:18.450
Done.

03:07:18.650 --> 03:07:20.490
Let's proceed to connect the elements.

03:07:21.070 --> 03:07:23.370
The T-HDF-Input we connect with the.

03:07:24.970 --> 03:07:26.430
With the T-Unit-Rob.

03:07:27.310 --> 03:07:28.750
The T-Unit-Rob.

03:07:28.750 --> 03:07:32.130
We connect it with the Units with the T-Map.

03:07:32.510 --> 03:07:36.750
And we connect the T-Map with the HDF-C output that is here.

03:07:37.870 --> 03:07:38.910
I'm going to call it here.

03:07:40.310 --> 03:07:40.830
Output.

03:07:41.130 --> 03:07:41.650
Formatted.

03:07:44.110 --> 03:07:44.630
Time.

03:07:45.510 --> 03:07:45.930
Perfect.

03:07:46.050 --> 03:07:46.290
Done.

03:07:47.290 --> 03:07:48.930
Remember that at the data warehouse level.

03:07:49.490 --> 03:07:51.130
Normally one makes a time dimension.

03:07:52.130 --> 03:07:55.350
And that time dimension takes out the different times from the data.

03:07:55.350 --> 03:07:56.350
So it generates a.

03:07:57.510 --> 03:07:59.350
A key that we call a synthetic.

03:08:00.190 --> 03:08:01.630
Some of it depends on the author.

03:08:01.790 --> 03:08:02.870
There are authors that are different.

03:08:04.050 --> 03:08:05.790
It connects the two sub-jobs.

03:08:06.210 --> 03:08:06.890
Or a sub-jobs.

03:08:07.090 --> 03:08:08.410
As we did last time.

03:08:08.530 --> 03:08:09.010
A sub-jobs.

03:08:13.190 --> 03:08:15.250
Let's configure the table first.

03:08:16.790 --> 03:08:19.570
Remember that the character.

03:08:20.470 --> 03:08:23.410
Of the table separation with the character of the output.

03:08:23.410 --> 03:08:25.870
By default here I have it with point and comma.

03:08:26.030 --> 03:08:27.070
But there are some that have it with comma.

03:08:27.130 --> 03:08:29.670
So everyone decides how to do it.

03:08:30.010 --> 03:08:31.410
The table is going to be called times.

03:08:34.110 --> 03:08:36.710
And in the blur scheme that it has.

03:08:36.890 --> 03:08:38.430
And we are going to add some columns.

03:08:40.250 --> 03:08:41.410
Some columns then.

03:08:43.490 --> 03:08:44.470
The columns are the following.

03:08:45.890 --> 03:08:46.350
Identifier.

03:08:47.010 --> 03:08:48.370
Of whole type.

03:08:50.030 --> 03:08:50.490
Identifier.

03:08:54.870 --> 03:08:57.190
Whole type.

03:08:59.970 --> 03:09:00.550
Month.

03:09:02.390 --> 03:09:04.490
Of whole type.

03:09:09.870 --> 03:09:10.450
Month.

03:09:11.150 --> 03:09:12.370
Whole type.

03:09:16.450 --> 03:09:16.630
Year.

03:09:17.030 --> 03:09:18.730
But it is going to have a problem with the year.

03:09:18.730 --> 03:09:19.730
So I put annuality.

03:09:22.130 --> 03:09:23.350
Whole type.

03:09:29.390 --> 03:09:30.730
And I put now.

03:09:32.090 --> 03:09:32.790
Month.

03:09:34.030 --> 03:09:34.710
Trimester.

03:09:36.030 --> 03:09:36.550
Whole type.

03:09:39.790 --> 03:09:42.110
And finally, name of the month.

03:09:43.710 --> 03:09:44.850
Name of the month.

03:09:45.850 --> 03:09:46.670
Stream type.

03:09:46.670 --> 03:09:49.130
There are five fields.

03:09:49.670 --> 03:09:50.310
Identifier.

03:09:50.650 --> 03:09:50.910
Month.

03:09:51.330 --> 03:09:51.550
Annuality.

03:09:51.670 --> 03:09:52.150
And trimester.

03:09:52.250 --> 03:09:53.430
All those four that I mentioned.

03:09:54.130 --> 03:09:54.750
Whole type.

03:09:55.990 --> 03:09:58.050
And the name of the month.

03:09:58.410 --> 03:09:59.350
Defines it.

03:09:59.390 --> 03:10:00.030
Stream type.

03:10:00.150 --> 03:10:01.190
It gives it here ok.

03:10:01.730 --> 03:10:04.930
If the impulse took it from the previous job.

03:10:05.030 --> 03:10:05.850
There is nothing more to do.

03:10:06.150 --> 03:10:09.230
And we are going to configure then the Tejunik row.

03:10:12.210 --> 03:10:14.930
And here what it is going to do is that.

03:10:16.670 --> 03:10:17.130
Wait.

03:10:17.370 --> 03:10:18.590
What impulse did I take?

03:10:19.830 --> 03:10:20.190
What a shame.

03:10:20.270 --> 03:10:22.090
I took the sales impulse.

03:10:22.190 --> 03:10:23.870
It is the one of players.

03:10:24.110 --> 03:10:25.210
I think I am the input.

03:10:25.650 --> 03:10:27.870
It is this input here.

03:10:29.010 --> 03:10:29.650
I copied.

03:10:29.910 --> 03:10:30.950
It was the different job.

03:10:32.090 --> 03:10:32.670
I paste here.

03:10:35.670 --> 03:10:36.730
I connect here.

03:10:38.050 --> 03:10:40.870
And it is going to eliminate the duplicates of dates.

03:10:42.310 --> 03:10:42.950
So.

03:10:43.390 --> 03:10:44.970
It stayed the same.

03:10:44.970 --> 03:10:46.210
What was it that I took the same?

03:10:46.250 --> 03:10:46.970
One moment.

03:10:50.810 --> 03:10:51.350
Ok.

03:10:53.070 --> 03:10:56.630
This has that it did not synchronize me.

03:10:57.210 --> 03:10:58.190
It left me the previous one.

03:10:58.270 --> 03:10:59.970
So I'm going to eliminate all these that are here.

03:11:01.430 --> 03:11:04.570
I'm going to eliminate them.

03:11:07.970 --> 03:11:09.350
Because I was wrong.

03:11:09.610 --> 03:11:10.150
So.

03:11:10.450 --> 03:11:10.990
Ready.

03:11:12.890 --> 03:11:14.490
And I pass all these here.

03:11:16.650 --> 03:11:17.290
Now yes.

03:11:18.070 --> 03:11:18.270
Ok.

03:11:18.410 --> 03:11:18.970
I was wrong.

03:11:21.670 --> 03:11:22.970
So here you have to filter.

03:11:25.010 --> 03:11:25.890
The dates.

03:11:27.130 --> 03:11:27.630
Act then.

03:11:28.530 --> 03:11:30.770
Well, let's hope there are some that are still.

03:11:31.110 --> 03:11:31.550
Configuring.

03:11:32.630 --> 03:11:33.710
Well, those who are already.

03:11:33.710 --> 03:11:34.370
There are some that are still configured.

03:11:35.110 --> 03:11:35.550
Please.

03:11:36.750 --> 03:11:39.370
Connect the two subjobs that I escaped now with 11 subjobs.

03:11:39.570 --> 03:11:39.630
Ok.

03:11:39.690 --> 03:11:41.970
There are some that are still configuring some things.

03:11:43.190 --> 03:11:43.750
With.

03:11:44.190 --> 03:11:44.750
Sorry.

03:11:45.710 --> 03:11:49.070
Connect the HDF input with the Teja and Lua with a subjoke.

03:11:51.530 --> 03:11:52.130
And the date.

03:11:52.890 --> 03:11:53.490
Yes sir.

03:11:53.810 --> 03:11:54.210
The date.

03:11:56.450 --> 03:11:57.210
Nothing more the date.

03:11:57.330 --> 03:11:59.270
I want to filter the dates that are duplicates.

03:11:59.710 --> 03:11:59.930
Yes.

03:11:59.930 --> 03:12:01.430
Only from there.

03:12:03.550 --> 03:12:04.590
Let's go then.

03:12:04.770 --> 03:12:05.330
Yes.

03:12:05.330 --> 03:12:05.530
No.

03:12:05.550 --> 03:12:05.950
One moment.

03:12:06.810 --> 03:12:08.190
Who brings me, right?

03:12:09.310 --> 03:12:09.690
Yes.

03:12:09.690 --> 03:12:10.290
Let's go then.

03:12:10.550 --> 03:12:11.190
We are in.

03:12:11.330 --> 03:12:11.670
Where is it?

03:12:11.670 --> 03:12:14.610
It is in the Tejunic Road.

03:12:14.770 --> 03:12:15.290
Pass all.

03:12:15.790 --> 03:12:17.470
In the Tejunic Road you have to pass all.

03:12:17.850 --> 03:12:18.330
Exactly.

03:12:18.950 --> 03:12:19.770
You must accept.

03:12:21.150 --> 03:12:21.450
Now.

03:12:21.570 --> 03:12:22.150
What is there.

03:12:23.310 --> 03:12:24.270
And now.

03:12:24.790 --> 03:12:25.270
Yes.

03:12:25.590 --> 03:12:27.450
Go to Tejunic and nothing more.

03:12:27.570 --> 03:12:27.850
Cool.

03:12:28.230 --> 03:12:28.510
Date.

03:12:28.810 --> 03:12:29.170
Perfect.

03:12:29.410 --> 03:12:29.650
Ready.

03:12:29.770 --> 03:12:30.730
Well, let's do the following.

03:12:30.770 --> 03:12:31.590
We leave it here.

03:12:31.710 --> 03:12:34.330
We continue now between one hour while we have lunch.

03:12:35.410 --> 03:12:37.430
And stand up so that some finish some things.

03:12:37.430 --> 03:12:44.470
So we return in one hour and we finish so that I can send that by mail.

03:12:45.230 --> 03:12:46.430
So good luck then.

03:12:47.990 --> 03:12:48.610
Yes.

03:13:05.370 --> 03:13:07.210
Good afternoon again.

03:13:08.070 --> 03:13:09.670
I hope you already have a good time in your lunch.

03:13:10.150 --> 03:13:11.110
If we are connected.

03:13:11.410 --> 03:13:11.810
Perfect.

03:13:11.990 --> 03:13:13.810
We are almost finishing then.

03:13:14.230 --> 03:13:15.050
All ready.

03:13:15.050 --> 03:13:16.230
We are going to continue.

03:13:16.890 --> 03:13:19.690
With the second point of the study case.

03:13:20.390 --> 03:13:21.570
Which comes now.

03:13:22.170 --> 03:13:23.010
We already filter.

03:13:23.370 --> 03:13:26.150
We remove the duplicates of the dates.

03:13:26.910 --> 03:13:29.530
And now each date must have a specific code.

03:13:29.950 --> 03:13:31.250
Since the dates are unique.

03:13:31.690 --> 03:13:35.430
There will be no dates repeated there by the Tejunic Road that is dedicated to that.

03:13:35.710 --> 03:13:37.050
Let's go to the theme at some point.

03:13:37.230 --> 03:13:37.570
The theme.

03:13:37.610 --> 03:13:38.910
We are going to configure the theme.

03:13:40.930 --> 03:13:44.950
And we are going to define a sequence that will place us each date.

03:13:44.970 --> 03:13:47.370
Which is unique with a unique identifier.

03:13:48.670 --> 03:13:50.130
For this part then.

03:13:50.170 --> 03:13:53.390
We are going to define a new variable here.

03:13:54.510 --> 03:13:55.630
Of full type.

03:13:57.650 --> 03:13:58.910
That we are going to call identifier.

03:14:02.750 --> 03:14:03.310
Identifier.

03:14:03.970 --> 03:14:06.830
And now I could then with sequences.

03:14:07.010 --> 03:14:08.630
Which is an option that has the talent.

03:14:08.630 --> 03:14:09.750
Which we already did before.

03:14:10.090 --> 03:14:11.750
Let's go then to the expression.

03:14:12.170 --> 03:14:14.630
And we look for where it says talent in numeric.

03:14:16.170 --> 03:14:17.850
There is an option that says sequence.

03:14:17.890 --> 03:14:18.630
Which is the last.

03:14:19.430 --> 03:14:21.370
And there it specifies the name of the sequence.

03:14:21.370 --> 03:14:22.210
Where it starts.

03:14:22.410 --> 03:14:23.510
And with what increment.

03:14:24.750 --> 03:14:27.050
It would start in one with an increment of one then.

03:14:27.250 --> 03:14:28.350
I'm not going to make any changes.

03:14:28.670 --> 03:14:29.630
Sequence is called.

03:14:30.250 --> 03:14:31.490
It is the last option.

03:14:33.330 --> 03:14:35.630
It gives you double click and it comes out automatically.

03:14:36.550 --> 03:14:36.870
Perfect.

03:14:37.050 --> 03:14:37.470
Perfect.

03:14:37.470 --> 03:14:38.170
Okay.

03:14:39.930 --> 03:14:42.530
I need now then format the dates.

03:14:43.350 --> 03:14:44.670
The dates that are here.

03:14:45.170 --> 03:14:47.690
Go to the entry elements.

03:14:47.910 --> 03:14:48.770
That here is the date.

03:14:49.130 --> 03:14:50.830
Here it specifies the format of the date.

03:14:50.870 --> 03:14:53.170
In which it is DDMMJJJ.

03:14:53.770 --> 03:14:55.570
But now it's my turn then.

03:14:55.570 --> 03:14:56.530
Take out the year.

03:14:57.090 --> 03:14:58.370
Take out the number of the month.

03:14:59.250 --> 03:15:01.250
Take out the quarter.

03:15:01.470 --> 03:15:03.290
Then we are going to do it the following way.

03:15:04.410 --> 03:15:07.190
We are going to create a new variable.

03:15:08.830 --> 03:15:09.270
Entire type.

03:15:12.130 --> 03:15:14.290
And we are going to call it then annuality.

03:15:15.270 --> 03:15:16.950
For the problem of the year that we have.

03:15:17.930 --> 03:15:18.370
Annuality.

03:15:20.570 --> 03:15:23.930
And we are going to take out the year from the date it has as an entry.

03:15:24.030 --> 03:15:26.970
For this then we are going to look for a function.

03:15:27.910 --> 03:15:29.670
That is called.

03:15:29.670 --> 03:15:30.990
Go to Talent Date.

03:15:31.190 --> 03:15:34.050
The functions that are grouped in the Talent Date category.

03:15:34.610 --> 03:15:37.670
There is already an option that is called.

03:15:40.510 --> 03:15:42.030
Get Part of Date.

03:15:42.790 --> 03:15:43.470
It's called like that.

03:15:43.530 --> 03:15:45.570
The function is called Get Part of Date.

03:15:45.810 --> 03:15:47.730
Double click on that function then.

03:15:48.790 --> 03:15:52.130
And the function is telling you what modifiers you should send.

03:15:52.210 --> 03:15:53.610
Depending on what you want to take out.

03:15:53.630 --> 03:15:54.830
As I want to take out the year.

03:15:55.210 --> 03:15:56.870
I'm going to send you year in capital.

03:15:56.910 --> 03:15:57.890
I can copy it from here.

03:15:57.890 --> 03:15:58.110
The same.

03:15:58.730 --> 03:15:59.830
Of the help that is here.

03:16:00.210 --> 03:16:01.970
And where it says, Date of Weeb.

03:16:02.050 --> 03:16:03.270
I replace it by year.

03:16:05.530 --> 03:16:06.950
What do I want to get?

03:16:11.910 --> 03:16:12.390
Then.

03:16:12.990 --> 03:16:14.550
I'm going to erase the example that is there.

03:16:14.550 --> 03:16:15.750
Which begins from the T.

03:16:15.870 --> 03:16:16.790
To the parenthesis.

03:16:17.330 --> 03:16:18.730
To the first parenthesis that closes.

03:16:19.290 --> 03:16:20.690
And I would stay comma.

03:16:21.670 --> 03:16:23.270
Parenthesis that closes nothing else.

03:16:24.210 --> 03:16:25.090
That's how it would stay.

03:16:25.310 --> 03:16:27.010
Comma and parenthesis that closes.

03:16:27.890 --> 03:16:33.950
y busco en dentro de la variable de entrada busco la que dice road dos date le doy doble

03:16:33.950 --> 03:16:45.370
click y me aparece justamente donde está el cursor y ahí sacaría el año perfecto le

03:16:45.370 --> 03:16:50.350
okey voy a sacar ahora el número del mes entonces para el número de mes voy a crear

03:16:50.350 --> 03:16:58.450
nueva variable voy a usar la misma función talent day get part of day es más creo que

03:16:58.450 --> 03:17:05.930
me sale más rápido que copie la expresión de la variable anterior y la pegue acá abajo sale

03:17:05.930 --> 03:17:20.330
más rápido y lo único que voy a cambiar es donde dice gear voy a colocar mom en inglés y en

03:17:20.330 --> 03:17:27.850
la expresión de la del año y pegarlo acá en la nueva variable que volverla configurar

03:17:27.850 --> 03:17:37.110
nuevamente a esto le damos okey vamos a cambiar el tipo sí dígame el tipo de datos sería entero

03:17:37.110 --> 03:17:47.390
primero porque la tal en devuelve un entero y el nombre sería mes me piden ahora el cuartel

03:17:47.390 --> 03:17:52.730
y para sacar el cuartel voy a hacer una función ya bastante antigua entonces voy a crear una nueva

03:17:52.730 --> 03:18:00.510
variable y voy a copiar la misma expresión del mes la copio y la pego en la expresión de acá

03:18:05.770 --> 03:18:12.550
vamos a ver dele hay más allá arriba más un momento un momento a ver qué pasa

03:18:14.110 --> 03:18:21.190
vuelvo a abrir nuevamente el tema no sé qué pasaba listo entonces para esto hacemos lo

03:18:21.190 --> 03:18:27.290
siguiente la función cuál sería vamos a restarle uno a la fech al mes que estoy

03:18:27.290 --> 03:18:34.190
sacando menos uno o copie brenda en su caso copie la expresión del mes en el talent day

03:18:34.830 --> 03:18:38.790
copia el anterior listo perfecto restele uno

03:18:41.690 --> 03:18:51.750
restele uno encierre entre paréntesis divida entre cuatro y sume le uno y esto saca esto

03:18:51.750 --> 03:18:59.070
me da el trimestre por ejemplo si tengo 11 entre cuatro entre cuatro daría dos

03:18:59.930 --> 03:19:05.590
sí no acuerdo si la función es entre tres o entre cuatro por un momento en la

03:19:05.590 --> 03:19:12.290
busco la función de se calcula la función entonces no sé si la función es para ver

03:19:12.290 --> 03:19:24.910
11 entre cuatro más uno no daría de quedar cuánto tenés es vamos a ver en 53 que pena

03:19:24.910 --> 03:19:32.830
entre tres no entre cuatro entre tres más un si entre tres más un

03:19:34.810 --> 03:19:44.510
esta función no sé si la estoy en el momento de buscarla apuntar rápidamente aquí a la

03:19:44.510 --> 03:19:55.230
ya dame la función a partir del mes que me del trimestre momento que no sé si me está dando

03:19:55.230 --> 03:19:57.610
es trimestre

03:20:08.590 --> 03:20:18.350
cuarto trimestre si señor esto sí está bien y listo sí entre tres entonces listo si perfecto

03:20:20.990 --> 03:20:29.370
ok y faltaría sacar aquí sería entero y sería aquí trimestre

03:20:31.330 --> 03:20:39.550
y creo que nada me falta el nombre del mes agrega aquí una nueva función pero aquí sí nos

03:20:39.550 --> 03:20:45.090
toca hacer una diferente porque la función está que par no tiene para obtener el nombre

03:20:45.090 --> 03:20:51.870
del mes entonces entre a la expresión por favor y va a buscar dentro de tal en date una

03:20:51.870 --> 03:21:07.370
función que se llama forma de y esta forma de este forma de forma de forma de entonces

03:21:07.370 --> 03:21:13.510
aquí en forma de donde está el formato que está ye ye ye vas a mandar como formato y

03:21:14.530 --> 03:21:20.950
4 m en mayúsculas que indique que quiero el nombre del mes m m m m 4 m en mayúsculas

03:21:23.650 --> 03:21:30.210
y en la opción que dice my day mandale como parámetro la variable que se llama road day

03:21:30.210 --> 03:21:37.550
la borra si tú haces el cursor en donde quieres le das doble clic a la ir creo que ya tenemos

03:21:37.550 --> 03:21:43.970
este punto aquí casi listo vamos a la que ok y antes de enviarlo de salida a perdón el

03:21:43.970 --> 03:21:52.770
nombre va a ser aquí nombre mes nombre mes se llama la variable y para ver el orden voy a

03:21:52.770 --> 03:21:56.410
ver cómo creer la tabla porque tengo que dar la misma orden la tabla está creada primero

03:21:57.010 --> 03:22:04.430
identificador número de mes anualidad trimestre y nomes el mismo orden entonces sería de esta

03:22:04.430 --> 03:22:18.430
manera mandaría primero va identificador la segunda columna va mes tercera anualidad

03:22:23.570 --> 03:22:30.670
cuarta trimestre y quinta nombre del mes y de nuevamente el orden identificador

03:22:30.670 --> 03:22:37.230
mes anualidad trimestre y nombre del mes porque lo coloque así porque como voy a

03:22:37.230 --> 03:22:41.890
mandarla para que cargue el orden de los columnas tiene que ser igual a como definir

03:22:41.890 --> 03:22:46.090
la tabla y mande otro elemento aquí que se me escapó en la tabla que lo agrego

03:22:46.090 --> 03:22:50.270
ahora es sumamente importante que es la fecha para poder hacer un join ahí con esa fecha

03:22:51.650 --> 03:22:58.010
que se me olvidó de regar la tabla sería entonces identificador mes anualidad trimestre

03:22:58.010 --> 03:23:08.370
nombre el mes y date date como último campo entonces perfecto listo vamos a darle aquí ok ok

03:23:09.610 --> 03:23:14.050
agregue por favor en el texay create table agregue una nueva columna que se me escapó que

03:23:14.050 --> 03:23:22.610
era en el esquema agregue una nueva columna que se llame fecha y es de tipo date el tipo

03:23:22.610 --> 03:23:28.530
es tipo date ok

03:23:30.470 --> 03:23:35.470
bueno configuremos entonces fecha tipo de ahí ya nos falta el

03:23:35.470 --> 03:23:41.010
thdfs output entonces configurémoslo usar conexión existente

03:23:44.670 --> 03:23:49.850
listo entonces vamos al nombre archivo busque un nombre archivo que ya esté después lo

03:23:50.170 --> 03:23:56.290
llamamos en aksa el nombre archivo lo va a llamar tiempos.sv

03:23:59.290 --> 03:24:09.530
listo perfecto en el hay create table texay create table agregue un nuevo elemento que

03:24:09.530 --> 03:24:20.110
sería llámelo fecha de tipo date fecha de tipo date tipo date nada más fue ahí se

03:24:20.630 --> 03:24:25.310
estamos ahora en el output listo vamos en el output estamos confiando el output

03:24:26.570 --> 03:24:35.230
y el output sería entonces el nombre lo mande hacia aksa tiempos.sv le doy que sobrescriba

03:24:36.230 --> 03:24:41.450
y confirme que su separador de fila coincida con el separador que tiene en el create table

03:24:41.450 --> 03:24:46.690
yo por defecto tengo punto y coma y acá lo dejé en punto y coma y le dice que incluye

03:24:46.690 --> 03:24:54.970
perfecto entonces copie por favor la el nombre archivo del output para colocarlo

03:24:54.970 --> 03:24:57.370
con el texay load copiar

03:25:02.050 --> 03:25:09.570
vete a tu texay load en el file bueno usar conexión existente recuerde colocar aquí

03:25:10.630 --> 03:25:15.410
el nombre la conexión en el path y el nombre a tala se llama tiempos

03:25:18.270 --> 03:25:24.830
perdón se ve que venga vamos no vamos a configurar ya el texay load pegue por favor el

03:25:24.830 --> 03:25:32.370
path de la salida del output que sería aksa tiempos.sv y el nombre a tablas se llama tiempos

03:25:37.770 --> 03:25:39.690
bueno aquí creo que no hay que hacer más nada

03:25:42.690 --> 03:25:47.550
voy a probar ejecutar para ver si de pronto no tengo ningún tipo de error

03:25:50.650 --> 03:25:56.910
bien al parecer no hubo ningún error voy a ejecutar acá select asterisco frontiempos a ver

03:26:15.890 --> 03:26:21.270
bueno ahí están con los nombres noviembre listo voy a ver quién tiene problemas lo que les

03:26:21.270 --> 03:26:27.370
salió bien por favor recuerde punto número 2 un print screen del job y un print screen de la

03:26:27.370 --> 03:26:34.390
salida alejandro voy a ver qué pasó contigo a ver vamos a ver cuando ese es un error de

03:26:34.390 --> 03:26:41.190
compilación entonces para lo más seguro vete al si está en el texay load parece que hay una

03:26:41.190 --> 03:26:46.830
comilla doble que parece que leí para ver mira al inicio colocaste dos comillas dobles

03:26:48.370 --> 03:26:51.710
quítale una por favor ok eso creo que es el error ejecuta para ver

03:27:02.610 --> 03:27:07.350
perfecto dale un select asterisco por favor a la a la tabla tiempo y deberías tener los

03:27:07.350 --> 03:27:14.610
datos allá parecer fernanda creo que tiene un error voy a ver para ver bueno vamos a ver

03:27:14.610 --> 03:27:21.510
ese error de compilación vayase ok vayase al co para que lo marque le marque exactamente

03:27:21.510 --> 03:27:35.250
donde sacó que problema que tiene con su intercom no sé no sé por qué bueno vayase

03:27:35.250 --> 03:27:41.530
el punto rojo que tenemos ahí que le marca un error de compilación dice

03:27:42.990 --> 03:27:50.070
si quiere dele click acá en el en el rojito que tiene la izquierda si le coloca el mouse ahí

03:27:50.070 --> 03:27:55.970
le da click ahí de ese que tiene no puede convertir entero a string a seguramente el valor de la

03:27:55.970 --> 03:28:08.510
variable la colocó string vayase al tema y la la trimestre es entera pero espera un momento

03:28:08.510 --> 03:28:13.930
que aquí hay un problema cuando lo hago de aquí en entero y ahora enter de ley por favor ahora el

03:28:13.930 --> 03:28:17.990
trimestre de la salida o sea del lado derecho tiene que convertirlo también a entero en la

03:28:17.990 --> 03:28:22.350
parte de abajo están los mismos nombres de arriba que lo cambia ahí dice string y cambia

03:28:29.130 --> 03:28:30.550
intero dele para ver ejecutar

03:28:35.590 --> 03:28:41.350
por el creo que pronto colocate es colocate sobre escribir al té al loco de pronto que

03:28:41.350 --> 03:28:45.830
saca que se ha ido a loco en la acción dale sobre escribir porque parece que como que está

03:28:45.830 --> 03:28:52.290
intentando en el caso tuyo fernando a darse la catérica no era como con jaca con jorge en

03:28:52.290 --> 03:28:58.670
la que está rico tu fernanda ahora en la en la por favor en ahí en la acción jorge

03:28:58.670 --> 03:29:03.950
darle o sobre escribir ejecuta nuevamente para ver dale se la que está rico por favor fron

03:29:05.410 --> 03:29:14.970
tiempos escribe que él se colocan aparte abajo hay un pequeño error ahí vamos a ver

03:29:14.970 --> 03:29:19.690
dale que ir un momento ahí no el rollo de ejecución de beta el designer momento

03:29:19.690 --> 03:29:26.510
jorge qué pena contigo fernando te lo tiene bien voy con jorge entonces voy contigo ahora a ver

03:29:28.170 --> 03:29:31.090
cerca de y vete un momentico al tejai crédito por un momento

03:29:32.730 --> 03:29:36.570
le doy click ahí un momento a tu editar esquema

03:29:40.310 --> 03:29:46.850
ahí no sé por qué hubo un problema ahí ese tipo está bien pero el tipo de antes o sea el

03:29:46.850 --> 03:29:52.710
tipo que está ahí tenía que estar de ahí no sé por qué creo que lo cambiaste fue directamente

03:29:52.710 --> 03:29:57.810
en el debe tai si lo cambia el debe tai no te cambia el tipo de llave entonces ahora lo que

03:29:57.810 --> 03:30:00.730
no sé si creó la tabla entonces vuelve a ejecutar ahora se vuelve a ejecutar para ver

03:30:06.090 --> 03:30:09.250
no creo que yo me lo espero un momento y me conecto porque

03:30:11.370 --> 03:30:14.770
pero deja que termine un momento haz lo siguiente

03:30:17.650 --> 03:30:22.690
espérame y mira para ver si puede ver a mí me conecto interactivo para ver una configuración mejor

03:30:26.310 --> 03:30:33.230
ah ok aquí está un problema y creo que el inconveniente que tiene es que este campo

03:30:33.230 --> 03:30:37.790
no deja que sea no que se llame de y porque parece que es para la reservada

03:30:38.590 --> 03:30:42.670
un momento no no creo que vamos a ver para ver si es eso

03:30:45.530 --> 03:30:51.910
si deis lo tenía como para la reservada entonces hay y entonces ahí tenía el problema

03:30:51.910 --> 03:30:56.270
bueno ya está copiando vamos a ver

03:30:56.270 --> 03:31:01.730
tiempo sería tiempo tiempos creo que se llamaba tiempo perfecto listo copia si

03:31:02.370 --> 03:31:06.450
ahí tienes un problema con la última null de a tu momento yo no soy inconveniente ahí

03:31:07.550 --> 03:31:13.170
vete por favor ese no lo necesitamos ahora vete a tu a tu tema un momento a tu tema

03:31:17.710 --> 03:31:18.310
listo

03:31:19.770 --> 03:31:26.050
ahí está bien ok dale ok un momento ahí vete a tu hdfs output un momento

03:31:28.270 --> 03:31:31.510
y dale editar ahí ahí está bien también

03:31:32.310 --> 03:31:37.410
permítame yo tomarlo con todo el momento a ver y este es el hdfs que está acá

03:31:44.650 --> 03:31:50.010
si está bien vamos a hacer lo siguiente voy a borrar la tabla un momento drop table para

03:31:50.010 --> 03:31:56.470
que lo acabe nuevamente a ver si el problema sigue siendo ese perfecto listo vamos a ver

03:31:56.470 --> 03:31:58.570
entonces vamos a ejecutar nuevamente ese job tuyo

03:32:05.530 --> 03:32:10.050
está bien todo no lo veo nada extraño vamos a ver si

03:32:14.350 --> 03:32:21.250
la volve a borrar nuevamente pensé que estaba el cdk ahí que raro

03:32:25.970 --> 03:32:30.730
ah no no alcanzamos a ver verdad que sí vamos a ver si de pronto espera que termine

03:32:32.510 --> 03:32:38.370
está en nulo a los demás no salió nulo verdad yo a mí no no recuerdo que confirmó para ver yo no

03:32:38.370 --> 03:32:45.110
ah no pero es que yo también a un momento ah espera un momento vamos a correr a ver a ver

03:32:46.090 --> 03:32:50.490
porque está nulo es que todos tenemos nulo vamos a correr un momento que sin eso es vital

03:32:50.490 --> 03:32:51.970
para hacer la tabla de hechos

03:33:04.570 --> 03:33:11.030
no no creo que sea por el nombre porque es por posición del de los datos que no sé

03:33:11.030 --> 03:33:19.390
yo no no no no revisé si yo tenía también salió nulo entonces vamos a ver a a ti se salió

03:33:19.390 --> 03:33:24.190
ah ok bueno ahora vamos a buscar el caso tuyo para ver porque a ti sí y a nosotros no

03:33:24.190 --> 03:33:26.310
no sé te haces confirmar porque cerré aquí el doc

03:33:32.930 --> 03:33:37.270
y este docker tiene un pequeño problema en esta versión de windows que cuando

03:33:37.270 --> 03:33:48.050
cierro la ventana de forma bruta no me la abre más ah pero es un momento

03:33:56.530 --> 03:33:57.010
ah

03:33:57.010 --> 03:34:02.190
en el caso tuyo un momento a ver fernanda es porque a ti sí y al demás no que

03:34:03.810 --> 03:34:10.490
voy a entrar a tu ventana para ver fernanda no en realidad a ti te falta una campo fernanda

03:34:10.490 --> 03:34:16.130
que es la fecha el último campo 10 5 son 6 si vamos a vamos a comprobar tu yo que un momento

03:34:17.790 --> 03:34:24.610
y el te hay que ir aquí debería tener si aquí falta un campo creería que es

03:34:26.230 --> 03:34:31.650
momento a ver dele un descrae la tabla por favor y mire qué tipo de datos lo le apareció

03:34:31.650 --> 03:34:38.110
que tengo un problema acá dele des haga lo siguiente váyase al váyase si es tan amable

03:34:38.110 --> 03:34:51.070
al al contenedor y coloque lo siguiente des des en este caso sería tiempos.com no la fecha está bien este tipo date

03:34:54.390 --> 03:35:00.430
si creo que va a ser el problema es que yo creo que haga lo siguiente para ver cómo esto es

03:35:01.480 --> 03:35:04.440
del siguiente select current

03:35:06.200 --> 03:35:09.960
con the record date el así para ver creo que esto con

03:35:11.380 --> 03:35:15.400
no recuerdo si es con el así para ver current date

03:35:16.280 --> 03:35:22.440
para ver qué formatos tiene por defecto año mes día entonces

03:35:25.980 --> 03:35:29.980
voy a ver este day y este day está en

03:35:31.040 --> 03:35:39.060
día mes año por favor aquí en el data pattern este del del tema cambiela el orden aquí para

03:35:39.060 --> 03:35:47.880
que sea dd m m y y dd guión m m mayúscula dd ok

03:35:50.760 --> 03:35:56.560
vamos a dar aquí yes vuelvo a ejecutar yo haría un docker sec

03:36:17.580 --> 03:36:18.980
docker

03:36:29.260 --> 03:36:30.660
docker

03:36:30.660 --> 03:36:37.020
And now, in the last ones, if I add them in all of them.

03:36:38.340 --> 03:36:43.080
Oh, well, then, that error is already a moment.

03:36:44.420 --> 03:36:46.060
So there is a problem that you have to format that date.

03:36:47.140 --> 03:36:50.020
The problem that we had then was that the date,

03:36:50.880 --> 03:36:52.680
the format of the date that I am sending is not the same

03:36:52.680 --> 03:36:56.020
format that the problem that is causing already has.

03:36:56.120 --> 03:36:59.080
But for a moment there, I do something here, a moment,

03:36:59.080 --> 03:37:06.280
please, this is here in documents, just a moment, please.

03:37:18.760 --> 03:37:24.320
Hey, I connect here, I think I have it here, I have already seen it.

03:37:24.340 --> 03:37:28.240
Well, it is clear that the format of the date does not like the

03:37:28.500 --> 03:37:31.840
date, then I have to define a different format.

03:37:33.640 --> 03:37:34.820
Ready to copy.

03:37:37.840 --> 03:37:38.820
Wait a minute.

03:37:45.880 --> 03:37:53.160
Select a 30 of them.

03:38:08.540 --> 03:38:19.440
Okay, let's do something then. Let's turn that date into a stream, better. Wait a moment,

03:38:19.440 --> 03:38:23.840
I'm thinking right. What happens is that I don't know if this format, as it is entering me in this way,

03:38:25.620 --> 03:38:32.360
when I pass it through here I lose the format completely. I'm going to run it again to see what

03:38:32.360 --> 03:38:39.500
it is giving me, how it is saving those data. Can you tell us please that you sent it as

03:38:39.500 --> 03:38:53.580
data? Okay, and the last one because it didn't work, let's see what happened.

03:39:08.610 --> 03:39:13.770
Wait a moment, let's see, I'm going to see the original file to see what's going on.

03:39:14.430 --> 03:39:20.530
In what format is the original file? Wait a moment. Ah, the original,

03:39:20.530 --> 03:39:35.290
it's a bit messy, so there's no problem. Here we have a problem, that is, the problem is in format, so

03:39:38.170 --> 03:39:44.410
that's it, I understand. So, let's do the following, let's do the following then,

03:39:44.410 --> 03:39:50.890
let's go to the input, the input, let's edit the input-input scheme and we are going to change

03:39:50.890 --> 03:40:02.330
the format from here in, the format we are going to read it then here in yy-mmdd.

03:40:08.610 --> 03:40:15.070
Okay, let's see if I changed the unit, they all change immediately. Yes, I think so, yes, they all change.

03:40:16.730 --> 03:40:25.490
And in the theme, let's see. Yes, then let's do that. If you want, please give a delete

03:40:25.490 --> 03:40:32.790
to the data in the table, give it delete from, delete from times and run again to see.

03:40:34.990 --> 03:40:39.830
Ah, no, this command doesn't leave me with delete, it leaves me with, wait a moment, where is the command?

03:40:40.450 --> 03:40:47.210
But hey, if it executed it, take the screens there, this man to save it is not with delete.

03:40:51.330 --> 03:40:56.110
Well, I'm going to do a drop table better and run it again, give it a drop table, times,

03:40:57.830 --> 03:41:02.310
of speed because of the time that it is no longer consuming, delete it and run it again.

03:41:05.030 --> 03:41:10.910
And run everything again then, in this case I also run it again.

03:41:13.330 --> 03:41:15.730
And that conversion was strange then.

03:41:20.510 --> 03:41:25.830
Perfect, I think it's over, now if it's going to be a select, I'll select it.

03:41:28.110 --> 03:41:31.490
Well, in my case, I'm still null, I don't know why.

03:41:36.410 --> 03:41:42.990
DDD, okay, I don't know if I'm going to be able to relate to what it was. Well, I don't know, if the dates came out,

03:41:42.990 --> 03:41:47.310
we are going to be null, we are going to see what happened then.

03:41:52.390 --> 03:42:00.190
Ah, of course, edit the scheme, let's go then, the format here would be, we don't change it here, right?

03:42:01.890 --> 03:42:07.510
We are going to change it here then, let's see, here it would be,

03:42:11.490 --> 03:42:20.830
yymmdd, I don't know if, and this man, the dbtie has to be here date, I don't know why it appeared when it was here,

03:42:21.570 --> 03:42:28.590
here it has to be date, date, date and in this format, I don't know why I changed it there,

03:42:29.990 --> 03:42:33.930
delete the table again to see, because if not, it will not create with the changes.

03:42:37.490 --> 03:42:49.690
And ok, and we run again to see, you have to delete the table because if the change later does not affect, it does not take into account the change.

03:43:02.230 --> 03:43:10.810
Effectively, of course, now we are going to see, Beatriz, now we are going to see, let's go to the changes with you, Beatriz, to see them first.

03:43:10.810 --> 03:43:24.610
First change, let's go to the HDFS input, let's check the changes, go to HDFS input and verify that the format is yymmdd, in the scheme, perfect,

03:43:25.430 --> 03:43:32.430
yymmdd, enlarge a little more to confirm that it is well written, perfect, ok, then go to the test high create table,

03:43:34.290 --> 03:43:39.590
the one above, the create table, that one that is there, and give it to edit the scheme, please.

03:43:41.930 --> 03:43:46.350
Change there, that dbtie, I don't know why it changed, it must be date, not time-stan,

03:43:47.410 --> 03:43:55.290
that time-stans that it has next to the last row, it appears dbtie, in the column dbtie, time-stan, change it to date,

03:43:56.810 --> 03:44:06.570
I don't know why it changed to time-stan there, and now change the format to that one that is there, change the format there to yymmdd,

03:44:10.190 --> 03:44:25.770
yymmdd, ok, now go to the docker, and give it drop, space, table, times,

03:44:28.190 --> 03:44:30.810
I will execute the joe again if it is so kind,

03:44:38.430 --> 03:44:45.090
I don't think it needs anything there, I don't see anything there that it needs, execute to see,

03:44:55.690 --> 03:45:01.170
we are going to leave up to that second point, please save the document and send it to me

03:45:01.170 --> 03:45:05.810
that mail that I sent out there, please, I confirm it when I get there, I sent it by

03:45:05.810 --> 03:45:08.810
checking the mail, save the document up to point number 2, what they have already finished,

03:45:08.810 --> 03:45:17.890
go to the docker again, there it is, thank you very much, it was an interesting course,

03:45:18.210 --> 03:45:22.670
because you helped me a lot, that's why working with people who know how to do it is extremely

03:45:22.670 --> 03:45:29.410
interesting, well, there I sent you the mail so that you can send me that document that you worked on,

03:45:29.810 --> 03:45:35.650
they are two points that are going to have nothing else, that has to send me, yes, tell me,

03:45:40.090 --> 03:45:44.430
of course, of course, yes, of course, send it to me, but send it to me separately,

03:45:44.630 --> 03:45:53.850
please, so that later you confuse me, this is what we did now, perfect, ready,

03:45:54.030 --> 03:46:03.770
good question, good question, so look what it's going to be, it goes to where the talent is installed,

03:46:03.770 --> 03:46:09.490
which is the program file, one moment, let's see if I left it there, tost bd,

03:46:11.490 --> 03:46:19.330
and here there is a subfolder called workspace, enter workspace, and here is the project

03:46:19.330 --> 03:46:26.810
called local project, so imagine that you changed the project to project axa,

03:46:27.150 --> 03:46:32.850
in the end what you are going to do nothing else to have all the copy is to copy everything

03:46:32.850 --> 03:46:40.830
that is here within the name of your project and there it automatically recreates everything,

03:46:41.190 --> 03:46:46.090
look here, compress this folder, which is the important thing, and the content of that folder is the one that

03:46:46.090 --> 03:46:50.490
is going to paste in the project that you defined, by default local project is created, but everything

03:46:50.490 --> 03:46:55.630
that is inside is pasted into the local project and automatically you have all the exercises

03:46:55.630 --> 03:46:59.310
that we did there, yes, that too without needing the tools, exactly, well, here then,

03:46:59.310 --> 03:47:02.150
let me know when you send it, I will confirm it, please, the email,

03:47:04.050 --> 03:47:08.350
close it to have a copy of the exercises that we did, so if you want to send it,

03:47:08.350 --> 03:47:15.110
close your email and when you have the tool you can again recover those examples that we

03:47:15.110 --> 03:47:19.790
did in class, what you see send me by email is the doc document where it has point number 1

03:47:19.790 --> 03:47:27.750
and point number 2, which is the valuable one, the compressed one is to have a copy of the

03:47:27.750 --> 03:47:34.670
example that we did, call me in your house, here I got González Puentes and Pesquera

03:47:34.670 --> 03:47:42.350
Rodríguez, I already got it ready, I sent it here through the chat, there in that email you can

03:47:42.350 --> 03:47:47.270
write to me for any questions, you are going to write to me for the good practices and any

03:47:47.270 --> 03:47:52.510
questions you have, please write to me there, here I got Sánchez, Sánchez Rosas Beatriz,

03:47:52.510 --> 03:48:03.550
yes, I also got it, Jimena arrived and I got Jiménez Ramos and Rosales Trejo, perfect,

03:48:03.650 --> 03:48:09.090
I think they all sent it, right? I think they all sent it, well, without a doubt, I did not

03:48:09.090 --> 03:48:12.630
make a mistake in the introduction when I said that it was a transfer of knowledge of this

03:48:12.630 --> 03:48:17.510
account that many of you helped me in some problems and I really loved having

03:48:17.510 --> 03:48:21.870
worked with you, I wish you the best, any questions write me, I am willing to

03:48:22.510 --> 03:48:28.690
if you have any questions about the topic of such or what things write to me and if I have the

03:48:28.690 --> 03:48:34.570
opportunity to continue the knowledge, I will gladly share it with you, see you then,

03:48:35.950 --> 03:48:40.570
see you then, it has been a pleasure, I wish you the best, happy rest of the day then,

03:48:40.750 --> 03:48:51.070
happy weekend and have a great time then, yes, tell me, Muniz has not arrived, Muniz,

03:48:51.070 --> 03:48:56.030
where did you send it? It has not arrived yet, did you just send it or did you send it? González Fuentes,

03:48:58.570 --> 03:49:03.150
Pesquera Rodríguez, Sánchez Rosas, Sánchez Rosas sent it to me, ah, he sent me another question, Jiménez Ramos,

03:49:03.190 --> 03:49:09.290
no, it has not arrived, did you send it to John Jaime? Yes, let's see if I have it, I don't think I have it,

03:49:10.150 --> 03:49:16.850
in Spain I don't think I have it, no, send it back again to see, sometimes it takes a while, but

03:49:17.410 --> 03:49:25.330
share the screen, please, to see the name of the email, let's see if I'm going, yes, let's see,

03:49:31.050 --> 03:49:38.030
send it back, please, yes, ah, ready, yes, what a strange question,

03:49:38.030 --> 03:49:47.930
how strange, I don't know why it's strange, because they were ordered, I gave it F5 and he put it in the third,

03:49:48.750 --> 03:49:54.990
that is, he inserted it and it happened twice, not twice, well, yes, it has already been confirmed,

03:49:55.590 --> 03:49:58.870
well, I wish you the best, great group, have a great time,