
Антон
Closed
Project title: Parsing information from a website
Type of cooperation: Periodic employment
Section: Web development, Web programming
Prepayment: prepayment is possible
Payment methods: Cash, Bank transfer
Acceptance of requests: closed
Project description:
Remote work. The task is to collect information from a car website.
1. Parse the technical specifications of cars from the Auto_ru website.
2. Do not parse the ads! Only technical specifications are needed (numbers, letters, symbols).
3. Technical specifications means: car brand, car model, generation, year of release, engine volume, engine type, transmission type, number of gears in the gearbox, drive, body.
4. The specifications must be entered into our Excel template, in the specified fields (your own version of the template will not be considered).
5. There will be 39 car brands in total, taken from the full list on the Auto_ru site.
6. Each car brand goes into its own Excel file. No single file will contain all 39 brands. For example, the Kia file contains only the technical specifications of Kia.
7. Each Excel template contains 6 sheets, divided into 3 types: automatic transmission, robotic transmission, variator (CVT).
8. Dear freelancers, we do not need a program and do not plan to run one on an ongoing basis. We need finished, filled-in Excel files that follow our template.
9. Detailed parsing steps, and where to find the information on the Auto_ru website, using the Kia Sportage as an example:
9.1. Auto_ru - https://Auto_ru/. This page lists the car brands. The “All brands” button reveals the full list of brands;
9.2. The Kia brand - https://Auto_ru/cars/kia/all/. This page lists the Kia models. The “All models” button reveals the full list of Kia models. Some of the specifications to parse are already visible at this stage: car brand, car model, generation, and the year of release in the Generation field;
9.3. With the Kia brand and the Sportage model chosen, select the generation 3 restyling 2014-2016 - https://Auto_ru/cars/kia/sportage/all/?sort=fresh_relevance_1-desc. This page shows all generations of the model, the years of release, and the body number (counting up from 1). Important! The word “restyling” must also be parsed;
9.4. Once the brand Kia, the model Sportage, and the generation 3 restyling 2014-2016 are selected, the fields are filled with these values - https://Auto_ru/cars/kia/sportage/20101920/all/?sort=fresh_relevance_1-desc. Click the "Show" button, then click the “Catalogues” button. At the bottom, click the photo of the car - https://Auto_ru/catalog/cars/kia/sportage/20101920/, then click the "Characteristics" button - https://Auto_ru/catalog/cars/kia/sportage/20101920/20101923/specifications/;
9.5. The "Characteristics" page contains all the technical specifications: car brand, car model, generation, year of release, engine volume, engine type, transmission type, number of gears, drive, body;
9.6. Specifications repeat within "Characteristics"; they must not be duplicated in the Excel template. This rule applies to the following specifications: engine type (gasoline or diesel), engine volume (liters), engine power (hp, horsepower), drive (front, rear, or all-wheel), transmission type (automatic, manual, robot, or CVT);
Take the trim levels as an example. There are 4 trim levels: Premium, Comfort, Luxe, Prestige. All of them have the same engine (type, volume, power), drive, and transmission type:
Engine type - Gasoline
Engine volume - 2.0
Engine power - 150
Drive - all-wheel (4x4)
Transmission type - Automatic
We need only the non-duplicated specifications listed in the Excel template: car brand, car model, generation, year of release, engine volume, engine type, transmission type, number of gears, drive, body.
9.7. Fields in "Characteristics" that duplicate one another:
Fuel = Engine type (2 times) = Fuel brand
Box = Transmission
Drive = Drive type
9.8. The Excel template has a “Country of the brand” field. Fill it with “USA” only if Auto_ru shows “USA” under “Characteristics” – “General information” – “Country of the brand” (that is, for American car brands);
10. The Excel template will be attached for reference, along with a screenshot of what a filled-in file looks like:
10.1. The template may not have enough columns, so they need to be added automatically.
10.2. Columns whose number can grow: Generation, Year (year of release), Volume (engine volume), Number of gears (number of gears in the gearbox), Drive (drive type), Body (body type). Important! Variators have no gears;
10.3. Cars come with 4 engine types: gasoline (petrol), diesel, gas, and hybrid (electric), so each generation of a car will have from 2 to 4 tables;
10.4. In the Excel template, the “Reference to Characteristics” field is filled with the Auto_ru link to the “Characteristics” page – https://Auto_en/catalog/cars/kia/sportage/20101920/20101923/specifications/20101923_20101936_20101927/
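To illustrate the no-duplicates rule from item 9.6, here is a minimal Python sketch. It is only an illustration of the expected result, not a required tool. It assumes the trim-level rows have already been collected by hand or otherwise. The names Spec, unique_specs, and group_by_transmission are hypothetical, and the extra "Robot" row is invented purely to show the grouping by transmission type; it is not real Auto_ru data.

```python
from collections import defaultdict
from dataclasses import dataclass, replace
from typing import Dict, Iterable, List, Optional, Tuple


@dataclass(frozen=True)
class Spec:
    """One template row (hypothetical layout). Frozen, so it is hashable."""
    brand: str
    model: str
    generation: str
    years: str
    engine_type: str
    engine_volume: str
    power_hp: int
    drive: str
    transmission: str
    gears: Optional[int] = None  # stays empty for variators (CVT)
    body: Optional[str] = None   # the posting gives no example value


def unique_specs(rows: Iterable[Tuple[str, Spec]]) -> List[Spec]:
    """Keep the first occurrence of each spec; the trim name is ignored."""
    seen = set()
    result = []
    for _trim, spec in rows:
        if spec not in seen:
            seen.add(spec)
            result.append(spec)
    return result


def group_by_transmission(specs: Iterable[Spec]) -> Dict[str, List[Spec]]:
    """Split rows by transmission type, one group per sheet type."""
    sheets = defaultdict(list)
    for spec in specs:
        sheets[spec.transmission].append(spec)
    return dict(sheets)


# Values from the Kia Sportage example in item 9.6.
base = Spec("Kia", "Sportage", "3 restyling", "2014-2016",
            "Gasoline", "2.0", 150, "4x4", "Automatic")

# Four trim levels that share the same engine, drive, and transmission.
rows = [(trim, base) for trim in ("Premium", "Comfort", "Luxe", "Prestige")]

# Invented row (not real data), added only to show a second sheet type.
rows.append(("Hypothetical trim", replace(base, transmission="Robot")))

specs = unique_specs(rows)
sheets = group_by_transmission(specs)

print(len(specs))      # 2
print(sorted(sheets))  # ['Automatic', 'Robot']
```

The four trim levels collapse into a single row because their specifications are identical, which is the result expected in the filled-in template.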
Responses will be considered in the following format:
1. Completion time in days (an estimate is fine), from 2 to 14 days
2. When can you start?
3. Price in rubles