    916 pyspark mllib jobs found, priced in EUR

    I am looking for an experienced data analyst who is well-versed in PySpark to clean up a medium-sized dataset in CSV format. The file contains between 10k and 100k rows, and your primary role will be to: - Remove duplicate entries and deduplicate the dataset - Handle missing values - Aggregate the resultant data. Your proficiency in using PySpark to automate these processes efficiently will be critical to the success of this project, so prior experience in handling and cleaning similar large datasets would be beneficial. Please note, this project requires precision, meticulousness, and a good understanding of data aggregation principles.
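
    The cleanup steps above map onto a handful of DataFrame calls. Below is a minimal sketch, assuming placeholder column names ("id", "amount", "category") and local file paths that are not taken from the posting.

```python
# Hedged sketch of the requested cleanup: dedupe, fill missing values, aggregate.
# Column names and paths are illustrative placeholders.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("csv-cleanup").getOrCreate()

df = spark.read.csv("input.csv", header=True, inferSchema=True)

cleaned = (
    df.dropDuplicates()              # remove exact duplicate rows
      .dropDuplicates(["id"])        # deduplicate on a key column
      .fillna({"amount": 0.0})       # handle missing numeric values
)

summary = cleaned.groupBy("category").agg(F.sum("amount").alias("total_amount"))
summary.write.mode("overwrite").csv("output/", header=True)
```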

    €23 (Avg Bid)
    9 bids

    This vital task entails cleaning and sorting two CSV files, one of approximately 100,000 rows and a second of about 1.5 million rows, using PySpark (Python) in Jupyter Notebooks. The project consists of several key tasks. Read in both datasets and then: - Standardize the data to ensure consistency - Remove duplicate entries - Filter the columns we need - Handle and fill missing values - Aggregate the data on certain groupings as output. Important requirement: I also need unit tests to be written for the code at the end. Ideal Skills: Candidates applying for this project should be adept with PySpark in Python and have experience in data cleaning and manipulation. Experience with working on datasets of similar size would also be preferable. Attention to detail in ensuring ...
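
    For the unit-test requirement, one reasonable pattern (not prescribed by the posting) is to keep each transformation in a plain function and test it with pytest against a local SparkSession; column names in this sketch are made up.

```python
# Illustrative pytest test for a PySpark transformation; column names are placeholders.
import pytest
from pyspark.sql import SparkSession, functions as F


def drop_duplicates_and_fill(df):
    """Remove duplicate rows and fill missing values in the 'value' column."""
    return df.dropDuplicates().fillna({"value": 0})


@pytest.fixture(scope="session")
def spark():
    return SparkSession.builder.master("local[1]").appName("tests").getOrCreate()


def test_drop_duplicates_and_fill(spark):
    df = spark.createDataFrame([(1, 10), (1, 10), (2, None)], ["id", "value"])
    result = drop_duplicates_and_fill(df)
    assert result.count() == 2
    assert result.filter(F.col("value").isNull()).count() == 0
```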

    €167 (Avg Bid)
    57 bids

    I'm seeking an experienced Data Engineer with proficiency in SQL and PySpark. Key Responsibilities: - Develop and optimize our ETL processes. - Enhance our data pipeline for smoother operations. The ideal candidate should deliver efficient extraction, transformation, and loading of data, which is critical to our project's success. Skills and Experience: - Proficient in SQL and PySpark - Proven experience in ETL process development - Previous experience in data pipeline optimization Your expertise will significantly improve our data management systems, and your ability to deliver effectively and promptly will be highly appreciated.

    €85 (Avg Bid)
    14 bids

    - Conversion of the entire Python code into PySpark. Skills and experience required: - Proficient knowledge in Python.

    €24 (Avg Bid)
    26 bids

    ...competent in either PySpark or RDD, using Python to create versatile code fitting several scenarios. Your main task will be to write code to compare rows in Python, in line with the clear set of rules I provide. These rules are detailed in an attached Word document and are based on comparisons encompassing specific columns, presence or absence of particular data, and multiple-criteria comparisons. The expected output is a reversal logic for claim_opened_timestamp_utc, and I need the output shown on the right-hand side of that document. The row comparison can be done either in PySpark or with RDDs. I am using spark-3.3.0-bin-hadoop3 with py4j-0.10.9.5. I need your support until I execute it on my office computer, and I need it within 3 days. Ideal Skills and Experience: - Proficiency in Python - Experience with...
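
    The detailed rules live in the attached Word document, so only the general shape of a row-to-row comparison can be sketched here; the partition and order columns and the derived flag below are illustrative, not the poster's actual logic.

```python
# Generic row-comparison pattern with a window function; real rules are in the
# attached document, so "claim_id" and "status" are placeholder columns.
from pyspark.sql import SparkSession, Window, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("claims.parquet")  # placeholder input

w = Window.partitionBy("claim_id").orderBy("claim_opened_timestamp_utc")

compared = (
    df.withColumn("prev_status", F.lag("status").over(w))
      .withColumn(
          "is_reversal",
          F.col("prev_status").isNotNull() & (F.col("status") != F.col("prev_status")),
      )
)
compared.show()
```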

    €143 (Avg Bid)
    Urgent
    20 bids

    I'm a beginner user of Azure Databricks and PySpark. I'm looking to boost my skills to the next level and need an expert to guide me through advanced techniques. Ideal freelancers should have vast experience and profound knowledge of data manipulation using PySpark, Azure Databricks, data pipeline construction, and data analysis and visualization. If you've previously tutored or mentored in these areas, it'll be a plus.

    €11 / hr (Avg Bid)
    4 bids

    I need 2 small projects completed. The data needs to be pulled from an API using Python. The pulled data needs to be unnested, then transformed to answer some insights with a medallion architecture. Here, you need to showcase SCD type 2 ingestions, incremental joins, managing PII information, and aggregation. Final deliverable needed for the 1st project (Databricks): data model design and architecture overview, notebooks of transformations in Python and PySpark/Spark Scala. Final deliverable needed for the 2nd project (dbt): data model design and architecture overview, dbt sql and ...

    €258 (Avg Bid)
    16 bids

    Looking for someone with good skills in Airflow, Pyspark and SQL.

    €234 (Avg Bid)
    13 bids

    I am looking for a skilled professional in Python, with a comprehensive understanding of PySpark, Databricks, and GCP. A primary focus of the project is to build a data pipeline and apply time series forecasting techniques for revenue projection, using historical sales data. Key tasks will include: - Constructing a robust data pipeline using Python, PySpark, and Databricks. - Applying time series forecasting to produce revenue predictions. - Using Mean Squared Error (MSE) to measure model accuracy. The ideal candidate for this project would have: - Proven experience with Python, PySpark, Databricks, and GCP. - Expertise in time series forecasting models. - Practical understanding and use of Mean Squared Error (MSE) for model accuracy. - Experience with large scale ...
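
    For the MSE requirement, pyspark.ml's RegressionEvaluator covers it directly; a small self-contained sketch with toy numbers and the standard "label"/"prediction" column names:

```python
# Measuring forecast accuracy with MSE via pyspark.ml; the toy rows stand in
# for the output of model.transform(test_df).
from pyspark.sql import SparkSession
from pyspark.ml.evaluation import RegressionEvaluator

spark = SparkSession.builder.getOrCreate()

predictions = spark.createDataFrame(
    [(100.0, 95.0), (120.0, 130.0), (90.0, 88.0)], ["label", "prediction"]
)

evaluator = RegressionEvaluator(labelCol="label", predictionCol="prediction", metricName="mse")
print("MSE:", evaluator.evaluate(predictions))
```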

    €10 / hr (Avg Bid)
    14 bids

    I am looking to develop a sophisticated and efficient data pipeline for revenue forecasting. This pipeline will be implemented using Python, PySpark, Databricks, and GCP Big Data. Here is what you need to know about this task: - Data Source: The data originates from Google Cloud Platform's Big Data service. As such, the freelancer should have solid experience and understanding of working with Big Data services on GCP. - Data Update Frequency: The frequency of data updates will be confirmed during the project, but suffice to say the frequency could be high. Prior experience with real-time or near-real-time data processing will be highly beneficial. - Performance Metrics: The key performance metric I'm focusing on is data processing speed. The freelancer should have a strong kn...

    €17 / hr (Avg Bid)
    13 bids

    I'm in need of a specialist, ideally with experience in data science, Python, PySpark, and Databricks, to undertake a project encompassing data pipeline creation, time series forecasting and revenue forecasting. #### Goal: * Be able to extract data from GCP BigData efficiently. * Develop a data pipeline to automate this process. * Implement time series forecasting techniques on the extracted data. * Use the time series forecasting models for accurate revenue forecasting. #### Deadline: * The project needs to be completed ASAP, hence a freelancer with a good turnaround time is preferred. #### Key Skill Sets: * Data Science * Python, PySpark, Databricks * BigData on GCP * Time series forecasting * Revenue forecasting * Data Extraction and Automation Qualification in...

    €17 / hr (Avg Bid)
    15 bids

    I am looking for a developer to create an AWS Glue and PySpark script that will strengthen the data management of my project. The task involves moving more than 100GB of text data from a MySQL RDS table to my S3 storage account on a weekly basis. Additionally, the procured data needs to be written to Parquet files for easy referencing. The developer will also need to provide scripts to deploy the AWS Glue pipelines with Terraform, fitting all parameters. Skilled expertise in AWS Glue, PySpark, Terraform, and MySQL, along with experience in handling large data, is required. There is no compromise on the quality and completion timeline. Effective performance on this project will open doors to more work opportunities on my various projects.
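
    A hedged sketch of the weekly export using plain Spark JDBC and Parquet output; the RDS endpoint, credentials, table, partition bounds and bucket below are placeholders, and inside an AWS Glue job the same logic would run on the GlueContext's SparkSession.

```python
# Weekly MySQL RDS -> S3 Parquet export sketch; all connection details are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("rds-to-s3").getOrCreate()

df = (
    spark.read.format("jdbc")
    .option("url", "jdbc:mysql://my-rds-endpoint:3306/mydb")
    .option("dbtable", "my_table")
    .option("user", "user")
    .option("password", "password")
    .option("partitionColumn", "id")     # parallelise the 100GB+ read
    .option("lowerBound", "1")
    .option("upperBound", "100000000")
    .option("numPartitions", "32")
    .load()
)

df.write.mode("overwrite").parquet("s3://my-bucket/weekly-export/")
```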

    €38 (Avg Bid)
    15 bids

    I am seeking a skilled professional proficient in managing big data tasks with Hadoop, Hive, and PySpark. The primary aim of this project involves processing and analyzing structured data. Key Tasks: - Implementing Hadoop, Hive, and PySpark for my project to analyze large volumes of structured data. - Use Hive and PySpark for sophisticated data analysis and processing techniques. Ideal Skills: - Proficiency in Hadoop ecosystem - Experience with Hive and PySpark - Strong background in working with structured data - Expertise in big data processing and data analysis - Excellent problem-solving and communication skills Deliverables: - Converting raw data into useful information using Hive and Visualizing the results of queries into the graphical representation...

    €16 / hr (Avg Bid)
    15 bids

    ...currently searching for an experienced AWS Glue expert, proficient in PySpark with data frames and Kafka development. The ideal candidate will have: • Expertise in data frame manipulation. • Experience with Kafka integration. • Strong PySpark development skills. The purpose of this project is data integration, and we will be primarily processing data from structured databases. The selected freelancer should be able to work with these databases seamlessly, ensuring efficient and effective data integration using AWS Glue. The required work would involve converting structured databases to fit into a data pipeline, setting up data processing, and integrating APIs using Kafka. This project requires a strong background in AWS Glue, PySpark, data frame ...

    €217 (Avg Bid)
    24 bids

    I'm seeking assistance to develop a Python-based solution utilizing PySpark for efficient data processing using the Chord Protocol. This project demands an intermediate level of expertise in Apache Spark or PySpark, combining distributed computing knowledge with specific focus on Python programming. Key Requirements: - Proficiency in Python programming and PySpark framework. - Solid understanding of the Chord Protocol and its application in data processing. - Capable of implementing robust data processing solutions in a distributed environment. Ideal Skills and Experience: - Intermediate to advanced knowledge in Apache Spark or PySpark. - Experience in implementing distributed file sharing or data processing systems. - Familiarity with network communicati...

    €504 (Avg Bid)
    38 bids

    ...Professional with strong expertise in Pyspark for a multi-faceted project. Your responsibilities will extend to but not limited to: - Data analysis: You'll be working with diverse datasets including customer data, sales data and sensor data. Your role will involve deciphering this data, identifying key patterns and drawing out impactful insights. - Data processing: A major part of this role will be processing the mentioned datasets, and preparing them effectively for analysis. - Performance optimization: The ultimate aim is to enhance our customer targeting, boost sales revenue and identify patterns in sensor data. Utilizing your skills to optimize performance in these sectors will be highly appreciated. The ideal candidate will be skilled in Hadoop and Pyspark wi...

    €429 (Avg Bid)
    25 bids

    Build a Glue ETL using PySpark to transfer data from MySQL to Postgres. I am facing challenges in the column mappings between the 2 sources; the target database has enum and text-array datatypes. You should solve the errors in the column mappings, and should have prior experience ingesting data into the Postgres enum datatype.
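
    One workaround worth trying for the enum mapping error (an assumption about the root cause, not a confirmed fix for this project) is the PostgreSQL JDBC driver's stringtype=unspecified parameter, which lets the server cast plain strings into enum columns on insert:

```python
# Sketch: write a string column into a Postgres enum column by letting the
# server do the cast (stringtype=unspecified). Connection details, table and
# column names are placeholders.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("staging/")                            # data already extracted from MySQL

df = df.withColumn("status", F.col("status").cast("string"))   # enum target column kept as text

(
    df.write.format("jdbc")
    .option("url", "jdbc:postgresql://host:5432/db?stringtype=unspecified")
    .option("dbtable", "public.target_table")
    .option("user", "user")
    .option("password", "password")
    .mode("append")
    .save()
)
```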

    €20 / hr (Avg Bid)
    54 bids

    I am in need of an experienced data engineer with specific expertise in PySpark. This project involves the integration and migration of data from structured databases currently housed in AWS. Here's a rundown of your key responsibilities: - Data integration from various existing structured databases - Migration of the combined data to a single, more efficacious database Ideal Candidate: - Proven experience in data migration and integration projects - Expertise in PySpark is indispensable - Proficiency in manipulating AWS databases - A solid understanding of structured databases and various data formats is mandatory This project is more than just technical skills- I'm looking for someone who can understand the bigger picture and contribute to the overarching str...

    €612 (Avg Bid)
    13 bids

    I'm looking for a professional with a strong understanding of PySpark to help transform a dataframe into JSON following a specific schema. This project's main task is data transformation to aid in data interchange. The project requires: - Expertise in PySpark - Proficiency in data transformation techniques - Specific experience in data aggregation For the transformation, I require the application of an aggregation method. In this case, we will be sorting the data. It's crucial that you are skilled in various aggregation methods, especially sorting. Your knowledge in handling critical PySpark operations is crucial for this job's success. Experience in similar projects will be highly regarded.
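
    Since the target schema isn't shared in the posting, the field names below are illustrative; the sketch only shows the general pattern of sorting first and then serialising each row to JSON with a fixed shape.

```python
# Sort, then serialise each row to a JSON document with an explicit structure.
# Column names are placeholders for the real schema.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.read.parquet("input/")

json_df = (
    df.orderBy("customer_id", "event_time")                   # the requested sorting step
      .select(
          F.to_json(
              F.struct("customer_id", "event_time", "amount") # shape of each JSON document
          ).alias("payload")
      )
)

json_df.write.mode("overwrite").text("output/json/")          # one JSON string per line
```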

    €22 (Avg Bid)
    19 bids

    Looking for an expert Azure Data Engineer to assist with multiple tasks. Your responsibilities will include: - Implementing and managing Azure Data Lake and Data Ingestion. - Developing visual reports...platforms to achieve three main objectives: - Perform sophisticated data analysis and visualization. - Enable advanced data integration and transformation. - Build custom applications to meet specific needs. Candidates should have an advanced understanding of Azure Data Lake, Power BI, and Powerapps, bringing a minimum of 6 years of experience with Databricks. Proficiency in Python, SQL, PostgreSQL, and PySpark is also required. Knowledge of GitHub and the CI/CD process will be beneficial for this role. If you have the skills and expertise needed for this project, I'd love to...

    €31 / hr (Avg Bid)
    28 bids

    ...need to be pushed swiftly to Elasticsearch using Pyspark. Your expertise will help push all data columns from this file into Elasticsearch, establishing a more actionable access to a significant amount of data. Given the project's urgency, I'm expecting a rapid, reliable transition. While the structure for the documents remains undecided due to the project's intricacies, I'm open to suggestions that will make this process more efficient and effective. Anyone with experience in Pyspark, Elasticsearch, and vast data manipulation will have a substantial edge on this project, as these skills are highly necessary for success. A strong understanding of different data structures is also a plus. • Leading Skills Required: Proficiency in Pyspark ...
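
    A hedged sketch of the CSV-to-Elasticsearch push using the elasticsearch-hadoop Spark connector; the package coordinate and version, host, index name and file path are placeholders, and the document structure is left as the raw columns since the posting says it is still undecided.

```python
# Push all CSV columns into an Elasticsearch index via the es-hadoop connector.
# Host, index, package version and path are placeholders.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder.appName("csv-to-es")
    .config("spark.jars.packages", "org.elasticsearch:elasticsearch-spark-30_2.12:8.11.1")
    .getOrCreate()
)

df = spark.read.csv("big_file.csv", header=True, inferSchema=True)

(
    df.write.format("org.elasticsearch.spark.sql")
    .option("es.nodes", "localhost")
    .option("es.port", "9200")
    .mode("append")
    .save("my_index")                  # target index for every column of the file
)
```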

    €9 / hr (Avg Bid)
    3 bids

    ...Title: Pyspark Data Engineering Training Overview: I am a beginner/intermediate in Pyspark and I am looking for a training program that focuses on data processing. I prefer one on one and written guides as the format for the training. Skills and Experience Required: - Strong expertise in Pyspark and data engineering - Excellent knowledge of data processing techniques - Experience in creating and optimizing data pipelines - Familiarity with data manipulation and transformation using Pyspark - Ability to explain complex concepts in a clear and concise manner through written guides - Understanding of best practices for data processing in Pyspark Training Topics: The training should primarily focus on data processing. The following topics should be cov...

    €21 / hr (Avg Bid)
    75 bids

    ...training is expected to be spread across multiple days. The trainer must have the capability to provide an understanding of the major concepts and components of Apache Spark, with a focus on how to use Databricks and the Pyspark API to manipulate and visualize data. As the training progresses, the instructor should be able to explain how to develop applications using Pyspark and articulate different approaches that a data scientist would use to evaluate and test their models. The instructor should also be able to educate the users on how to deploy and maintain Pyspark applications and how to provide feedback and questions in order to improve their performance. We expect the trainer to be readily available to answer any questions and guide the users along the w...

    €92 (Avg Bid)
    78 bids

    I am seeking an expert in the field to provide remote training in the use of Databricks and Python with PySpark. This is important for developing data processing applications with a high degree of efficiency. The training should cover areas such as data wrangling, machine learning, and Spark streaming. In order to be successful, attendees must be well-versed in Databricks, Python and PySpark, as these skills will be essential for completing the course. The course should provide a good understanding of the concepts and practical application of these tools. This training will give attendees the skills they need to analyse and manipulate large datasets, develop effective data processing pipelines, design powerful machine learning models and build reliable applications that use...

    €97 (Avg Bid)
    77 bids

    ...S3, and RDS; Azure services; and Pyspark data processing and transformations. Essential Skills: - Proficient in AWS, specifically on EC2, S3, RDS with strong understanding of data storage and retrieval. - Expert in Azure services such as Azure SQL Database and Blob Storage. - Highly experienced in writing efficient data transformations using Pyspark. Ideal Experience: - Minimum 7 years in the field with solid experience in technical interviews and coaching. Your task will be to provide actionable insights, best practices, and expert advice to nail my upcoming technical interview. Having been on the other side of the interview table would be an added advantage. - Proven track record of performing successful data processing and transformations using Pyspark. - Prev...

    €14 / hr (Avg Bid)
    8 bids

    Experienced Python + SQL + AWS + Azure data engineer (7+ years) needed for evening IST timings, for guidance in interview preparation, especially for data engineering. Tasks: Should have good knowledge of PySpark, SQL, and pandas. Should have written multiple ETL pipelines in AWS and Azure. Note: The freelancer must be available during evening IST timings.

    €9 / hr (Avg Bid)
    12 bids

    ...structured data such as SQL databases. Skills and experience required: - Expertise in AWS migration, specifically from another cloud provider - Strong knowledge and experience with structured data, particularly SQL databases - Familiarity with AWS Glue and Athena for data processing and analysis - Ability to work with a combination of different AWS services for optimal performance and efficiency Pyspark, SQL, Python CDK TypeScript AWS Glue, EMR and andes Currently migrating from Teradata to AWS. Responsibilities: - Migrate data from another cloud provider to AWS, ensuring a smooth transition and minimal downtime - Design and develop applications that utilize AWS Glue and Athena for data processing and analysis - Optimize data storage and retrieval using AWS S3 and R...

    €8 / hr (Avg Bid)
    14 bids

    ...am looking for a skilled and experienced developer to work on a personal project involving the use of CNNs with PySpark for analyzing brain and lung cancer. Skills and Experience: - Proficient in using PySpark and CNNs - Intermediate understanding of convolutional neural networks - Familiarity with analyzing medical data - Experience in working with cancer-related datasets - Strong problem-solving skills and attention to detail The project requires the use of specific datasets, which I already have. However, any additional assistance in acquiring relevant datasets would be appreciated. The ideal candidate should have a good understanding of CNNs and be able to apply them using PySpark. Experience in analyzing medical data and working with cancer-related datasets would ...

    €33 (Avg Bid)
    10 bids

    I am looking for a skilled professional who can help me with a project titled "synapse pyspark delta lake merge scd type2 without primary key". The ideal candidate should have experience and expertise in the following areas: Desired Outcome: - The desired outcome of the merge process is to update existing records and insert new records. Data Quality: - The level of data quality required for the outcome is high integrity, with no duplicates and full accuracy. Handling Historical Data: - There is a specific requirement to keep track of historical changes to the data. Skills and Experience: - Proficiency in Synapse, Pyspark, Delta Lake - Experience with SCD Type 2 implementation - Strong understanding of data integrity and accuracy - Ability to handle historical da...
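
    One common workaround when the dimension has no primary key (an assumption about the intended design, not the poster's specification) is to derive a deterministic hash key from the identifying columns and merge on it; the sketch below only expires changed rows and inserts brand-new keys, leaving the second pass that re-inserts the new version of changed rows out for brevity.

```python
# SCD type 2 Delta merge keyed on a derived hash; table and column names are
# illustrative, and the target table is assumed to carry the same derived columns.
from delta.tables import DeltaTable
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.getOrCreate()

updates = (
    spark.read.parquet("staging/")
    .withColumn("row_key", F.sha2(F.concat_ws("||", "name", "address"), 256))      # stand-in key
    .withColumn("attr_hash", F.sha2(F.concat_ws("||", "segment", "status"), 256))  # change detector
    .withColumn("effective_from", F.current_timestamp())
    .withColumn("is_current", F.lit(True))
)

target = DeltaTable.forName(spark, "dim_customer")

(
    target.alias("t")
    .merge(updates.alias("s"), "t.row_key = s.row_key AND t.is_current = true")
    .whenMatchedUpdate(
        condition="t.attr_hash <> s.attr_hash",            # only expire rows that actually changed
        set={"is_current": "false", "effective_to": "current_timestamp()"},
    )
    .whenNotMatchedInsertAll()
    .execute()
)
```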

    €306 (Avg Bid)
    2 bids
    Senior data engineer (Ended)

    ...Senior Data Engineer who possesses extensive experience and proficiency in a range of key technologies and tools. The ideal candidate should have a strong background in Python, demonstrating skillful use of this programming language in data engineering contexts. Proficiency in Apache Spark is essential, as we rely heavily on this powerful analytics engine for big data processing. Experience with PySpark, the Python API for Spark, is also crucial. In addition to these core skills, we require expertise in AWS cloud services, particularly AWS Glue and Amazon Kinesis. Experience with AWS Glue will be vital for ETL operations and data integration tasks, while familiarity with Amazon Kinesis is important for real-time data processing applications. Furthermore, the candidate should hav...

    €10 / hr (Avg Bid)
    11 bids

    I am looking for an expert in Spark to perform a student's project. The project involves data cleaning and preprocessing, machine learning modeling, and data visualization and analysis. Skills and experience required: - Strong knowledge and experience in Spark - Proficiency in data cleaning and preprocessing techniques - Experience in machine learning modeling using Spark's MLlib - Familiarity with data visualization and analysis using Spark's GraphX - Ability to present the results of the analysis in both a written report and a visual presentation (charts, graphs) Please note that the project details will be shared privately with the selected freelancer.

    €32 (Avg Bid)
    5 bids

    I am looking for an Airflow, GCP, and Python expert to assist me with my project. Candidate should have a good knowledge of DAG, GIT, pandas, agile, pyspark and Airflow.

    €271 (Avg Bid)
    19 bids
    Pyspark AWS ML Django (Ended)

    I am looking for a freelancer who can assist me with a Pyspark AWS ML project. The main goal of the project is data processing and transformation. I already have all the data needed for the project. The preferred timeline for this project is flexible. Skills and Experience: - Strong experience with Pyspark and AWS ML - Proficient in data processing and transformation techniques - Familiarity with machine learning model development - Ability to work within a flexible timeline

    €14 / hr (Avg Bid)
    28 bids

    Years of experience: 7+ Location: Remote - India Contract Tenure - 03-06 Months Notice Period - Immediate -15/20 Days Timings : 12pm - 9pm IST M - F AWS Data Engineer Requirements • Collaborate with business an...functions to handle data quality and validation. • Should have good understanding on S3,Cloud Formation, Cloud Watch, Service Catalog and IAM Roles • Perform data validation and ensure data accuracy and completeness by creating automated tests and implementing data validation processes. • Should have good knowledge about Tableau, with creating Tableau Published Datasets and managing access. • Write PySpark scripts to process data and perform transformations.(Good to have) • Run Spark jobs on AWS EMR cluster using Airflow DAGs.(Good to have) &...

    €3049 (Avg Bid)
    16 bids

    Looking for someone who has a good knowledge of Pyspark, Airflow DAGs, GitHub, Pandas and Agile Framework. Overall candidate should be well aware of the data ingestion approach. Knowledge of Google cloud platform is a Bonus

    €277 (Avg Bid)
    25 bids

    I am looking for a skilled AWS Cloud + PySpark developer to create a Glue Streaming WordCount program. The program should be able to perform word count analysis on streaming data. I have the PySpark streaming code ready, which works in my Jupyter notebook, so I need help in integrating Kinesis/MSK --> Glue --> RDS/S3
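
    A sketch of the word-count stage itself with Spark Structured Streaming, reading from MSK (Kafka) as a stand-in source; broker, topic and checkpoint paths are placeholders, a Kinesis source would instead be wired in through Glue's streaming connector, and the final S3/RDS write would typically be done in a foreachBatch sink.

```python
# Streaming word count; Kafka details and paths are placeholders.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("glue-streaming-wordcount").getOrCreate()

lines = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")
    .option("subscribe", "words-topic")
    .load()
    .selectExpr("CAST(value AS STRING) AS line")
)

counts = (
    lines.select(F.explode(F.split("line", r"\s+")).alias("word"))
    .groupBy("word")
    .count()
)

query = (
    counts.writeStream.outputMode("complete")
    .format("console")                                  # swap for a foreachBatch S3/RDS writer
    .option("checkpointLocation", "s3://my-bucket/checkpoints/wordcount/")
    .start()
)
query.awaitTermination()
```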

    €8 (Avg Bid)
    2 bids
    Big Data Project (Ended)

    ...Specific Letters (Using Spark) 5. Top Selling Countries (Using Spark) 6. Item Costs (Using Spark) 7. Sales Yearwise (Using PySpark) 8. Orders per Item (Using PySpark) 9. Country with Highest Sales (Using PySpark) 10. Customer Segmentation: Use clustering algorithms to identify different customer segments. 11. Time Series Forecasting: Predict future sales using ARIMA or LSTM. 12. Anomaly Detection: Identify any anomalies or outliers that could indicate fraudulent activity. 13. Association Rule Mining: Find associations between different products in the data (Using Spark). 14. Price Elasticity: Understand how the demand for a product changes with a change in its price (Using PySpark). 15. Correlation Between Priority and Profit: Analyze if 'Order Priority&...

    €75 (Avg Bid)
    10 bids
    Big Data Project (Ended)

    ...Specific Letters (Using Spark) 5. Top Selling Countries (Using Spark) 6. Item Costs (Using Spark) 7. Sales Yearwise (Using PySpark) 8. Orders per Item (Using PySpark) 9. Country with Highest Sales (Using PySpark) 10. Customer Segmentation: Use clustering algorithms to identify different customer segments. 11. Time Series Forecasting: Predict future sales using ARIMA or LSTM. 12. Anomaly Detection: Identify any anomalies or outliers that could indicate fraudulent activity. 13. Association Rule Mining: Find associations between different products in the data (Using Spark). 14. Price Elasticity: Understand how the demand for a product changes with a change in its price (Using PySpark). 15. Correlation Between Priority and Profit: Analyze if 'Order Priority&...

    €66 (Avg Bid)
    4 bids

    Its a simple dataset and I have already analysed it using pandas. I want to analyse it using Pyspark and Koalas API.
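
    For reference, the Koalas API now ships inside PySpark as pyspark.pandas, so most of the existing pandas analysis should carry over with little change; a minimal sketch with placeholder file and column names:

```python
# pandas-style analysis on Spark via pyspark.pandas (the Koalas API).
# Path and column names are placeholders.
import pyspark.pandas as ps

df = ps.read_csv("dataset.csv")

summary = df.groupby("category")["amount"].mean()   # same call shape as pandas
print(summary.head())

sdf = df.to_spark()                                 # drop to the native PySpark DataFrame when needed
sdf.filter("amount > 100").show()
```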

    €163 (Avg Bid)
    6 bids
    Pyspark training (Ended)

    Project Description: I am looking for a PySpark trainer who has advanced experience and expertise in data processing. The ideal candidate should be able to provide a scheduled training course. Skills and Experience: - Advanced level of experience with PySpark - Strong knowledge and expertise in tools like Databricks and PyCharm, and in transformations & actions - Ability to provide a scheduled training course

    €284 (Avg Bid)
    3 bids

    I am seeking assistance with Pyspark and small file remediation. Specifically, I am facing file format compatibility issues. Skills and experience required: - Intermediate level of experience with Pyspark - Strong understanding of file format compatibility - Proficiency in data processing and performance optimization Project requirements: - The small files I am working with have a size of 10 GB - The goal is to resolve file format compatibility issues and ensure smooth data processing - Attention to detail is crucial to avoid any data processing errors If you have expertise in Pyspark, file format compatibility, and can efficiently handle large files, I would love to discuss this project further. Please provide any relevant experience or work samples in your prop...
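
    The usual remediation for a directory of small files is to read them, normalise the format if needed, and rewrite into a controlled number of larger files; a sketch with placeholder paths, format and partition count (sized for roughly 10 GB at about 128 MB per output file):

```python
# Compact many small files into ~128 MB output files; paths and counts are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("small-file-compaction").getOrCreate()

df = spark.read.parquet("hdfs:///data/raw/")        # directory full of small files

df.repartition(80).write.mode("overwrite").parquet("hdfs:///data/compacted/")
```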

    €28 (Avg Bid)
    1 bid
    Software developers (Ended)

    I am looking for software developers who are proficient in Python, PySpark, and AWS and have good experience. The project timeline is estimated to be 1-2 weeks. Skills and experience required: - Proficiency in the Python programming language - Experience working with various frameworks or platforms - Hands-on experience with AWS and PySpark - Strong problem-solving skills - Good communication and collaboration skills.

    €4 / hr (Avg Bid)
    15 bids
    Hdfs and pyspark expert (Ended)

    I am looking for an experienced HDFS and PySpark expert to assist me with various tasks related to data ingestion, storage, processing, and analysis. The ideal freelancer should have a strong background in these technologies and be able to provide past work examples that showcase their expertise. Key requirements: - Expertise in HDFS and PySpark Timeline: - The project is expected to be completed within 1-2 weeks. If you meet these requirements and have the necessary experience, please include details of your past work and relevant experience in your application.

    €46 / hr (Avg Bid)
    7 bids
    query in pyspark (Ended)

    I am looking for a freelancer who can help me with a data analysis project using PySpark. I have a specific dataset that I would like to query, which is of medium size (1-10 GB). Skills and Experience: - Strong knowledge and experience in PySpark - Expertise in data analysis and data manipulation - Familiarity with working with medium-sized datasets - Ability to write efficient and optimized queries in PySpark The ideal freelancer for this project should have a strong background in data analysis and be proficient in PySpark. They should also have experience working with medium-sized datasets and be able to write efficient queries to extract meaningful insights from the data.

    €15 (Avg Bid)
    4 bids
    Pyspark aws data engineer (Ended)

    ...looking for a PySpark AWS data engineer who can help me with building and deploying ETL for machine learning models. Must initially pass a Python online coding exam. Tasks: - Building ETL models using PySpark and AWS - Deploying the models on AWS infrastructure - Using Terraform, spinning up ETL clusters, and understanding basic data-related AWS cloud tools, infrastructure and security. This is NOT a devops position, but you should be able to get around and use data-engineering-related AWS tools. Infrastructure: - The project requires migrating within AWS to a new infrastructure Involvement: - Partially involved in the project, at half time, 3-5 hours a day at a consistent, reliable time of your choosing. Ideal skills and experience: - Strong experience in data engineering with P...

    €36 / hr (Avg Bid)
    14 bids
    Databricks pyspark (Ended)

    Need help on a Databricks task: parse a fixed-width file and load it to Unity Catalog tables.
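
    A hedged sketch of the fixed-width parse on Databricks: read each record as a raw text line, slice it with substring, and save to a Unity Catalog table. The offsets, widths, source path and the catalog.schema.table name are all placeholders.

```python
# Fixed-width parsing sketch; `spark` is the session Databricks provides in a notebook.
from pyspark.sql import functions as F

raw = spark.read.text("/Volumes/main/raw/fixed_width/")       # one string column named "value"

parsed = raw.select(
    F.substring("value", 1, 10).alias("customer_id"),         # positions are illustrative
    F.substring("value", 11, 25).alias("customer_name"),
    F.substring("value", 36, 8).alias("order_date"),
).withColumn("order_date", F.to_date("order_date", "yyyyMMdd"))

parsed.write.mode("overwrite").saveAsTable("main.sales.orders")   # Unity Catalog three-level name
```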

    €19 / hr (Avg Bid)
    26 bids
    PySpark Developer (Ended)

    I have a project with SQL and Python code that needs to be converted to Spark SQL and the DataFrame API.

    €473 (Avg Bid)
    60 bids

    I am looking for a skilled PySpark developer to help me fix bugs in my visualization project. The specific bugs I am experiencing are related to data not displaying correctly. Skills and experience required: - Strong knowledge of PySpark and data visualization - Experience with troubleshooting and debugging PySpark projects - Familiarity with visualization tools such as Matplotlib and Seaborn The ideal candidate should be able to work efficiently and effectively to fix the bugs within a two-week timeframe. Attention to detail and the ability to analyze and interpret data accurately are essential for this project.

    €55 (Avg Bid)
    8 bids

    Project Title: Bug Identification in pyspark project I am looking for a skilled developer who can help me identify and fix functional issues in my pyspark project. The bug is specifically affecting the data analysis section of the code. Skills and Experience: - Strong proficiency in pyspark and data analysis - Experience in identifying and fixing functional issues in pyspark projects - Familiarity with data processing and data visualization - Ability to work within a deadline, as the bug needs to be fixed within two weeks If you have the necessary skills and experience, please submit your proposal. Thank you.

    €47 (Avg Bid)
    7 bids
    Wanted Azure DataEngineer (Ended)

    I am looking for an experienced Azure Data Engineer to work on my project, specifically only from Hyderabad, India. Specific Data Engineering Tasks: - Yes, I have some specific data engineering tasks in mind Preferred Tool for Data Processing and Analysis: PySpark - Azure Databricks Skills and Experience Required: - Strong experience with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics - Proficiency in data processing and analysis using Azure Databricks - Ability to handle large data sets efficiently - Knowledge of data engineering best practices and optimization techniques - Familiarity with Azure cloud services and infrastructure - Excellent problem-solving and troubleshooting skills - Strong communication and collaboration skills If you have the required sk...

    €78 (Avg Bid)
    2 bids