Looking for someone who can help me in the next 12 hours with this code. Need to do a challenge for
Data science suspects that advertising campaigns are showing the same ad to users too many
times (a high frequency) as they browse their favorite websites. They’ve asked the data
engineers to investigate.
Given two input files ([login to view URL] and [login to view URL]) containing tab-delimited ad event data,
find all of the users that saw the same ad more than 5 times on a site.
The output should list the Ad ID, Site ID, Frequency, and the total number of users that saw the ad at that
frequency.
Frequency is defined as the total number of times the same ad was shown to
a user on the same site.
The output should be tab-separated and sorted in descending order by frequency.
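To make the requirement concrete, here is a rough sketch of the aggregation in plain Python rather than Spark or Scala. It is only an illustration: the function names (read_events, frequency_report, format_rows) are made up for this example, and the column positions of the user, ad, and site IDs in the input files are not stated in the posting, so they are passed in as parameters that the caller has to supply.

```python
import csv
from collections import Counter


def read_events(paths, user_col, ad_col, site_col):
    """Yield (user_id, ad_id, site_id) for every row in the tab-delimited files.

    The column indices are assumptions supplied by the caller; the posting
    does not describe the file layout.
    """
    for path in paths:
        with open(path, newline="") as handle:
            for row in csv.reader(handle, delimiter="\t"):
                yield row[user_col], row[ad_col], row[site_col]


def frequency_report(events, threshold=5):
    """Return (ad_id, site_id, frequency, total_users) rows, highest frequency first."""
    # Step 1: frequency = number of times one user saw one ad on one site.
    per_user = Counter(events)

    # Step 2: keep frequencies above the threshold ("more than 5 times") and
    # count how many users share each (ad, site, frequency) combination.
    users_at_frequency = Counter(
        (ad_id, site_id, frequency)
        for (_user_id, ad_id, site_id), frequency in per_user.items()
        if frequency > threshold
    )

    # Step 3: sort by frequency descending; ad and site IDs break ties so the
    # output order is deterministic.
    return sorted(
        ((ad, site, freq, users) for (ad, site, freq), users in users_at_frequency.items()),
        key=lambda row: (-row[2], row[0], row[1]),
    )


def format_rows(rows):
    """Render the report rows as tab-separated lines."""
    return "\n".join("\t".join(str(value) for value in row) for row in rows)
```

With the real files, the call would look something like format_rows(frequency_report(read_events([file_a, file_b], user_col, ad_col, site_col))), where the file paths and column indices are placeholders. The same two-step grouping (count per user, then count users per frequency) carries over directly to a Spark job.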
Hi,
I am a Hadoop and Spark developer and have experience with Scala.
I understand your requirements and can work on this if you provide the input files.
I can finish this in less than 12 hours, probably in a couple of hours.
Thanks,
Pranay
We are a team of data scientists with solid experience in Big Data technologies such as Scala, Spark, Akka, Hadoop, and MapReduce, and in data analytics tools such as R and HBase. The team has qualified engineers with expertise in solving complex problems.
I am a future Computer Science student and money is not important to me. I have plenty of time since it is the summer and I believe I can finish the specified program in less than a week. Although I don't know Scala, I know Java and I am experienced with problem solving.