We are excited to team up with Bright Data to bring the community a new challenge.
Running through December 29, the Bright Data Web Scraping Chall...
For further actions, you may consider blocking this person and/or reporting abuse
This one was fun, thanks @noahbrinker @thepracticaldev!
Can't wait to see everyone's submissions!
I wonder if it is possible to scrape more that just textual data, i.e audio recordings, images and videos with Bright Data
You can... You scrape the links and you download them to your device....
In here, I used selenium to load a website and get the image urls, then download them to my device
I wish this also had guidance on ethical approaches to scraping. Just because we can scrape and train AI doesn't entitle us to disregard the hard work that authors, journalists, and artists produce by scraping and deriving from their work without consent or compensation.
But Bright Data's terms of service and license are to be respected.
This all just seems so wildly unethical.
@thepracticaldev @jess For the second prompt, at the heading, it states "Build a web scraper API..." and at the start of the text below it says "Use a Web Scraper API..." This has me a bit confused. Are we supposed to build an API that returns data scraped by Bright's web scraper API?
I think they are trying to say to use bright data api to solve any business problem. This will probably be judged based on the novelty of the business problem and the complexity of the sources being scraped… IG.
Thankss
I gave it a try and was amazed by what I discovered!
This is my submission:
Yoda’s EU Grant Finder for Solopreneurs: Powered by Bright Data
This should be fun : )
Cool hackathon! Can't wait to submit my project idea.
One problem, I didn't get the $15 credit after signing up using the provided link, I only get $2 trial credits. I signed up using Google login.
Hey @fahminlb33, if you haven't already please email noah@brightdata.com for support!
For the third prompt, do we need to submit a running, working AI model fine tuned with the data (which would require us to host the model at our own expense), or can we submit just the data normalization pipeline, up to the point where the normalized data is about to be fed into the model?
Hey @delaaja, you can just submit the pipeline and not the fully tuned model.
This sounds like a nice idea to work with!
So I have a question. Is it compulsory we include the GitHub repo of the project or just the project live Demo @noahbrinker @thepracticaldev
Gonna be fun let's do this
It's already Jan 9th, I was wondering when will the winners be announced.
Am I too late to participate? I am very interested and would like to join!
Great opportunity! Unfortunately, I just received this announcement, but very eager to look at the awesome result. Please show your skills.
Hum, this is the one I will finally deep into!
This is interesting, I wish to have the free time to work on this challenge. Good luck to the participants!
Looks Awesome but gotta sit this one out...
Lets do this guys
Alright!! Let's make it happen