Find answers, ask questions, and connect with our
community around the world.

Activity Forums Web Design How does Flipp extract data from pdf circulars?

  • How does Flipp extract data from pdf circulars?

    updated 2 weeks, 5 days ago 0 Member · 1 Post
  • Maverick

    Member
    October 31, 2019 at 12:48 am

    A possible answer to both questions is that retailers access a dashboard, upload their pdf circular / flyer. Then have a tool where you drag / specify / crop a section of the flyer and then manually type the title, the price etc. Then when the user comes along you know the boundaries the mouse has to enter and click on and the data associated with that “section” of the pdf. But I’m grabbing at straws here… It also seems a bit ridiculous, if retailers are already uploading their own content. Why not upload it in a proper manner, i.e. product image, price, title etc. And Flipp can display it in a way better UI than those ugly flyers. Firstly how do they accumulate all these flyers? Do they do web scraping? Do they find them and upload them manually? Do Retailers have a dashboard and upload the circulars themselves? And second – how do they extract data accurately from those flyers? Sure there’s OCR but that’s never fool proof with easy legible text, never mind the crazy designs / fonts / layouts that you’ll find on a flyer. A possible answer to both questions is that retailers access a dashboard, upload their pdf circular / flyer. Then have a tool where you drag / specify / crop a section of the flyer and then manually type the title, the price etc. Then when the user comes along you know the boundaries the mouse has to enter and click on and the data associated with that “section” of the pdf. But I’m grabbing at straws here… It also seems a bit ridiculous, if retailers are already uploading their own content. Why not upload it in a proper manner, i.e. product image, price, title etc. And Flipp can display it in a way better UI than those ugly flyers. – by hq overview Fattyhawk – –

Reply to: Maverick
Your information:

Cancel
Original Post
0 of 0 posts June 2018
Now