Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minimize duplicate posts during ingest #15

Open
that1guy opened this issue Aug 16, 2015 · 0 comments
Open

Minimize duplicate posts during ingest #15

that1guy opened this issue Aug 16, 2015 · 0 comments
Assignees

Comments

@that1guy
Copy link
Member

3Taps often feeds us identical items back to back. They do have different postingIds but they're the effectively the same item. Therefore, the feed and search results have many identical images and headings on the screen.

Examples:

Idea to resolve this:

  1. During ingestion but before upserting check to see if we have item with identical heading and price and whose posting date create was the same day.

  2. If the first suggestion is not accurate enough or perhaps overly stringent we could look into avoiding ingestion if we already have referencing the same image.

Examples:

https://www.hashtagsell.com/feed/cdbb17c1958d41bd8d4cf7a233d17f75
http://thumbs4.ebaystatic.com/m/mBmVw9tz6qcKQndYuGoB0Pg/140.jpg

https://www.hashtagsell.com/feed/fc41f6e4220a422fab6df9615afc8f3b
http://thumbs3.ebaystatic.com/m/mBmVw9tz6qcKQndYuGoB0Pg/140.jpg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants