instagram_scraper

Public repository for the Instagram scraping Django app

Please follow the instructions below to set up and run this project:

DATABASE:

PROJECT SETUP

Make a virtual env with Python 2.7.10 (run "virtualenv instagram_scraper_env")
cd into instagram_scraper_env
Activate the virtualenv (inside instagram_scraper_env, run "source bin/activate")
Clone the repository
cd into repository
Run "pip install -r requirements.txt"
Run "python manage.py migrate"

RUNNING THE APP

Run "python manage.py runserver" (defaults to port 8000)
Visit http://127.0.0.1:8000 in your browser (so far I've only tested this in Chrome)
Sign in to Instagram
Upon sign-in you should be redirected to the app's main dashboard
Enter job information:
Tag (without hashmark)
From date (exclusive)
To date (exclusive)
Press "Submit"
A new job should soon appear in the "Current Jobs" list on the right
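
Both date bounds are exclusive, so a post tagged exactly on the From date or the To date is skipped. Here is a minimal sketch of that check, assuming tag times are compared as Python datetime values. The function name `in_job_range` is hypothetical and is not taken from the app's code.

```python
from datetime import datetime


def in_job_range(tag_time, from_date, to_date):
    """Return True if tag_time falls strictly between the two bounds."""
    # Both bounds are exclusive, matching the job form fields.
    return from_date < tag_time < to_date


from_date = datetime(2015, 6, 1)
to_date = datetime(2015, 6, 30)

# A tag time inside the range is accepted.
print(in_job_range(datetime(2015, 6, 15), from_date, to_date))
# A tag time exactly on the From date is rejected.
print(in_job_range(from_date, from_date, to_date))
```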

REBOOTING JOBS

Back in the terminal, press Ctrl-C to terminate your Django server. This will halt any running jobs.
Restart the server with "python manage.py runserver". The jobs will still be halted, but should show up in the browser in the "Current Jobs" list.
Press "Reboot Job" to restart selected jobs
Jobs disappear from the list once they have paginated to a page whose tag time equals the "From date"
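
The stopping rule above can be pictured as a loop over pages ordered newest first. This is only a sketch under that assumption: `pages`, `tag_time`, and `pages_to_process` are hypothetical names, not the app's own. The comparison uses "at or before" rather than strict equality, so the loop also stops if no page lands exactly on the From date.

```python
from datetime import datetime


def pages_to_process(pages, from_date):
    """Yield pages until one reaches the job's From date.

    `pages` is assumed to be an iterable of dicts ordered newest first,
    each carrying a `tag_time` datetime (hypothetical structure).
    """
    for page in pages:
        if page["tag_time"] <= from_date:
            # The job is finished: this page is at or before the From date.
            return
        yield page


pages = [
    {"id": 1, "tag_time": datetime(2015, 6, 20)},
    {"id": 2, "tag_time": datetime(2015, 6, 10)},
    {"id": 3, "tag_time": datetime(2015, 6, 1)},
    {"id": 4, "tag_time": datetime(2015, 5, 25)},
]

processed = [page["id"] for page in pages_to_process(pages, datetime(2015, 6, 1))]
print(processed)
```

In this example only the first two pages are processed; the third page sits on the From date and ends the job.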

NOTES:

Files will be saved in /tmp/images/
You can change this path by updating "MEDIA_ROOT" in pixlee_project/settings.py, and "upload_to" in pixlee_app/models.py (may require new migrations to be made)
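
One way to avoid editing the path by hand is to read it from the environment in pixlee_project/settings.py. This is a sketch, not the project's current code. The variable name `SCRAPER_MEDIA_ROOT` is an assumption, and "upload_to" in pixlee_app/models.py would still need to agree with whatever path is chosen.

```python
import os

# SCRAPER_MEDIA_ROOT is a hypothetical environment variable;
# fall back to the default location named in this README.
MEDIA_ROOT = os.environ.get("SCRAPER_MEDIA_ROOT", "/tmp/images/")
```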
