Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compare expenses made with lodging against official prices of rooms #26

Open
Irio opened this issue Aug 28, 2016 · 5 comments
Open

Compare expenses made with lodging against official prices of rooms #26

Irio opened this issue Aug 28, 2016 · 5 comments

Comments

@Irio
Copy link
Collaborator

Irio commented Aug 28, 2016

Filtering quota's dataset by records with value 'Lodging, except for congressperson from Distrito Federal' in the column subquota_description will return many expenses made with hotels. We could match the value in the receipt against publicly available (through Booking.com, for instance) range of prices.

@Irio Irio added the analysis label Aug 28, 2016
@JVUnderground
Copy link

Holidays and events should be considered as they usually significantly the price of rooms.

@cuducos cuducos modified the milestone: Roadmap: Rental state outliers Nov 7, 2016
samuelgrigolato added a commit to samuelgrigolato/serenata-de-amor that referenced this issue Nov 21, 2016
This should be considered as an initial approach to okfn-brasil#26. Of course one may argue that there isn't any official data being scraped and considered yet, but this mean/std approach may prove to be useful for further analysis, with or without external data.
samuelgrigolato added a commit to samuelgrigolato/serenata-de-amor that referenced this issue Nov 24, 2016
This should be considered as an initial approach to okfn-brasil#26. Of course one may argue that there isn't any official data being scraped and considered yet, but this mean/std approach may prove to be useful for further analysis, with or without external data.
@samuelgrigolato
Copy link
Contributor

Does anybody have an idea how to proceed with this scraping? I mean, in addition to what is already being done by @Lrcezimbra at #100.

I had a look into booking.com but couldn't find any suitable API. I also tried decolar.com (they do have a public and free API [1]), but their terms of usage doesn't seem to allow the kind of data scraping we need (I don't even know why I thought it would 😄).

[1] http://dev.despegar.com/howto/hotels

samuelgrigolato added a commit to samuelgrigolato/serenata-de-amor that referenced this issue Nov 26, 2016
This should be considered as an initial approach to okfn-brasil#26. Of course
one may argue that there isn't any official data being scraped and
considered yet, but this mean/std approach may prove to be useful
for further analysis, with or without external data.

This version uses the recently added reimbursements dataset okfn-brasil#140.
@ebonet-zz
Copy link

I don't believe there are historical databases for pricing. What could be done is to identify hotels on the database and start watching booking/expedia/... and scrape data, building Serenata's own dataset for that. Keep in mind that hotel pricing is somewhat complex, and database can become large.

@evilasiov
Copy link

evilasiov commented Jan 7, 2017 via email

@cuducos cuducos removed this from the Roadmap: Rental state outliers milestone Mar 24, 2017
Irio pushed a commit that referenced this issue Feb 27, 2018
@cuducos
Copy link
Collaborator

cuducos commented Mar 1, 2018

Closed accidentally by unrelated commit from Rosie/Jarbas repos.

@cuducos cuducos reopened this Mar 1, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants