SQL: fix meta/has_table #6627

jorisvandenbossche · 2014-03-13T12:44:31Z

Further work for cleaning up the sql code (#6292).

the meta attribute is not updated automatically, with the consequence that eg when you delete a table from sql directly, the has_table function does not work anymore:
- I added a test for that
- I converted the meta attribute to one which is always updated when called. @mangecoeur looks OK?
now the tests are skipped when no connection could be made. @jreback, I just did a raise nose.SkipTest inside the setup function of the test class. -> moved and merged this in TST: skip sql tests if connection to server fails #6651

jreback · 2014-03-13T12:48:02Z

pandas/io/tests/test_sql.py

-            try:
-                import pymysql
-                self.driver = 'pymysql'
+    def setUp(self):


maybe you want to have a test setup() method to install modules and connect (then can define the setup_imports() and setup_connection() or something in the sub class of the test

I am no sure I fully understand what you mean.
Something like this? (just to clean up the code)

class _TestSQLAlchemy(PandasSQLTest): def set_up(self): setup_import() setup_driver() setup_connect() def setup_import(): if not SQLALCHEMY_INSTALLED: raise nose.SkipTest('SQLAlchemy not installed') def setup_connect(): try: self.conn = self.connect() self.pandasSQL = sql.PandasSQLAlchemy(self.conn) except sqlalchemy.exc.OperationalError: raise nose.SkipTest("Can't connect to server") class TestPostgreSQLAlchemy(_TestSQLAlchemy): def setup_driver() try: import psycopg2 self.driver = 'psycopg2' except ImportError: raise nose.SkipTest('psycopg2 not installed')

yep that's what I meant

ok, ik keep this in mind for a pr to clean up the tests

mangecoeur · 2014-03-13T18:26:46Z

@jorisvandenbossche I'm not 100% sure it's a good idea to always clear and reflect when you need Meta, in cases where you have a lot of tables it can take a few seconds. On the other hand you are right that if someone starts mixing in raw SQL (which i guess is very likely) we have no way of know whether Meta needs updating or not :/ I guess we can try it as a property and dive into potential performance issues later

jreback · 2014-03-13T18:41:16Z

can you invalidate the metadata _meta? when a raw string is passed? then only need to recreate if the user does raw stuff

jorisvandenbossche · 2014-03-14T11:10:19Z

@mangecoeur I agree it seems not the best idea to always clear and reflect the meta, but I do not see another way to prevent that the meta will not be up to date anymore if one has executed some raw SQL code.

@jreback But the problem is that we don't know when the user does raw stuff. They could do it with sql.execute("DROP TABLE table"), and this we could maybe 'detect', but you can also do it via sqlalchemy of directly in eg PgAdminIII for PostgreSQL.

Maybe another way to prevent the possible performance issues is to limit the cases that meta is called in our code. Eg for has_table, sqlalchemy has also an has_table method directly on the engine that maybe has not to pass through meta.

jreback · 2014-03-14T11:24:42Z

what happens if you only reflect the meta once
if user does raw stuff then they should manually reflect if they are then using pandas functions
(or maybe a flag to do it every time)

I think this is users issue

jorisvandenbossche · 2014-03-15T00:14:50Z

It is indeed maybe a user issue, but I think a common issue (as we already did it ourselves in the test suite (https://github.com/jorisvandenbossche/pandas/blob/sql2/pandas/io/tests/test_sql.py#L109), why one of the tests was not really working), so if we can easily prevent it, why not?

But I can see an argument to leave this as is, as it is the same with sqlalchemy, and we would just copy sqlalchemy behaviour then.
If we do that, i would certainly change has_table so this is up to date. So it would only be the direct use of PandasSQL.meta that would not always be up to date.

… has_table Because meta is not automatically updated, has_table will fail when a table is dropped with an sql query (not with drop_table function)

jorisvandenbossche · 2014-03-16T11:38:34Z

@mangecoeur @hayd @jreback Anybody a strong opinion on this?

jreback · 2014-03-16T13:12:44Z

I think should be a passed in option when creating the engine defaulting to True is ok
maybe meta='always', and accept None which will allow the user to manually reflect (maybe should make reflect/clear a public method)

jorisvandenbossche · 2014-05-16T15:01:44Z

Pushed this to 0.14.1, as the OO api is also not yet finalized (and this only concerns that, not the functional api)

mangecoeur · 2014-05-16T15:42:04Z

On refelction I like the idea of invalidating meta if the user does something "manual" like using raw SQL, but leave scope open for adding more useful functions that take advantage of SQLs expression API (or these could be added by people/projects who want to subclass the SQLAlchemy api class). Like I mentioned in another issue, it's important to make it possible for a user to manually manage reflection of metadata/supply their own metadata object for advanced use cases.

hayd · 2014-05-28T02:49:37Z

Perhaps raise a UserWarning if they have done something manual? Say that meta could be invalid / user should reflect ?

mangecoeur · 2014-05-28T13:37:50Z

@hayd I think that might make it too easy for people to shoot themselves in the foot. Perhaps the best is to always reflect but have a class "reflect" or "get meta" function. An advanced user can just create a subclass and override that function.
OR we could think of a clean way to pass SQLAlchemy options into the object - preferably without spamming dozens of kwargs...

jreback · 2015-01-25T22:57:23Z

@jorisvandenbossche keep this open?

jreback · 2015-03-05T23:46:50Z

@jorisvandenbossche ?

jorisvandenbossche · 2015-03-06T09:11:56Z

Closing for now. In any case not applicable as is, but I should have a closer look about how meta is handled now (things changed somewhat).

jreback reviewed Mar 13, 2014
View reviewed changes

jorisvandenbossche mentioned this pull request Mar 16, 2014

TST: skip sql tests if connection to server fails #6651

Merged

TST/BUG: SQL make meta a property which is updated when called + test…

b31c142

… has_table Because meta is not automatically updated, has_table will fail when a table is dropped with an sql query (not with drop_table function)

jreback added the SQL label Mar 22, 2014

jreback added this to the 0.14.0 milestone Mar 22, 2014

jorisvandenbossche modified the milestones: 0.14.1, 0.14.0 May 16, 2014

jreback added the API Design label May 16, 2014

jorisvandenbossche modified the milestones: 0.15.0, 0.14.1 Jul 1, 2014

jorisvandenbossche closed this Mar 6, 2015

jorisvandenbossche modified the milestones: No action, 0.16.0 Mar 6, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SQL: fix meta/has_table #6627

SQL: fix meta/has_table #6627

jorisvandenbossche commented Mar 13, 2014

jreback Mar 13, 2014

jorisvandenbossche Mar 14, 2014

jreback Mar 14, 2014

jorisvandenbossche Mar 14, 2014

mangecoeur commented Mar 13, 2014

jreback commented Mar 13, 2014

jorisvandenbossche commented Mar 14, 2014

jreback commented Mar 14, 2014

jorisvandenbossche commented Mar 15, 2014

jorisvandenbossche commented Mar 16, 2014

jreback commented Mar 16, 2014

jorisvandenbossche commented May 16, 2014

mangecoeur commented May 16, 2014

hayd commented May 28, 2014

mangecoeur commented May 28, 2014

jreback commented Jan 25, 2015

jreback commented Mar 5, 2015

jorisvandenbossche commented Mar 6, 2015

SQL: fix meta/has_table #6627

SQL: fix meta/has_table #6627

Conversation

jorisvandenbossche commented Mar 13, 2014

jreback Mar 13, 2014

Choose a reason for hiding this comment

jorisvandenbossche Mar 14, 2014

Choose a reason for hiding this comment

jreback Mar 14, 2014

Choose a reason for hiding this comment

jorisvandenbossche Mar 14, 2014

Choose a reason for hiding this comment

mangecoeur commented Mar 13, 2014

jreback commented Mar 13, 2014

jorisvandenbossche commented Mar 14, 2014

jreback commented Mar 14, 2014

jorisvandenbossche commented Mar 15, 2014

jorisvandenbossche commented Mar 16, 2014

jreback commented Mar 16, 2014

jorisvandenbossche commented May 16, 2014

mangecoeur commented May 16, 2014

hayd commented May 28, 2014

mangecoeur commented May 28, 2014

jreback commented Jan 25, 2015

jreback commented Mar 5, 2015

jorisvandenbossche commented Mar 6, 2015