JDBC Connection to SPARQL Endpoint Randomly Throws 404 Error #565
Comments
Do you have any pseudo code showing how the Jena JDBC driver (3.0.0) actually specifies its connection to the Virtuoso SPARQL endpoint? We need to be clear about whether it is connecting via the /sparql HTTP interface or via the JDBC interface. How often do these random errors occur, and are any messages written to the "virtuoso.log" file when they occur?
Sure thing:
They happen whenever I try to load the dataset through the SPARQL endpoint, usually within the first hour or two. The queries being run when the 404 error occurs are syntactically correct; I verified this by running the query directly in the Virtuoso Conductor's SPARQL editor. Here is the log from the same time as the above error message from PDI. The PDI process was kicked off on 7/13 at 13:33 and hit the 404 error on 7/13 at 14:20. I've included all of Virtuoso's 7/13 log below, which includes a manual restart on my part in an attempt at troubleshooting.
@OpenDataAlex: Also, you previously indicated not having granted the
@HughWilliams The connection is opened by PDI and data is passed one triple at a time through the templates in the original posting. I'm using the
@OpenDataAlex: It is still unclear to me how you are authenticating and performing SPARQL update operations against the default I assume this is the Jena JDBC driver being used and configured for remote access to a SPARQL endpoint? So in conclusion, for a default Virtuoso installation, based on the connect string you provide above, I would expect it to be something more like:
@HughWilliams I appreciate that, but I don't know what to tell you. I use the connection as mentioned earlier and I do have inserts/deletes/selects working fine (at least until the 404 hits). This could be a result of the AWS image that's available, but I'm not sure. In any case, all I have done to get PDI talking to the Virtuoso instance is add the Jena JDBC driver and give it the connection from my previous comments. I'm happy to provide more information, just let me know what to provide. In the meantime, I'll try switching to the connection string you recommended and will let you know if there is a change in performance.
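For readers following the connection-string question: Jena's remote JDBC driver takes URLs of the form `jdbc:jena:remote:query=...&update=...` and talks to the server over the SPARQL HTTP protocol, whereas Virtuoso's own JDBC driver uses `jdbc:virtuoso://host:port` against the SQL port. The sketch below only classifies a connection string so the two cases are easy to tell apart. The function name is hypothetical, and the example URLs in the usage note assume a default Virtuoso install (HTTP on 8890, SQL on 1111); they are not the strings actually posted in this thread.

```python
JENA_REMOTE_PREFIX = "jdbc:jena:remote:"
VIRTUOSO_PREFIX = "jdbc:virtuoso://"


def describe_connection(url: str) -> dict:
    """Report which interface a JDBC-style connection string targets.

    Hypothetical helper for illustration; it does not open a connection.
    """
    if url.startswith(JENA_REMOTE_PREFIX):
        # Jena's remote driver takes '&'-separated key=value parameters;
        # 'query' and 'update' name the SPARQL HTTP endpoints to use.
        params = dict(
            part.split("=", 1)
            for part in url[len(JENA_REMOTE_PREFIX):].split("&")
            if "=" in part
        )
        return {
            "interface": "sparql-http",
            "query": params.get("query"),
            "update": params.get("update"),
        }
    if url.startswith(VIRTUOSO_PREFIX):
        # Virtuoso's native driver connects to the SQL port, not /sparql.
        server = url[len(VIRTUOSO_PREFIX):].split("/", 1)[0]
        return {"interface": "virtuoso-jdbc", "server": server}
    return {"interface": "unknown"}
```

For example, `describe_connection("jdbc:jena:remote:query=http://localhost:8890/sparql&update=http://localhost:8890/sparql-auth")` reports the `sparql-http` interface with separate query and update endpoints, while `describe_connection("jdbc:virtuoso://localhost:1111")` reports `virtuoso-jdbc`. A 404 is an HTTP status, so it can only come from the first kind of connection.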
@OpenDataAlex: Note we have this post on the virtuoso-users mailing list, where it is implied Can you check the A manual checkpoint can be performed to commit transactions to the database, or the
@HughWilliams That did the trick, but it ran out of memory. The
@OpenDataAlex: What is running out of memory? Is a specific error being reported by Virtuoso on the client or server (log) side?
…ntInterval = 0 parameter would fix the 404 errors as per the Issue on virtuoso Github: openlink/virtuoso-opensource#565
@OpenDataAlex - Were you able to resolve the "ran out of memory" error you mentioned? We should probably address that in a fresh issue, if not. In either case, would you consider this issue (#565) resolved?
We were never able to resolve it, and unfortunately I've had to move on to other projects since this issue was first noted. I believe the issue does boil down to configuration and environment: we were trying to load more data into memory than there was memory available. We can consider the issue closed, but if I run into it ever again I'll reference this one and create a new issue.
I've written a process using Pentaho Data Integration (6.0) and the Jena JDBC driver (3.0.0) connecting to the SPARQL endpoint for Virtuoso version 7.20.3215. The process normalizes data into triples and then fills the triples into one of two INSERT statements (see the sample queries below). At random points while loading the triples, I receive the following error:
Sample queries:
AND
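As a rough illustration of the templating step described in the post (one triple filled into an INSERT statement at a time), here is a minimal sketch. It is not the poster's actual queries: the template text, the function names, and the `http://example.org/...` IRIs are all assumptions made for the example, and it only builds the query string without sending it anywhere.

```python
# Characters SPARQL does not allow inside an <...> IRI reference.
_FORBIDDEN_IN_IRI = set('<>"{}|^`\\')


def iri(value: str) -> str:
    """Wrap an IRI in angle brackets, rejecting characters SPARQL forbids."""
    if any(ch in _FORBIDDEN_IN_IRI or ord(ch) <= 0x20 for ch in value):
        raise ValueError(f"not a safe IRI: {value!r}")
    return f"<{value}>"


def literal(value: str) -> str:
    """Render a plain string literal, escaping backslashes, quotes, newlines."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


# Hypothetical template; doubled braces are literal braces for str.format.
INSERT_TEMPLATE = "INSERT DATA {{ GRAPH {graph} {{ {s} {p} {o} . }} }}"


def build_insert(graph: str, s: str, p: str, o: str, o_is_iri: bool = False) -> str:
    """Fill one triple into the INSERT DATA template."""
    obj = iri(o) if o_is_iri else literal(o)
    return INSERT_TEMPLATE.format(graph=iri(graph), s=iri(s), p=iri(p), o=obj)
```

Escaping values before they reach the template matters when triples come from normalized source data, since a stray quote or brace in a value would otherwise produce a malformed update.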