Skip to content

Commit

Permalink
Merge pull request #7 from brendancsmith/dev
Browse files Browse the repository at this point in the history
v0.2.1
  • Loading branch information
brendancsmith authored Apr 26, 2024
2 parents 041bae7 + c245150 commit ae661db
Show file tree
Hide file tree
Showing 14 changed files with 113 additions and 93 deletions.
15 changes: 2 additions & 13 deletions .trunk/configs/.markdownlint.yaml
Original file line number Diff line number Diff line change
@@ -1,13 +1,2 @@
# Autoformatter friendly markdownlint config (all formatting rules disabled)
default: true
blank_lines: false
bullet: false
html: false
indentation: false
line_length: false
no-duplicate-heading:
siblings_only: true
no-trailing-punctuation: false
spaces: false
url: false
whitespace: false
# Prettier friendly markdownlint config (all formatting rules disabled)
extends: markdownlint/style/prettier
23 changes: 9 additions & 14 deletions .trunk/trunk.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2,12 +2,12 @@
# To learn more about the format of this file, see https://docs.trunk.io/reference/trunk-yaml
version: 0.1
cli:
version: 1.21.0
version: 1.22.0
# Trunk provides extensibility via plugins. (https://docs.trunk.io/plugins)
plugins:
sources:
- id: trunk
ref: v1.4.5
ref: v1.5.0
uri: https://github.com/trunk-io/plugins
# Many linters and tools depend on runtimes - configure them here. (https://docs.trunk.io/runtimes)
runtimes:
Expand All @@ -27,30 +27,25 @@ lint:
commands:
- name: lint
run: bandit --exit-zero -c bandit.yaml --format json --output ${tmpfile} ${target}
- name: trufflehog
commands:
- name: lint
run: trufflehog filesystem --json --fail --exclude-paths=/.gitignore ${target}
enabled:
- actionlint@1.6.27
- bandit@1.7.8
- checkov@3.2.53
- checkov@3.2.74
- git-diff-check
- markdownlint@0.39.0
- osv-scanner@1.7.0
- osv-scanner@1.7.2
- prettier@3.2.5
- ruff@0.3.5
- semgrep@1.67.0
- ruff@0.4.1
- sourcery@1.16.0
- taplo@0.8.1
- trivy@0.50.1
- trufflehog-git@3.72.0
# - trufflehog@3.71.0
- trivy@0.50.4
- trufflehog@3.74.0
- trufflehog-git@3.74.0
- yamllint@1.35.1
disabled:
- trufflehog
- black
- isort
- semgrep
actions:
enabled:
- commitizen
Expand Down
25 changes: 13 additions & 12 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,12 @@
## Contributing to Diffbot Knowledge Graph Client
# Contributing to Diffbot Knowledge Graph Client

First off, thanks for taking the time to contribute!

The following is a set of guidelines for contributing to `diffbot-kg`, which is hosted on GitHub. These are mostly guidelines, not rules. Use your best judgment, and feel free to propose changes to this document in a pull request.

### How Can I Contribute?
## How Can I Contribute?

#### Reporting Bugs
### Reporting Bugs

This section guides you through submitting a bug report for `diffbot-kg`. Following these guidelines helps the maintainer and the community understand your report, reproduce the behavior, and find related reports.

Expand All @@ -17,25 +17,26 @@ This section guides you through submitting a bug report for `diffbot-kg`. Follow
- **Explain which behavior you expected to see instead and why.**
- **Include screenshots and/or animated GIFs** which help demonstrate the steps or point out the part of Indeed Job Scraper which the suggestion is related to.

#### Pull Requests
### Pull Requests

Please follow these steps to have your contribution considered by the maintainer:

- After you submit your pull request, verify that all status checks are passing.
- While the maintainer reviews your PR, you can also ask for specific people to review your changes.
- Once your pull request is created, it will be reviewed by the maintainer of the project. You may be asked to make changes to your pull request. There's always a chance your pull request won't be accepted.

#### Python Styleguide
### Python Styleguide

- All Python must adhere to [PEP 8](https://www.python.org/dev/peps/pep-0008/).
- Use type annotations according to [PEP 484](https://www.python.org/dev/peps/pep-0484/) and [PEP 526](https://www.python.org/dev/peps/pep-0526/).
- Format your python code with [Black](https://github.com/ambv/black).
- Lint your python code with [Ruff](https://github.com/jendrikseipp/ruff).
- All Python must adhere to [PEP 8][PEP8].
- Use type annotations according to [PEP 484][PEP484] and [PEP 526][PEP526].
- Format and lint your python code with [Ruff](https://github.com/jendrikseipp/ruff).
- Include docstrings and comments where appropriate.
- Write tests for new features and bug fixes.

To automatically format and lint your code on commit, run `pre-commit install` in the root of the repository.

### Attribution
## Attribution

This Contributing guide is adapted from the [Contributing to Atom](https://github.com/atom/atom/blob/master/CONTRIBUTING.md) guide.

[PEP8]: https://www.python.org/dev/peps/pep-0008/
[PEP484]: https://www.python.org/dev/peps/pep-0484/
[PEP526]: https://www.python.org/dev/peps/pep-0526/
22 changes: 7 additions & 15 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,29 +1,23 @@
Diffbot Knowledge Graph Client
=============

![](https://www.diffbot.com/assets/img/diffbot-logo-darkbg.svg)
# Diffbot Knowledge Graph Client

![Diffbot Logo](https://www.diffbot.com/assets/img/diffbot-logo-darkbg.svg)

[![CodeFactor](https://www.codefactor.io/repository/github/brendancsmith/diffbot-kg/badge)](https://www.codefactor.io/repository/github/brendancsmith/diffbot-kg)
![GitHub Actions Workflow Status](https://img.shields.io/github/actions/workflow/status/brendancsmith/diffbot-kg/python-package.yml)
![PyPI - Version](https://img.shields.io/pypi/v/diffbot-kg)
![GitHub License](https://img.shields.io/github/license/brendancsmith/diffbot-kg)


Description
-----------
## Description

Python client for the Diffbot Knowledge Graph API.

Installation
------------
## Installation

```sh
pip install diffbot-kg
```

Usage
-----
## Usage

```python
from diffbot_kg import DiffbotSearchClient, DiffbotEnhanceClient
Expand All @@ -38,12 +32,10 @@ search_results = search_client.search({query='type:Organization name:Diffbot'})
enhanced_entity = enhance_client.enhance({query='type:Organization name:Diffbot'})
```

Contributing
------------
## Contributing

Contributions to this project are welcome. - see the CONTRIBUTING.md file for details.

License
-------
## License

This project is licensed under the MIT License - see the LICENSE file for details.
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[tool.poetry]
name = "diffbot-kg"
version = "0.2.0"
version = "0.2.1"
description = "Python client for the Diffbot Knowledge Graph API."
authors = ["Brendan C. Smith"]
license = "MIT"
Expand Down
4 changes: 4 additions & 0 deletions src/diffbot_kg/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
# -*- coding: utf-8 -*-

from diffbot_kg.clients.enhance import DiffbotEnhanceClient # noqa: F401
from diffbot_kg.clients.search import DiffbotSearchClient # noqa: F401
Empty file.
11 changes: 8 additions & 3 deletions src/diffbot_kg/clients/base.py
Original file line number Diff line number Diff line change
@@ -1,8 +1,9 @@
from typing import Any

from diffbot_kg.clients.session import BaseDiffbotResponse, DiffbotSession
from yarl import URL

from diffbot_kg.clients.session import BaseDiffbotResponse, DiffbotSession


class BaseDiffbotKGClient:
"""
Expand Down Expand Up @@ -40,6 +41,8 @@ def _merge_params(self, params) -> dict[str, Any]:

params = params or {}
params = {**self.default_params, **params}

# sourcery skip: inline-immediately-returned-variable
params = {k: v for k, v in params.items() if v is not None}
return params

Expand All @@ -58,9 +61,11 @@ async def _get(
BaseDiffbotResponse: The response from the API.
"""

headers = {"accept": "application/json", **(headers or {})}
headers = headers or {}

params = self._merge_params(params)

# sourcery skip: inline-immediately-returned-variable
resp = await self.s.get(url, params=params, headers=headers)
return resp

Expand All @@ -87,10 +92,10 @@ async def _post(

headers = {
"content-type": "application/json",
"accept": "application/json",
**(headers or {}),
}

# sourcery skip: inline-immediately-returned-variable
resp = await self.s.post(url, params=params, headers=headers, json=json)
return resp

Expand Down
1 change: 1 addition & 0 deletions src/diffbot_kg/clients/enhance.py
Original file line number Diff line number Diff line change
Expand Up @@ -125,6 +125,7 @@ async def bulkjob_coverage_report(
url = self.bulk_job_coverage_report_url.human_repr().format(
bulkjobId=bulkjobId, reportId=reportId
)

resp = await self._get(url)
resp.__class__ = DiffbotCoverageReportResponse
return cast(DiffbotCoverageReportResponse, resp)
Expand Down
23 changes: 20 additions & 3 deletions src/diffbot_kg/clients/session.py
Original file line number Diff line number Diff line change
Expand Up @@ -36,23 +36,40 @@ class DiffbotSession:
"""

def __init__(self) -> None:
headers = {"accept": "application/json"}
timeout = aiohttp.ClientTimeout(total=60, sock_connect=5)
self._session = aiohttp.ClientSession(headers=headers, timeout=timeout)
self._headers = {"accept": "application/json"}
self._timeout = aiohttp.ClientTimeout(total=60, sock_connect=5)

self.is_open = False

async def open(self) -> Self:
self._session = aiohttp.ClientSession(headers=self._headers, timeout=self._timeout)
self._limiter = aiolimiter.AsyncLimiter(max_rate=5, time_period=1)

self.is_open = True
return self

async def get(self, url, **kwargs) -> BaseDiffbotResponse:
if not self.is_open:
await self.open()

# sourcery skip: inline-immediately-returned-variable
resp = await self._request(HTTPMethod.GET, url, **kwargs)
return resp

async def post(self, url, **kwargs) -> BaseDiffbotResponse:
if not self.is_open:
await self.open()

# sourcery skip: inline-immediately-returned-variable
resp = await self._request(HTTPMethod.POST, url, **kwargs)
return resp

async def close(self) -> None:
if not self._session.closed:
await self._session.close()

self.is_open = False

@retry(
retry=retry_if_exception_type(RetryableException),
reraise=True,
Expand Down
63 changes: 39 additions & 24 deletions tests/functional/clients/test_enhance_client.py
Original file line number Diff line number Diff line change
Expand Up @@ -89,15 +89,21 @@ async def test_bulkjob_status(self, request, token: Secret):

job_id = _get_job_id(request)

DELAY = 10
TIMEOUT = 60
BACKOFF_FACTOR = 1.5
backoff = 1
start = time.time()

# ACT
while time.time() - start <= DELAY:
while True:
response = await client.bulkjob_status(job_id)
if response.complete:
break
time.sleep(1)
elif time.time() - start > TIMEOUT:
pytest.fail("Bulk job status check did not complete in time")

time.sleep(backoff)
backoff *= BACKOFF_FACTOR

# ASSERT
assert response.status == 200
Expand Down Expand Up @@ -159,45 +165,54 @@ async def test_single_bulkjob_result(self, request, token: Secret):
await client.close()

@pytest.mark.asyncio
async def test_bulkjob_stop(self, request, token: Secret):
async def test_bulkjob_coverage_report(self, request, token: Secret):
# ARRANGE
client = DiffbotEnhanceClient(token=token.value)

job_id = _get_job_id(request)
report_id = request.config.cache.get("enhanceBulkJobCoverageReportId", None)
if report_id is None:
pytest.fail("Enhance bulk job coverage report ID not found in cache")


TIMEOUT = 60
BACKOFF_FACTOR = 1.5
backoff = 1
start = time.time()

# ACT
response = await client.stop_bulkjob(job_id)
while True:
try:
response = await client.bulkjob_coverage_report(job_id, report_id)
except ClientResponseError as e:
if e.status == 400:
time.sleep(backoff)
backoff *= BACKOFF_FACTOR
else:
if response.status == 200:
break
elif time.time() - start > TIMEOUT:
pytest.fail("Bulk job coverage report did not generate in time")

# ASSERT
assert response.status == 200
assert response.content["status"] == "COMPLETE"
assert response.content["message"] == f"Bulkjob [{job_id}] is completed"
assert len(response.content.strip().split("\n")) == 4

# TEARDOWN
await client.close()

@pytest.mark.asyncio
async def test_bulkjob_coverage_report(self, request, token: Secret):
async def test_bulkjob_stop(self, request, token: Secret):
# ARRANGE
client = DiffbotEnhanceClient(token=token.value)

job_id = _get_job_id(request)
report_id = request.config.cache.get("enhanceBulkJobCoverageReportId", None)
if report_id is None:
pytest.fail("Enhance bulk job coverage report ID not found in cache")

DELAY = 10
start = time.time()

# ACT
while time.time() - start <= DELAY:
try:
response = await client.bulkjob_coverage_report(job_id, report_id)
except ClientResponseError:
time.sleep(1)
else:
break
response = await client.stop_bulkjob(job_id)

# ASSERT
assert response.status == 200
assert len(response.content.strip().split("\n")) == 4
assert response.content["status"] == "COMPLETE"
assert response.content["message"] == f"Bulkjob [{job_id}] is completed"

# TEARDOWN
await client.close()
2 changes: 1 addition & 1 deletion tests/functional/clients/test_search_client.py
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
import pytest
from diffbot_kg.clients.search import DiffbotSearchClient

from diffbot_kg.clients.search import DiffbotSearchClient
from tests.functional.conftest import ORG_ENTITY_ID, ORG_NAME, Secret


Expand Down
Loading

0 comments on commit ae661db

Please sign in to comment.