swh.fuse.backends.web_api module#

class swh.fuse.backends.web_api.WebApiBackend(conf: Dict)[source]#

Bases: GraphBackend, ObjBackend

A Backend querying everything via Software Heritage’s public API.

This is simpler to configure and deploy, but expect long response times.

Only needs the web-api key of conf, searching for url and maybe auth-token keys.

async get_metadata(swhid: CoreSWHID) Dict | List[source]#

Entries in the returned dict depend on the swhid type.

For cnt, return a dict containing at least 'length': [int]

For dir, return a list of entries like

{
    "dir_id": "dir's hash",
    "name": "dir item name",
    "type": "str=dir|file|rev",  # yes, `file`...
    "target": "target hash",
    "perms": "permissions [int]",
    # if target is a cnt:
    "length": "file size [int]",
}

For rev, return a dict containing at least directory (SWHID), parents (list of objects containing at list id (hash)), date (str, isoformat), committer_date (str, isoformat), id` (object’s hash).

For rel, return a dict containing at least target (SWHID), target_type, date (str, isoformat), id (object’s hash).

For snp, return a dict where each key is a branch name, each value is a dict containing at least target (hash), target_type (content, directory, revision, release, snapshot or alias).

async get_blob(swhid) bytes[source]#

Fetch the content of a cnt object.

async get_history(swhid: CoreSWHID) List[Tuple[str, str]][source]#

Return a list of tuples (swhid, revision swhid)

async get_visits(url_encoded: str) List[Dict[str, Any]][source]#

Return a list of objects

Each object should contain fields date (ISO), origin (str), snapshot (SWHID’s hash as str)