Module `tf.core.files`

Global variables

var APIREF

Link to the API docs of TF.

var APP_APP

Name of the top-level directory of a TF app.

var APP_CODE

Name of the top-level directory of a legacy TF app.

var APP_CONFIG

Name of the config file of a TF app.

var APP_CONFIG_OLD

Name of the config file of an older, incompatible TF app.

var APP_DISPLAY

Relative path of the CSS file of a TF app.

var APP_EXPRESS_ZIP

Name of the zip file with the complete corpus data as attached to a release.

This zip file is retrieved when using a corpus without checkout specifiers and a part of the corpus is not locally available.

var DOWNLOADS

Local Downloads directory.

var EXPRESS_SYNC

Name of cache indicator file.

When a dataset is stored in the cache, information about the release / commit is stored in a file with this name.

var EXPRESS_SYNC_LEGACY

Legacy names of cache indicator files.

var LOCAL

Name of auxiliary directories.

Examples where this is used:

volume support: inside a TF dataset, the directory _local contains volumes of that dataset

var LOCATIONS

Default locations for TF data files.

If the locations parameter for the Fabric call is omitted, this is the default. TF will search all these directories for modules of .tf files.

var LS

Directory where layered search code is stored.

Layered search is client-side search, generated in a dedicated search repo. If the main data resides in org/repo, then the layered search code resides in org/repo-search/layeredsearch.

var SEARCHREF

Link to the Search docs of TF.

var SERVER_DISPLAY

Bunch of TF generic CSS files.

var SERVER_DISPLAY_BASE

Base of server CSS files.

var TEMP_DIR

Name of temporary directories.

.gitignore

Take care that these directories are ignored by git operations. Put a line

_temp/

in the .gitignore file.

var TOOL_DISPLAY

Bunch of tool-specific CSS files.

var TOOL_DISPLAY_BASE

Base of tool CSS files.

var URL_TFDOC

Base URL of the online TF documentation.

Functions

def abspath(path)

def abspath(path):
    return normpath(os.path.abspath(path))

def annotateDir(app, tool)

def annotateDir(app, tool):
    """Return the input and output directories for a specific annotation tool.

    *   The input directory is located next to the TF data of the corpus
    *   The output directory is located in the `_temp` directory next to the TF data
        of the corpus

    Parameters
    ----------
    app: object
        the TF app
    tool: string
        The name of the annotation tool

    Returns
    -------
    tuple of string
        The paths of the input directory and the output directory for that tool
        and that corpus
    """
    context = app.context
    appPath = context.appPath
    localDir = context.localDir
    baseDir = dirNm(appPath)
    return (f"{baseDir}/{tool}", f"{localDir}/{tool}")

Return the input and output directories for a specific annotation tool.

The input directory is located next to the TF data of the corpus
The output directory is located in the _temp directory next to the TF data of the corpus

Parameters

app : object: the TF app
tool : string: The name of the annotation tool

Returns

tuple of string: The paths of the input directory and the output directory for that tool and that corpus
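
As a minimal sketch of how the two paths are derived, the snippet below repeats the path computation on a stand-in object. It assumes only that the app exposes `context.appPath` and `context.localDir`, as the source above does; the function name `annotate_dirs`, the object `fake_app`, and the example paths are hypothetical and not part of TF.

```python
import os
from types import SimpleNamespace


def annotate_dirs(app, tool):
    """Mimic the path computation of annotateDir on a stand-in app object."""
    context = app.context
    base_dir = os.path.dirname(context.appPath)  # parent of the app directory
    return (f"{base_dir}/{tool}", f"{context.localDir}/{tool}")


# Hypothetical app: only the two attributes used above are provided.
fake_app = SimpleNamespace(
    context=SimpleNamespace(
        appPath="/home/me/github/org/repo/app",
        localDir="/home/me/github/org/repo/_temp",
    )
)

in_dir, out_dir = annotate_dirs(fake_app, "ner")
print(in_dir)
print(out_dir)
```

The result is a pair, so callers would unpack it into an input and an output directory.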

def backendRep(be, kind, default=None)

def backendRep(be, kind, default=None):
    """Various back-end dependent values.

    First of all, the back-end value is
    normalized. Then related values are computed.

    Parameters
    ----------
    be: string or None
        The raw back-end value.
        It will be normalized first, where missing, undefined, empty values are
        converted to the string `github`, and other values will be lower-cased.
        Also, `github.com` and `gitlab.com` will be shortened to `github` and `gitlab`.

    kind: string
        Indicates what kind of related value should be returned:

        *   `norm`: the normalized value as described above
        *   `tech`: technology of the back-end: either `github` or `gitlab`;
            we assume that there is only one GitHub; that there are many GitLabs;
            any back-end that is not `github` is an instance of `gitlab`.
        *   `name`: display name of the back-end: `GitHub` or `GitLab`
            or a server name like `gitlab.huc.knaw.nl`
        *   `machine`: lowercase machine name of the back-end: `github.com` or
            `gitlab.com`
            or a server name like `gitlab.huc.knaw.nl`
        *   `rep`: the normalized value enclosed in `<` and `>`. Depending on the
            parameter `default` the empty string is returned instead.
        *   `clone`: base directory where clones of repos in this back-end are stored
            `~/github`, etc.
        *   `cache`: base directory where data downloads from this back-end are stored:
            `~/text-fabric-data/github`, etc.
        *   `url`: URL of the online back-end
        *   `urlapi`: base URL of the API of the back-end
        *   `urlupload`: base URL used for uploads to the back-end
        *   `urlnb`: URL of notebooks from the online back-end, rendered on NB-Viewer
        *   `pages`: base URL of the Pages service of the back-end

    default: string, optional None
        Only relevant for `kind` = `rep`.
        If `default` is passed and not None and `be` is equal to `default`,
        then the empty string is returned.

        Explanation: this is used to supply a back-end specifier to a module
        but only if that module has a different back-end than the main module.

    Returns
    -------
    string
    """

    be = (be or "").lower()
    be = (
        GH
        if be in {None, "", GH, f"{GH}.com"}
        else GL if be in {GL, f"{GL}.com"} else be
    )
    beTail = ".".join(be.split(".")[1:])

    if kind == "norm":
        return be

    if kind == "tech":
        return be if be in {GH, GL} else GL

    if kind == "name":
        return "GitHub" if be == GH else "GitLab" if be == GL else be

    if kind == "machine":
        return "github.com" if be == GH else "gitlab.com" if be == GL else be

    if kind == "rep":
        if default is not None:
            default = backendRep(default, "norm")
            if be == default:
                return ""
        return f"<{be}>"

    if kind == "clone":
        return f"{_homeDir}/{be}"

    if kind == "cache":
        return f"{_homeDir}/text-fabric-data/{be}"

    if kind == "url":
        return URL_GH if be == GH else URL_GL if be == GL else f"https://{be}"

    if kind == "urlapi":
        return (
            URL_GH_API if be == GH else URL_GL_API if be == GL else f"https://api.{be}"
        )

    if kind == "urlupload":
        return (
            URL_GH_UPLOAD
            if be == GH
            else URL_GL_UPLOAD if be == GL else f"https://api.{be}"
        )

    if kind == "urlnb":
        return f"{URL_NB}/{be}"

    if kind == "pages":
        return f"{GH}.io" if be == GH else f"{GL}.io" if be == GL else f"pages.{beTail}"
    return None

Various back-end dependent values.

First of all, the back-end value is normalized. Then related values are computed.

Parameters

be : string or None

The raw back-end value. It will be normalized first, where missing, undefined, empty values are converted to the string github, and other values will be lower-cased. Also, github.com and gitlab.com will be shortened to github and gitlab.

kind : string

Indicates what kind of related value should be returned:

norm: the normalized value as described above
tech: technology of the back-end: either github or gitlab; we assume that there is only one GitHub; that there are many GitLabs; any back-end that is not github is an instance of gitlab.
name: display name of the back-end: GitHub or GitLab or a server name like gitlab.huc.knaw.nl
machine: lowercase machine name of the back-end: github.com or gitlab.com or a server name like gitlab.huc.knaw.nl
rep: the normalized value enclosed in < and >. Depending on the parameter default the empty string is returned instead.
clone: base directory where clones of repos in this back-end are stored: ~/github, etc.
cache: base directory where data downloads from this back-end are stored: ~/text-fabric-data/github, etc.
url: URL of the online back-end
urlapi: base URL of the API of the back-end
urlupload: base URL used for uploads to the back-end
urlnb: URL of notebooks from the online back-end, rendered on NB-Viewer
pages: base URL of the Pages service of the back-end

default : string, optional None

Only relevant for kind = rep. If default is passed and not None and be is equal to default, then the empty string is returned.

Explanation: this is used to supply a back-end specifier to a module but only if that module has a different back-end than the main module.

Returns

string
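
The normalization step and the role of `default` can be illustrated with a small standalone sketch. It does not call TF; `normalize_backend` and `backend_spec` are hypothetical names that only follow the rules described above.

```python
GH = "github"
GL = "gitlab"


def normalize_backend(be):
    """Normalize a raw back-end value: empty means github, known hosts are shortened."""
    be = (be or "").lower()
    if be in {"", GH, f"{GH}.com"}:
        return GH
    if be in {GL, f"{GL}.com"}:
        return GL
    return be  # any other value is kept as a lower-cased server name


def backend_spec(be, default=None):
    """Return the back-end enclosed in < and >, or "" when it equals the default."""
    be = normalize_backend(be)
    if default is not None and be == normalize_backend(default):
        return ""
    return f"<{be}>"


for raw in (None, "GitHub.com", "gitlab", "gitlab.huc.knaw.nl"):
    print(repr(raw), "->", normalize_backend(raw))

print(repr(backend_spec("gitlab", default="github")))
print(repr(backend_spec("github.com", default="github")))
```

The second function shows why `default` is compared after normalization: `github.com` and `github` count as the same back-end, so no specifier is produced.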

def chDir(directory)

def chDir(directory):
    """Change to other directory.

    Parameters
    ----------
    directory: string
        The directory to change to.
    """
    return os.chdir(directory)

Change to other directory.

Parameters

directory : string: The directory to change to.

def clearTree(path)

def clearTree(path):
    """Remove all files from a directory, recursively, but leave subdirectories.

    Reason: we want to inspect output in an editor.
    But if we remove the directories, the editor loses its current directory
    all the time.

    Parameters
    ----------
    path:
        The directory in question. A leading `~` will be expanded to the user's
        home directory.
    """

    subdirs = []
    path = expanduser(path)

    with os.scandir(path) as dh:
        for i, entry in enumerate(dh):
            name = entry.name
            if name.startswith("."):
                continue
            if entry.is_file():
                os.remove(f"{path}/{name}")
            elif entry.is_dir():
                subdirs.append(name)

    for subdir in subdirs:
        clearTree(f"{path}/{subdir}")

Remove all files from a directory, recursively, but leave subdirectories.

Reason: we want to inspect output in an editor. But if we remove the directories, the editor loses its current directory all the time.

Parameters

path: The directory in question. A leading ~ will be expanded to the user's home directory.
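
The effect can be tried out safely in a temporary directory. The sketch below is a standalone approximation built on `os.walk` rather than the function above; `clear_files` is a hypothetical name, and like the source it skips entries whose names start with a dot.

```python
import os
import tempfile


def clear_files(path):
    """Remove non-hidden files under path, recursively, keeping all directories."""
    for root, dirs, files in os.walk(path):
        # prune hidden directories so that they are not descended into
        dirs[:] = [d for d in dirs if not d.startswith(".")]
        for name in files:
            if not name.startswith("."):
                os.remove(os.path.join(root, name))


with tempfile.TemporaryDirectory() as base:
    os.makedirs(os.path.join(base, "sub"))
    for rel in ("a.txt", "sub/b.txt"):
        with open(os.path.join(base, rel), "w", encoding="utf8") as fh:
            fh.write("x")

    clear_files(base)
    remaining_dirs = sorted(os.listdir(base))
    remaining_files = os.listdir(os.path.join(base, "sub"))

print(remaining_dirs, remaining_files)
```

After the call the subdirectory is still there, but it no longer contains files.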

def dirAllFiles(path, ignore=None)

def dirAllFiles(path, ignore=None):
    """Gets all the files found by `path`.

    The result is just `[path]` if `path` is a file, otherwise the list of files under
    `path`, recursively.

    The files are sorted alphabetically by path name.

    Parameters
    ----------
    path: string
        The path to the file or directory on the file system.
    ignore: set
        Names of directories that must be skipped

    Returns
    -------
    tuple of string
        The names of the files under `path`. Each name starts with `path`,
        followed by the part relative to `path`.
    """
    if fileExists(path):
        return [path]

    if not dirExists(path):
        return []

    files = []

    if not ignore:
        ignore = set()

    for entry in os.listdir(path):
        name = f"{path}/{entry}"

        if os.path.isfile(name):
            files.append(name)
        elif os.path.isdir(name):
            if entry in ignore:
                continue
            files.extend(dirAllFiles(name, ignore=ignore))

    return tuple(sorted(files))

Gets all the files found by path.

The result is just [path] if path is a file, otherwise the list of files under path, recursively.

The files are sorted alphabetically by path name.

Parameters

path : string: The path to the file or directory on the file system.
ignore : set: Names of directories that must be skipped

Returns

tuple of string: The names of the files under path. Each name starts with path, followed by the part relative to path.
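
To make the role of `ignore` concrete, here is a minimal standalone sketch of a recursive listing that skips named directories. It is based on `os.walk`, not on the function above; `all_files` and the file names are hypothetical, and paths are joined with `/` as in the source.

```python
import os
import tempfile


def all_files(path, ignore=frozenset()):
    """List files under path recursively, skipping directories named in ignore."""
    if os.path.isfile(path):
        return (path,)
    found = []
    for root, dirs, files in os.walk(path):
        dirs[:] = [d for d in dirs if d not in ignore]  # prune ignored directories
        found.extend(f"{root}/{name}" for name in files)
    return tuple(sorted(found))


with tempfile.TemporaryDirectory() as base:
    for d in ("keep", "skip"):
        os.makedirs(f"{base}/{d}")
    for rel in ("top.txt", "keep/a.txt", "skip/b.txt"):
        with open(f"{base}/{rel}", "w", encoding="utf8") as fh:
            fh.write("x")

    # strip the temporary base so that only the relative parts are shown
    listed = [p[len(base) + 1:] for p in all_files(base, ignore={"skip"})]

print(listed)
```

The names in `ignore` are matched against directory names only, so a file called `skip` would still be listed.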

def dirContents(path)

def dirContents(path):
    """Gets the contents of a directory.

    Only the direct entries in the directory (not recursively), and only real files
    and folders.

    The list of files and folders will be returned separately.
    There is no attempt to sort the files.

    Parameters
    ----------
    path: string
        The path to the directory on the file system.

    Returns
    -------
    tuple of tuple
        The files and the subdirectories, in that order.
    """
    if not dirExists(path):
        return ((), ())

    files = []
    dirs = []

    for entry in os.listdir(path):
        if os.path.isfile(f"{path}/{entry}"):
            files.append(entry)
        elif os.path.isdir(f"{path}/{entry}"):
            dirs.append(entry)

    return (tuple(files), tuple(dirs))

Gets the contents of a directory.

Only the direct entries in the directory (not recursively), and only real files and folders.

The list of files and folders will be returned separately. There is no attempt to sort the files.

Parameters

path : string: The path to the directory on the file system.

Returns

tuple of tuple: The files and the subdirectories, in that order.

def dirCopy(pathSrc, pathDst, noclobber=False)

def dirCopy(pathSrc, pathDst, noclobber=False):
    """Copies a directory if it exists as directory.

    Wipes the destination directory, if it exists, unless `noclobber` is True:
    in that case nothing is copied and `False` is returned.
    """
    if dirExists(pathSrc):
        if dirExists(pathDst):
            if noclobber:
                return False
        dirRemove(pathDst)
        copytree(pathSrc, pathDst)
        return True
    else:
        return False

Copies a directory if it exists as directory.

Wipes the destination directory, if it exists, unless noclobber is True: in that case nothing is copied and False is returned.

def dirEmpty(target)

def dirEmpty(target):
    target = normpath(target)
    return not os.path.exists(target) or not os.listdir(target)

def dirExists(path)

def dirExists(path):
    """Whether a path exists as directory on the file system."""
    return (
        False
        if path is None
        else True if path == "" else os.path.isdir(path) if path else True
    )

Whether a path exists as directory on the file system.

def dirMake(path)

def dirMake(path):
    """Creates a directory if it does not already exist as directory."""
    if not dirExists(path):
        os.makedirs(path, exist_ok=True)

Creates a directory if it does not already exist as directory.

def dirMove(pathSrc, pathDst)

def dirMove(pathSrc, pathDst):
    """Moves a directory if it exists as directory.

    Refuses the operation if the target exists.
    """
    if not dirExists(pathSrc) or dirExists(pathDst):
        return False
    os.rename(pathSrc, pathDst)
    return True

Moves a directory if it exists as directory.

Refuses the operation if the target exists.

def dirNm(path)

def dirNm(path):
    """Get the directory part of a file name."""
    return os.path.dirname(path)

Get the directory part of a file name.

def dirRemove(path)

def dirRemove(path):
    """Removes a directory if it exists as directory."""
    if dirExists(path):
        rmtree(path)

Removes a directory if it exists as directory.

def expandDir(obj, dirName)

def expandDir(obj, dirName):
    if dirName.startswith("~"):
        dirName = dirName.replace("~", obj.homeDir, 1)
    elif dirName.startswith(".."):
        dirName = dirName.replace("..", obj.parentDir, 1)
    elif dirName.startswith("."):
        dirName = dirName.replace(".", obj.curDir, 1)
    return normpath(dirName)

def expanduser(path)

def expanduser(path):
    nPath = normpath(path)
    if nPath.startswith("~"):
        return f"{_homeDir}{nPath[1:]}"

    return nPath

def extNm(path)

def extNm(path):
    """Get the extension part of a file name.

    The dot is not included.
    If there is no extension, the empty string is returned.
    """
    parts = fileNm(path).rsplit(".", 1)
    # rsplit yields a single part when the file name contains no dot
    return "" if len(parts) == 1 else parts[-1]

Get the extension part of a file name.

The dot is not included. If there is no extension, the empty string is returned.
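
The documented behaviour can be sketched with the standard library alone. This is an illustration, not the function above: `ext_name` is a hypothetical name, and it relies on `os.path.splitext`, which treats a leading-dot name such as `.gitignore` as having no extension.

```python
import os


def ext_name(path):
    """Return the extension of the file part of path, without the dot."""
    ext = os.path.splitext(os.path.basename(path))[1]
    return ext[1:]  # drop the leading dot; "" stays ""


for example in ("docs/readme.md", "archive.tar.gz", "Makefile"):
    print(example, "->", repr(ext_name(example)))
```

Only the last dot counts, so `archive.tar.gz` yields `gz`.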

def fileCopy(pathSrc, pathDst)

def fileCopy(pathSrc, pathDst):
    """Copies a file if it exists as file.

    Wipes the destination file, if it exists.
    """
    if pathSrc == pathDst:
        return

    if fileExists(pathSrc):
        fileRemove(pathDst)
        copy(pathSrc, pathDst)

Copies a file if it exists as file.

Wipes the destination file, if it exists.

def fileCopyExpr(dirSrc, dirDst)

def fileCopyExpr(dirSrc, dirDst):
    """Copies the `__checkout__.txt` file from one directory to another.

    Wipes the destination file, if it exists.
    """
    pathSrc = f"{dirSrc}/{EXPRESS_SYNC}"
    pathDst = f"{dirDst}/{EXPRESS_SYNC}"

    if pathSrc == pathDst:
        return

    if fileExists(pathSrc):
        fileRemove(pathDst)
        copy(pathSrc, pathDst)

Copies the __checkout__.txt file from one directory to another.

Wipes the destination file, if it exists.

def fileExists(path)

def fileExists(path):
    """Whether a path exists as file on the file system."""
    return os.path.isfile(path)

Whether a path exists as file on the file system.

def fileMake(path, force=False)

def fileMake(path, force=False):
    """Create a new empty file.

    If necessary, create intermediate subdirectories.
    If the file already exists, do nothing, unless `force` is True.
    If the path exists as a directory, do nothing.

    Parameters
    ----------
    path: string
        The path to the new file
    force: boolean, optional False
        If `False`, nothing is done if the file already exists.
        Otherwise, an existing file is truncated.
    """
    if not dirExists(path) and (force or not fileExists(path)):
        parentDir = dirNm(path)
        dirMake(parentDir)

        with fileOpen(path, "w"):
            pass

Create a new empty file.

If necessary, create intermediate subdirectories. If the file already exists, do nothing, unless force is True. If the path exists as a directory, do nothing.

Parameters

path : string: The path to the new file
force : boolean, optional False: If False, nothing is done if the file already exists. Otherwise, an existing file is truncated.
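
The interplay between an existing file and `force` can be sketched as follows. This is a standalone approximation of the rules above using only the standard library; `make_file` and the file names are hypothetical.

```python
import os
import tempfile


def make_file(path, force=False):
    """Create an empty file with its parent directories; truncate only if force."""
    if os.path.isdir(path):
        return  # a directory is in the way: do nothing
    if os.path.isfile(path) and not force:
        return  # keep the existing file untouched
    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
    with open(path, "w", encoding="utf8"):
        pass


with tempfile.TemporaryDirectory() as base:
    target = os.path.join(base, "a", "b", "new.txt")

    make_file(target)  # creates a/b/ and the empty file
    with open(target, "w", encoding="utf8") as fh:
        fh.write("content")

    make_file(target)  # no force: the content is kept
    size_kept = os.path.getsize(target)

    make_file(target, force=True)  # force: the file is truncated
    size_forced = os.path.getsize(target)

print(size_kept, size_forced)
```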

def fileMove(pathSrc, pathDst)

def fileMove(pathSrc, pathDst):
    """Moves a file if it exists as file.

    Wipes the destination file, if it exists.
    """
    if fileExists(pathSrc):
        fileRemove(pathDst)
        # only rename when the source exists, as the docstring states
        os.rename(pathSrc, pathDst)

Moves a file if it exists as file.

Wipes the destination file, if it exists.

def fileNm(path)

def fileNm(path):
    """Get the file part of a file name."""
    return os.path.basename(path)

Get the file part of a file name.

def fileOpen(*args, **kwargs)

def fileOpen(*args, **kwargs):
    """Wrapper around `open()`, making sure `encoding="utf8"` is passed.

    This function calls `open()` with the same arguments, but if the optional
    argument `encoding` is missing and the mode argument does not contain a `b`
    (binary file), then `encoding="utf8"` is supplied.
    """

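    # the mode may be passed by keyword or as the second positional argument
    mode = kwargs.get("mode", args[1] if len(args) > 1 else "r")
    if "encoding" in kwargs or "b" in mode: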
        return open(*args, **kwargs)
    return open(*args, **kwargs, encoding="utf8")

Wrapper around open(), making sure encoding="utf8" is passed.

This function calls open() with the same arguments, but if the optional argument encoding is missing and the mode argument does not contain a b (binary file), then encoding="utf8" is supplied.
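
A minimal sketch of the same idea, assuming an explicit `mode` parameter instead of `*args`: text modes get UTF-8 unless an encoding is given, binary modes are passed through untouched. `open_utf8` is a hypothetical name, not part of TF.

```python
import os
import tempfile


def open_utf8(path, mode="r", **kwargs):
    """Open a file, defaulting to UTF-8 for text modes when no encoding is given."""
    if "b" not in mode and "encoding" not in kwargs:
        kwargs["encoding"] = "utf8"
    return open(path, mode, **kwargs)


with tempfile.TemporaryDirectory() as base:
    path = os.path.join(base, "greek.txt")

    with open_utf8(path, "w") as fh:
        fh.write("\u03bb\u03cc\u03b3\u03bf\u03c2")  # five Greek letters

    with open_utf8(path, "rb") as fh:  # binary mode: no encoding is added
        raw = fh.read()

    with open_utf8(path) as fh:
        text = fh.read()

print(len(text), len(raw))
```

Reading the file back in binary mode shows that the text was stored as UTF-8, independent of the platform's default encoding.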

def fileRemove(path)

def fileRemove(path):
    """Removes a file if it exists as file."""
    if fileExists(path):
        os.remove(path)

Removes a file if it exists as file.

def getCwd()

def getCwd():
    """Get current directory.

    Returns
    -------
    string
        The current directory.
    """
    return os.getcwd()

Get current directory.

Returns

string: The current directory.

def getLocation(targetDir=None)

def getLocation(targetDir=None):
    """Get back-end, org, repo, relative of directory.

    Parameters
    ----------
    targetDir: string, optional None
        If None, we take the current directory.
        Otherwise, if it starts with a `/` we take it as the absolute
        target directory.
        Otherwise, we append it to the absolute path of the current directory,
        with a `/` in between.

    We assume the target directory is somewhere inside

    `~/backend/org/repo`

    If it is immediately inside this, we set `relative` to `""`.

    If it is deeper down, we assume the reference directory is the parent of the
    current directory, and the path of this parent, relative to the repo directory
    goes into the `relative` component, preceded by a slash if it is non-empty.

    Returns
    -------
    tuple
        Consisting of `backend`, `org`, `repo`, `relative`.
        `relative` is either empty or it starts with a "/" plus a non-empty path.
    """
    curDir = normpath(os.getcwd())
    if targetDir is not None:
        targetDir = normpath(targetDir)

    destDir = (
        curDir
        if targetDir is None
        else targetDir if targetDir.startswith("/") else f"{curDir}/{targetDir}"
    )
    destDir = unexpanduser(destDir)

    if not destDir.startswith("~/"):
        return (None, None, None, None)

    destDir = destDir.removeprefix("~/")
    parts = destDir.split("/")

    if len(parts) == 1:
        return (parts[0], None, None, None)
    if len(parts) == 2:
        return (parts[0], parts[1], None, None)
    if len(parts) == 3:
        return (parts[0], parts[1], parts[2], "")

    relative = prefixSlash("/".join(parts[3:-1]))
    return (parts[0], parts[1], parts[2], relative)

Get back-end, org, repo, relative of directory.

Parameters

targetDir : string, optional None: If None, we take the current directory. Otherwise, if it starts with a / we take it as the absolute target directory. Otherwise, we append it to the absolute path of the current directory, with a / in between.

We assume the target directory is somewhere inside

~/backend/org/repo

If it is immediately inside this, we set relative to "".

If it is deeper down, we assume the reference directory is the parent of the current directory, and the path of this parent, relative to the repo directory goes into the relative component, preceded by a slash if it is non-empty.

Returns

tuple: Consisting of backend, org, repo, relative. relative is either empty or it starts with a "/" plus a non-empty path.
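
The splitting step can be followed on a few example paths. The sketch below starts from a path that is already abbreviated with `~` and leaves out the handling of the current directory; `split_location` and the example paths are hypothetical.

```python
def split_location(dest_dir):
    """Split a path of the form ~/backend/org/repo/... into its components."""
    if not dest_dir.startswith("~/"):
        return (None, None, None, None)

    parts = dest_dir.removeprefix("~/").split("/")
    # pad with None so that missing components are reported as None
    backend, org, repo = (parts[:3] + [None] * 3)[:3]
    if repo is None:
        return (backend, org, repo, None)

    # the last component is dropped: the reference directory is its parent
    inner = "/".join(parts[3:-1])
    relative = f"/{inner}" if inner else ""
    return (backend, org, repo, relative)


for example in (
    "~/github/org/repo",
    "~/github/org/repo/programs/sub",
    "~/github/org",
    "/tmp/elsewhere",
):
    print(example, "->", split_location(example))
```

A directory directly inside the repo still gives an empty `relative`, because its parent is the repo itself.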

def initTree(path, fresh=False, gentle=False)

def initTree(path, fresh=False, gentle=False):
    """Make sure a directory exists, optionally clean it.

    Parameters
    ----------
    path:
        The directory in question. A leading `~` will be expanded to the user's
        home directory.

        If the directory does not exist, it will be created.

    fresh: boolean, optional False
        If True, existing contents will be removed, more or less gently.

    gentle: boolean, optional False
        When existing content is removed, only files are recursively removed, not
        subdirectories.
    """

    path = expanduser(path)
    exists = os.path.exists(path)
    if fresh:
        if exists:
            if gentle:
                clearTree(path)
            else:
                rmtree(path)

    if not exists or fresh:
        os.makedirs(path, exist_ok=True)

Make sure a directory exists, optionally clean it.

Parameters

path: The directory in question. A leading ~ will be expanded to the user's home directory.

If the directory does not exist, it will be created.

fresh : boolean, optional False: If True, existing contents will be removed, more or less gently.
gentle : boolean, optional False: When existing content is removed, only files are recursively removed, not subdirectories.
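
The difference between a plain call and `fresh=True` can be sketched in a temporary directory. This standalone approximation leaves out the `gentle` variant and the `~` expansion; `init_tree` is a hypothetical name.

```python
import os
import shutil
import tempfile


def init_tree(path, fresh=False):
    """Ensure path exists as a directory; with fresh=True, empty it first."""
    if fresh and os.path.exists(path):
        shutil.rmtree(path)
    os.makedirs(path, exist_ok=True)


with tempfile.TemporaryDirectory() as base:
    work = os.path.join(base, "out")

    init_tree(work)  # created because it did not exist
    with open(os.path.join(work, "old.txt"), "w", encoding="utf8") as fh:
        fh.write("x")

    init_tree(work)  # exists already: contents are left alone
    kept = os.listdir(work)

    init_tree(work, fresh=True)  # wiped and recreated
    after_fresh = os.listdir(work)

print(kept, after_fresh)
```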

def isDir(path)

def isDir(path):
    """Whether path exists and is a directory."""
    return os.path.isdir(path)

Whether path exists and is a directory.

def isFile(path)

def isFile(path):
    """Whether path exists and is a file."""
    return os.path.isfile(path)

Whether path exists and is a file.

def normpath(path)

def normpath(path):
    if path is None:
        return None
    norm = os.path.normpath(path)
    return "/".join(norm.split(os.path.sep))

def prefixSlash(path)

def prefixSlash(path):
    """Prefix a / before a path if it is non-empty and does not already start with it."""
    return f"/{path}" if path and not path.startswith("/") else path

Prefix a / before a path if it is non-empty and does not already start with it.

def readJson(text=None, plain=False, asFile=None, preferTuples=False)

def readJson(text=None, plain=False, asFile=None, preferTuples=False):
    """Read a JSON file or string.

    The input data is either a text string or a file name or a file handle.
    Exactly one of the optional parameters `text` and `asFile` should be `None`.

    Parameters
    ----------
    text: string, optional None
        The input text if it is a string.
    asFile: string | object, optional None
        The input text if it is a file.
        If the value of `asFile` is a string, it is taken as a file name to read.
        Otherwise, it is taken as a file handle from which data can be read.
    plain: boolean, optional False
        If True, it returns a dictionary, otherwise it wraps the data structure
        recursively in an AttrDict.
    preferTuples: boolean, optional False
        If the resulting data structure is to be wrapped in an AttrDict,
        we will represent lists as tuples.

    Returns
    -------
    object
        The resulting data structure.
    """
    if asFile is None:
        cfg = json.loads(text)
    else:
        if type(asFile) is str:
            if fileExists(asFile):
                with fileOpen(asFile) as fh:
                    cfg = json.load(fh)
            else:
                cfg = {}
        else:
            # in this case asFile should be a file handle
            cfg = json.load(asFile)

    return cfg if plain else deepAttrDict(cfg, preferTuples=preferTuples)

Read a JSON file or string.

The input data is either a text string or a file name or a file handle. Exactly one of the optional parameters text and asFile should be None.

Parameters

text : string, optional None: The input text if it is a string.
asFile : string | object, optional None: The input text if it is a file. If the value of asFile is a string, it is taken as a file name to read. Otherwise, it is taken as a file handle from which data can be read.
plain : boolean, optional False: If True, it returns a dictionary, otherwise it wraps the data structure recursively in an AttrDict.
preferTuples : boolean, optional False: If the resulting data structure is to be wrapped in an AttrDict, we will represent lists as tuples.

Returns

object: The resulting data structure.
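
The three kinds of input can be sketched with the standard `json` module. The snippet corresponds to the `plain=True` case only, since it returns ordinary dictionaries and does not wrap them in an AttrDict; `read_json` is a hypothetical name.

```python
import io
import json
import os
import tempfile


def read_json(text=None, as_file=None):
    """Parse JSON from a string, a file name, or an open file handle."""
    if as_file is None:
        return json.loads(text)
    if isinstance(as_file, str):
        if not os.path.isfile(as_file):
            return {}  # a missing file yields an empty dictionary
        with open(as_file, encoding="utf8") as fh:
            return json.load(fh)
    return json.load(as_file)  # anything else is treated as a file handle


from_text = read_json(text='{"a": [1, 2]}')
from_handle = read_json(as_file=io.StringIO('{"b": true}'))

with tempfile.TemporaryDirectory() as base:
    from_missing = read_json(as_file=os.path.join(base, "absent.json"))

print(from_text, from_handle, from_missing)
```

Note that a file name that does not exist is not an error here: as in the source above, it results in an empty dictionary.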

def readYaml(text=None, plain=False, asFile=None, preferTuples=True, preferLists=True)

def readYaml(text=None, plain=False, asFile=None, preferTuples=True, preferLists=True):
    """Read a YAML file or string.

    The input data is either a text string or a file name or a file handle.
    Exactly one of the optional parameters `text` and `asFile` should be `None`.

    Parameters
    ----------
    text: string, optional None
        The input text if it is a string.
    asFile: string | object, optional None
        The input text if it is a file.
        If the value of `asFile` is a string, it is taken as a file name to read.
        Otherwise, it is taken as a file handle from which data can be read.
    plain: boolean, optional False
        If True, it returns a dictionary, otherwise it wraps the data structure
        recursively in an AttrDict.
    preferTuples: boolean, optional True
        If the resulting data structure is to be wrapped in an AttrDict,
        we will represent lists as tuples.
    preferLists: boolean, optional True
        Not used in the function body.

    Returns
    -------
    object
        The resulting data structure.
    """
    kwargs = dict(Loader=yaml.FullLoader)

    if asFile is None:
        cfg = yaml.load(text, **kwargs)
    else:
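        if type(asFile) is str:
            if fileExists(asFile):
                with fileOpen(asFile) as fh:
                    cfg = yaml.load(fh, **kwargs)
            else:
                cfg = {}
        else:
            # in this case asFile should be a file handle
            cfg = yaml.load(asFile, **kwargs)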

    return cfg if plain else deepAttrDict(cfg, preferTuples=preferTuples)

Read a YAML file or string.

The input data is either a text string or a file name or a file handle. Exactly one of the optional parameters text and asFile should be None.

Parameters

text : string, optional None: The input text if it is a string.
asFile : string | object, optional None: The input text if it is a file. If the value of asFile is a string, it is taken as a file name to read. Otherwise, it is taken as a file handle from which data can be read.
plain : boolean, optional False: If True, it returns a dictionary, otherwise it wraps the data structure recursively in an AttrDict.
preferTuples : boolean, optional True: If the resulting data structure is to be wrapped in an AttrDict, we will represent lists as tuples.
preferLists : boolean, optional True: Not used in the function body.

Returns

object: The resulting data structure.

def replaceExt(path, newExt)

def replaceExt(path, newExt):
    """Replace the extension of a path by another one. Specify it without dot."""
    (main, ext) = os.path.splitext(path)
    return f"{main}.{newExt}"

Replace the extension of a path by another one. Specify it without dot.

def setDir(obj)

def setDir(obj):
    obj.homeDir = expanduser("~")
    obj.curDir = normpath(os.getcwd())
    (obj.parentDir, x) = os.path.split(obj.curDir)

def splitPath(path)

def splitPath(path):
    """Split a file name in a directory part and a file part."""
    return os.path.split(path)

Split a file name in a directory part and a file part.

def str_presenter(dumper, data)

def str_presenter(dumper, data):
    """Configures YAML for dumping multiline strings.

    Ref: https://stackoverflow.com/questions/8640959
    """
    if data.count("\n") > 0:  # check for multiline string
        return dumper.represent_scalar("tag:yaml.org,2002:str", data, style="|")

    return dumper.represent_scalar("tag:yaml.org,2002:str", data)

Configures YAML for dumping multiline strings.

Ref: https://stackoverflow.com/questions/8640959

def stripExt(path)

def stripExt(path):
    """Strip the extension of a file name, if there is one."""
    (d, f) = (dirNm(path), fileNm(path))
    sep = "/" if d else ""
    return f"{d}{sep}{f.rsplit('.', 1)[0]}"

Strip the extension of a file name, if there is one.

def unexpanduser(path)

def unexpanduser(path):
    nPath = normpath(path)
    # if nPath.startswith(_homeDir):
    #    return f"~{nPath[len(_homeDir):]}"

    return nPath.replace(_homeDir, "~")

def writeJson(data, asFile=None, **kwargs)

def writeJson(data, asFile=None, **kwargs):
    """Write data as JSON.

    The output is either delivered as string or written to a file.

    Parameters
    ----------
    data: object
        The input data.
    asFile: string | object, optional None
        The output destination.
        If `None`, the output text is delivered as the function result.
        If the value of `asFile` is a string, it is taken as a file name to write to.
        Otherwise, it is taken as a file handle to which text can be written.
    kwargs: dict, optional {}
        Additional parameters for the underlying json.dump method.
        By default, we use `indent=1, ensure_ascii=False`.

    Returns
    -------
    str | void
        If asFile is not None, the function returns None and the result is written
        to a file. Otherwise, the result string is returned.
    """
    if "indent" not in kwargs:
        kwargs["indent"] = 1
    if "ensure_ascii" not in kwargs:
        kwargs["ensure_ascii"] = False

    if type(asFile) is str:
        with fileOpen(asFile, "w") as fh:
            json.dump(data, fh, **kwargs)
    else:
        dumped = json.dumps(data, **kwargs)

        if asFile is None:
            return dumped

        asFile.write(dumped)

Write data as JSON.

The output is either delivered as string or written to a file.

Parameters

data : object: The input data.
asFile : string | object, optional None: The output destination. If None, the output text is delivered as the function result. If the value of asFile is a string, it is taken as a file name to write to. Otherwise, it is taken as a file handle to which text can be written.
kwargs : dict, optional {}: Additional parameters for the underlying json.dump method. By default, we use indent=1, ensure_ascii=False.

Returns

str | void: If asFile is not None, the function returns None and the result is written to a file. Otherwise, the result string is returned.
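
The effect of the two defaults, and of overriding them, can be sketched with the standard `json` module. The snippet supports only file handles and string results, not file names; `write_json` is a hypothetical name.

```python
import io
import json


def write_json(data, as_file=None, **kwargs):
    """Serialize data as JSON with indent=1 and ensure_ascii=False as defaults."""
    kwargs.setdefault("indent", 1)
    kwargs.setdefault("ensure_ascii", False)
    dumped = json.dumps(data, **kwargs)
    if as_file is None:
        return dumped
    as_file.write(dumped)  # nothing is returned when writing to a handle


# Defaults: one space of indentation, non-ASCII characters are kept as they are.
as_string = write_json({"name": "caf\u00e9"})

# Caller-supplied keyword arguments take precedence over the defaults.
buffer = io.StringIO()
result = write_json([1, 2], as_file=buffer, indent=None)
print(buffer.getvalue())
```

With `ensure_ascii=False` the accented character is written literally instead of as a `\u` escape, which keeps corpus data readable in the output file.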

def writeYaml(data, asFile=None, sorted=False)

def writeYaml(data, asFile=None, sorted=False):
    """Write data as YAML.

    The output is either delivered as string or written to a file.

    Parameters
    ----------
    data: object
        The input data.
    asFile: string | object, optional None
        The output destination.
        If `None`, the output text is delivered as the function result.
        If the value of `asFile` is a string, it is taken as a file name to write to.
        Otherwise, it is taken as a file handle to which text can be written.
    sorted: boolean, optional False
        If True, when writing out a dictionary, its keys will be sorted.

    Returns
    -------
    str | void
        If asFile is not None, the function returns None and the result is written
        to a file. Otherwise, the result string is returned.
    """
    kwargs = dict(allow_unicode=True, sort_keys=sorted)

    if type(asFile) is str:
        with fileOpen(asFile, mode="w") as fh:
            yaml.dump(data, fh, **kwargs)
    else:
        dumped = yaml.dump(data, **kwargs)

        if asFile is None:
            return dumped

        asFile.write(dumped)

Write data as YAML.

The output is either delivered as string or written to a file.

Parameters

data : object: The input data.
asFile : string | object, optional None: The output destination. If None, the output text is delivered as the function result. If the value of asFile is a string, it is taken as a file name to write to. Otherwise, it is taken as a file handle to which text can be written.
sorted : boolean, optional False: If True, when writing out a dictionary, its keys will be sorted.

Returns

str | void: If asFile is not None, the function returns None and the result is written to a file. Otherwise, the result string is returned.