User Guide

Ezkv: JDK 21+ micro configuration framework

"Ezkv lemon squeezy!"

A non-opinionated Java bootstrapping configuration library that allows recursive chain loading of configuration from key values.

Description

Key values are everywhere (also known as associative arrays, list of tuples, or name-value pairs)!

Environment variables, system properties, cloud metadata, vault, HTTP FORM post, URI queries, command line arguments, and even most forms of JSON, HOCON, TOML, YAML, and XML can all be represented as simple "key value" pairs.

Thus, it is the perfect common denominator for providing applications with initial configuration. We call this "bootstrapping configuration" and it is usually gathered even before logging.

To do this, Ezkv loads streams of key values (KeyValue) from resources. What is unique about it is that certain key values will load more key values to the current stream, which is what we call chaining.

URI -> KeyValues* -> URI* -> KeyValues -> ...

Streams are useful because they can be filtered and transformed, which matters because keys and values from various sources often need transformation. For example, environment variable names often need to be converted to lower case, and some prefix removed.
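The kind of transformation described above can be sketched with plain JDK streams. This is only an illustration, not Ezkv's API: the class name EnvKeyNormalizer, the method normalize, and the MY_APP_ prefix are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.stream.Collectors;

final class EnvKeyNormalizer {

    // Keeps only keys that start with the prefix, removes the prefix,
    // and lower-cases what is left. Values are passed through untouched.
    static Map<String, String> normalize(Map<String, String> env, String prefix) {
        return env.entrySet().stream()
            .filter(e -> e.getKey().startsWith(prefix))
            .collect(Collectors.toMap(
                e -> e.getKey().substring(prefix.length()).toLowerCase(Locale.ROOT),
                Map.Entry::getValue,
                (first, second) -> second, // last one wins on duplicate keys
                LinkedHashMap::new));
    }

    public static void main(String[] args) {
        Map<String, String> env = new LinkedHashMap<>();
        env.put("MY_APP_DB_PORT", "5432");
        env.put("HOME", "/home/me");
        System.out.println(normalize(env, "MY_APP_")); // prints {db_port=5432}
    }
}
```

In Ezkv itself this kind of renaming is normally expressed declaratively with resource filters rather than in Java code.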

In short, it is a micro configuration framework that itself can be configured with key values.

Why and Use Case

Ezkv allows "users" to decide where their configuration comes from, instead of developers.

A simple command-line application use case might be like ripgrep, where a single environment variable dictates the location of configuration. Ezkv easily supports this style and also allows the user to load additional configuration from other places using key values in the configuration file. Ezkv can even enable configuration to come from other environment variables, which is something not typically possible with tools like ripgrep or similar utilities.

Another use case might be to simulate Spring Boot's configuration loading but without the rest of Spring Boot. Ezkv can achieve this declaratively through key values. In fact, a user could configure the previously mentioned ripgrep environment variable to perform a Spring Boot-like load of configuration.

Adding Ezkv to Your Project

To use Ezkv in your project, add the following dependency to your build configuration:

Maven

<dependency>
  <groupId>io.jstach.ezkv</groupId>
  <artifactId>ezkv-kvs</artifactId>
  <version>0.5.0-SNAPSHOT</version>
</dependency>

Gradle

implementation 'io.jstach.ezkv:ezkv-kvs:0.5.0-SNAPSHOT'

Example Usage

A simple example using java.util.Properties files that could be parsed to KeyValues:

var kvs = KeyValuesSystem.defaults()
    .loader()
    .variables(Variables::ofSystemProperties) // Use system properties as base variables for interpolation
    .add("classpath:/start.properties")       // Use a classpath properties file
    .add("system:///")                        // add system properties to override
    .add("env:///")                           // add environment variables to override
    .add("cmd:///?_filter_sed=s/^-D//")       // add command line for final override using the prefix of -D
    .load();

// Give the key values to some other config framework like Spring:
var map = kvs.toMap();
ConfigurableEnvironment env = applicationContext.getEnvironment();
env.getPropertySources().addFirst(new MapPropertySource("start", map));

Now in start.properties you load more resources with special keys:

# optionally load myapp.properties from the CWD.
_load_foo=file:/./myapp.properties?_flag=optional

Project Information

Source Control

https://github.com/jstachio/ezkv

Team

Adam Gent (agentgt) - lead

Issues

https://github.com/jstachio/ezkv/issues

Community

https://github.com/jstachio/ezkv/discussions

User Guide

This document

Javadoc

This document (modules listing at bottom)

The project follows the JDK "Tip and Tail" release model and semantic versioning.

Requirements

Java 21 or greater
A build system that supports running the Java compiler annotation processor

The only module needed during runtime is java.base

Architecture

Ezkv's two major concepts are:

KeyValues - a stream of key values
KeyValuesResource - a URI with associated key-value metadata

Resources are used to load key values, and key values can be used to specify and find more resources (to load more key values). Ezkv is recursive.

For the rest of the explanation of architecture, we will go bottom-up.

KeyValue

A KeyValue object in Ezkv, unlike a Map.Entry<String, String>, carries more information than the simple tuple of key and value. A KeyValue:

Is immutable
Has the interpolated value as well as the original pre-interpolated value
Has source information
Records whether or not it should be used for interpolation
Records whether or not it should ever be printed out (e.g., a password or other sensitive information)

Ezkv provides ergonomics for working with streams of key values to filter, collect, parse, and format them.

Notice that KeyValues is like a list of key values and thus:

Order can be important
There can be duplicate "keys" (that may or may not override in the final result)

Finally, a KeyValue can be a special key that can reference another resource to load.

These keys are usually prefixed with _ to avoid collision and maximize compatibility. The most important one is _load_name, where name is the name you assign to the resource and the value is a URI. This mini DSL syntax will be configurable in the future, so you can pick different key name patterns.

Interpolation

Ezkv can perform Bash-like interpolation on a stream of KeyValues (e.g., ${somevar:-${othervar}}). It does this by using the key values themselves and Variables. Variables are simply Function<String, String>.

This allows you to interpolate on key values with data you do not want in the final result (KeyValues). For example, it is common to use System.getProperties() as variables, but often you do not want all the system properties to end up in the KeyValues.

Variables can be added to the loader, as the variables(Variables::ofSystemProperties) call in Example Usage shows.

Interpolation can be disabled with the resource flag no_interpolation.

Finally, you can load a resource as variables instead of KeyValues with the resource flag no_add.

var kvs = KeyValuesSystem.defaults()
    .loader()
    .add("classpath:/variable.properties?_flag=no_add")  // Use the variable.properties file as variables.
    .add("classpath:/start.properties")
    .load();

start.properties

# start.properties
_load_app=classpath:/${app.name:-app}.properties

If app.name is defined in variable.properties, it will be used in the _load_app URI. Otherwise, app.properties will be used.

Often, interpolation will create a new stream of KeyValues where the value of each key value is replaced with the interpolated result; however, the original value is always retained.
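To make the ${name:-fallback} syntax concrete, here is a minimal interpolation sketch written against Function<String, String> variables. It is not Ezkv's implementation. It assumes that a variable is "missing" when the function returns null, that an unresolved reference without a fallback is left as is, and it ignores escaping; the class and method names are hypothetical.

```java
import java.util.Map;
import java.util.function.Function;

final class InterpolationSketch {

    // Expands ${name} and ${name:-fallback}; the fallback may itself contain ${...}.
    static String interpolate(String text, Function<String, String> variables) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            int start = text.indexOf("${", i);
            if (start < 0) {
                out.append(text, i, text.length());
                break;
            }
            out.append(text, i, start);
            int end = findClosingBrace(text, start + 2);
            if (end < 0) { // unterminated expression: keep the rest verbatim
                out.append(text, start, text.length());
                break;
            }
            String expression = text.substring(start + 2, end);
            int sep = expression.indexOf(":-");
            String name = sep < 0 ? expression : expression.substring(0, sep);
            String value = variables.apply(name);
            if (value != null) {
                out.append(value);
            } else if (sep >= 0) {
                out.append(interpolate(expression.substring(sep + 2), variables));
            } else {
                out.append(text, start, end + 1); // assumption: leave unresolved references as is
            }
            i = end + 1;
        }
        return out.toString();
    }

    // Finds the '}' that closes an already opened "${", honoring nested "${...}".
    private static int findClosingBrace(String text, int from) {
        int depth = 1;
        for (int i = from; i < text.length(); i++) {
            if (text.startsWith("${", i)) {
                depth++;
                i++; // skip the '{' of the nested opener
            } else if (text.charAt(i) == '}' && --depth == 0) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        String uri = "classpath:/${app.name:-app}.properties";
        System.out.println(interpolate(uri, name -> null));                        // classpath:/app.properties
        System.out.println(interpolate(uri, Map.of("app.name", "myapp")::get));    // classpath:/myapp.properties
        System.out.println(interpolate("${somevar:-${othervar}}", Map.of("othervar", "x")::get)); // x
    }
}
```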

KeyValuesResource

A KeyValuesResource has a URI and a symbolic name (used to find configuration). It is backed by a key value with additional metadata on how to load that resource. URIs are designed to point at resources, and the additional metadata in a KeyValuesResource is, as you might guess, more KeyValues.

The additional metadata is used to determine how to load the key values and what metadata should be associated with each key value.

Some examples of metadata use cases include:

The key values from the resource are sensitive and should not be easily printed out.
The key values should not be interpolated because the data is raw.
The loaded key values should or should not load other key values.
The key values need their names transformed or some key values ignored.

This is all configurable through key values (and URIs), particularly using the _flags_name key.

Resource Key Value Configuration

Resource loading configuration can be done with special key values. These can either be specified in the URI of the resource or as key values within the resource where the _load_[name] is specified.

The leading underscore is called the prefix. It will be configurable at some point, as will the separator, which is also an underscore.

The key names without the prefix and separators are:

"load" - resource to load (required)
"mediaType" or "mime" - media type of the resource (not required)
"flags" or "flag" - flags for loading
"param", "parm" or "p" - parameters for custom plugins
"filter" or "filt" - filters

The next sections will cover the default key syntax within a resource or within a URI.

Resource Configuration in Resource

The default key-value pattern to specify resources is:

_load_[name]=URI
_mediaType_[name]=Content Type or file extension to resolve format of resource for parsing
_flags_[name]=CSV of flag names
_param_[name]_[key]=String
_filter_[name]_[filter]=String expression for filter

The [name] part should be replaced with a name of your choosing, where only alphanumeric characters are allowed and case is significant. This becomes the symbolic name of the resource. Don't worry: those special keys will be filtered out.

The _load_[name] is the most important key as it dictates the name of the resource and the URI of the resource.
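As a rough illustration of the key pattern above, the following sketch recognizes the special keys with a regular expression. It is not how Ezkv parses them internally; the record ResourceKey and the class name are hypothetical, and the prefix and separator are hard-coded to the default underscore.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class ResourceKeySketch {

    // kind = load, flags, param, ...; name = symbolic resource name;
    // subKey = the trailing [key] or [filter] part, or null when absent.
    record ResourceKey(String kind, String name, String subKey) {}

    // _<kind>_<name> optionally followed by _<subKey>; the name is alphanumeric only.
    private static final Pattern KEY = Pattern.compile(
        "^_(load|mediaType|mime|flags|flag|param|parm|p|filter|filt)_([A-Za-z0-9]+)(?:_(.+))?$");

    static Optional<ResourceKey> parse(String key) {
        Matcher m = KEY.matcher(key);
        if (!m.matches()) {
            return Optional.empty(); // an ordinary key value, not a resource key
        }
        return Optional.of(new ResourceKey(m.group(1), m.group(2), m.group(3)));
    }

    public static void main(String[] args) {
        System.out.println(parse("_load_foo"));               // kind=load, name=foo, subKey=null
        System.out.println(parse("_param_profiles_profile")); // kind=param, name=profiles, subKey=profile
        System.out.println(parse("database.port"));           // Optional.empty
    }
}
```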

TIP: the key aliases like "filt" and "mime" are all 4 characters which is useful for visual alignment or brevity.

Resource Configuration in URI

The URI of the _load_[name] can also contribute to resource keys using the following format:

_load_custom=file://./something?_mediaType=properties&_flags=optional&_filter_sed=s/myapp_//&_param_custom=something

Notice: The resource name does not need to be specified with URI parameters, as it is deduced.
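The idea of a URI carrying resource keys in its query string can be sketched with java.net.URI. This is an illustration only: the class and method names are hypothetical, and how Ezkv maps these parameters onto a named resource is not shown.

```java
import java.net.URI;
import java.net.URLDecoder;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class UriResourceKeysSketch {

    // Returns the query parameters that start with '_' in the order they appear.
    // A list is used (not a map) so that order and repeated parameters are kept.
    static List<Map.Entry<String, String>> resourceParameters(URI uri) {
        List<Map.Entry<String, String>> params = new ArrayList<>();
        String query = uri.getRawQuery();
        if (query == null) {
            return params;
        }
        for (String pair : query.split("&")) {
            int eq = pair.indexOf('=');
            String key = eq < 0 ? pair : pair.substring(0, eq);
            String value = eq < 0 ? "" : pair.substring(eq + 1);
            if (key.startsWith("_")) {
                params.add(Map.entry(
                    URLDecoder.decode(key, StandardCharsets.UTF_8),
                    URLDecoder.decode(value, StandardCharsets.UTF_8)));
            }
        }
        return params;
    }

    public static void main(String[] args) {
        URI uri = URI.create(
            "classpath:/app.properties?_mediaType=properties&_flags=optional&_filter_sed=s/myapp_//");
        // prints [_mediaType=properties, _flags=optional, _filter_sed=s/myapp_//]
        System.out.println(resourceParameters(uri));
    }
}
```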

Resource Flags

Resource flags can be set with _flags_[name] in the resource or on the URI with _flags. These can be repeatable parameters, and their values are combined.
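Combining repeated, comma-separated flag values can be sketched as follows. The enum is hypothetical and covers only the flag names listed in this section. Treating names as case-insensitive is an assumption based on the lowercase spellings used in this guide's examples.

```java
import java.util.EnumSet;
import java.util.Locale;
import java.util.Set;

final class ResourceFlagsSketch {

    // Hypothetical enum: only the subset of flags documented in this guide.
    enum Flag { NO_REQUIRE, SENSITIVE, NO_ADD, NO_LOAD_CHILDREN }

    // Each argument is one CSV value, e.g. from _flags_[name] or a repeated _flags URI parameter.
    static Set<Flag> parse(String... csvValues) {
        EnumSet<Flag> flags = EnumSet.noneOf(Flag.class);
        for (String csv : csvValues) {
            for (String token : csv.split(",")) {
                String name = token.trim().toUpperCase(Locale.ROOT);
                if (name.isEmpty()) {
                    continue;
                }
                // OPTIONAL is documented as an alias of NO_REQUIRE.
                // Flag.valueOf throws IllegalArgumentException for unknown names.
                flags.add("OPTIONAL".equals(name) ? Flag.NO_REQUIRE : Flag.valueOf(name));
            }
        }
        return flags;
    }

    public static void main(String[] args) {
        System.out.println(parse("sensitive,optional")); // prints [NO_REQUIRE, SENSITIVE]
        System.out.println(parse("no_add", "sensitive")); // prints [SENSITIVE, NO_ADD]
    }
}
```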

This is currently a subset of the flags:

"NO_REQUIRE" / "OPTIONAL" - Resources are usually required to exist; otherwise, an error occurs. This flag makes them not required.
"SENSITIVE" - The key values loaded from the resource will be marked as sensitive and will not be output in toString or similar methods.
"NO_ADD" - The key values will not be added to the final result but will instead be used as variables.
"NO_LOAD_CHILDREN" - The resource is not allowed to chain additional resources (e.g., it cannot use _load_ keys).

Resource Media Type

Ezkv will try to automatically determine the media type of a resource based on the URI file extension. However, in some cases, that is not sufficient. The key _mediaType_[name] allows you to explicitly specify the media type.

This is particularly useful for URIs that can load a key containing key-value data. A real-world example of this is the Spring Boot environment variable SPRING_APPLICATION_JSON.

An example of emulating that behavior in ezkv:

_load_springJson=env:///SPRING_APPLICATION_JSON?_mediaType=json

Note: JSON support is provided by the io.jstach.ezkv.json5 module.

See KeyValuesMedia section for information on how to add more media types.

Resource Filters

Filters can be applied to a resource to alter its key values after they are loaded. They are applied before the key values are parsed for more resources to load and before they are added to the final results. Thus, if the resource has child resources, the filters are not applied to the children's key values when they are loaded. Only the resource's own key values are filtered.

The ordering of filters matters, so it is generally recommended to use the URI notation, as the order is guaranteed.

The current filters provided out-of-the-box (OOB) are:

"sed" - Similar to the Unix sed utility but only supporting the s (substitute) and d (delete) commands.
"grep" - Filters keys matching a regular expression.
"join" - Joins duplicate keys (key with same name) using the expression as the join string on the values.

Filter Target

The "grep" and "sed" filters target keys by default for search and manipulation. These filters can target the value instead by suffixing the filter name with "_value" or "_val". You can also explicitly target the key with the "_key" suffix. For example, _filt_sed_key=... would target keys for search and replace. Note that the "join" filter only operates on values and combines keys, so these suffixes do not apply.

Here is an example of using grep and sed filters:

_load_env=env:///?_filter_grep_key=^MY_APP_&_filter_sed=s/^MY_APP_/myapp./

The above configuration will only load environment variables prefixed with MY_APP_ and will replace that prefix with "myapp." (including the trailing dot).

The original key name is always preserved for key tracking, so regardless of how keys are renamed, users can still identify where the key-value originally came from.
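What grep and sed do to keys can be approximated in plain Java with java.util.regex. This sketch is not the filter implementation shipped with Ezkv: the class and method names are hypothetical, only key targeting and a single substitution are shown, and original-key tracking is left out.

```java
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

final class KeyFilterSketch {

    // "grep" on keys: keep entries whose key contains a match of the expression.
    static List<Map.Entry<String, String>> grepKeys(List<Map.Entry<String, String>> kvs, String regex) {
        Pattern pattern = Pattern.compile(regex);
        return kvs.stream()
            .filter(kv -> pattern.matcher(kv.getKey()).find())
            .toList();
    }

    // "sed" s/regex/replacement/ on keys: replace the first match in each key.
    static List<Map.Entry<String, String>> sedKeys(List<Map.Entry<String, String>> kvs,
            String regex, String replacement) {
        Pattern pattern = Pattern.compile(regex);
        return kvs.stream()
            .map(kv -> Map.entry(pattern.matcher(kv.getKey()).replaceFirst(replacement), kv.getValue()))
            .toList();
    }

    public static void main(String[] args) {
        List<Map.Entry<String, String>> env = List.of(
            Map.entry("MY_APP_DB_PORT", "5432"),
            Map.entry("HOME", "/home/me"));
        // Order matters: grep first, then sed, as in the URI example.
        System.out.println(sedKeys(grepKeys(env, "^MY_APP_"), "^MY_APP_", "myapp."));
        // prints [myapp.DB_PORT=5432]
    }
}
```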

Note: To add new filters, implement KeyValuesServiceProvider.KeyValuesFilter. This interface allows you to provide custom filters.

Retaining Resource Keys

Filters like sed can often remove the special chain loading keys (the keys that are prefixed with _ by default). With the out-of-the-box filters, these keys can be retained by setting the flag "NO_FILTER_RESOURCE_KEYS", but custom filters may not honor it.

Resource Loading

The main KeyValuesLoader provided by the KeyValuesSystem takes KeyValuesResources and essentially concatenates all the found KeyValues into one KeyValues. The loader parses the key values for other resources and loads them depth first, in the order the load keys are found. For example, if we have:

_load_A=URI
_load_B=URI

Resource A and its children will be loaded before resource B and its children. Effectively, that means the key values of A and its children will appear before the key values of B and its children in the KeyValues stream. The idea is that if B has the same keys as A, B will override A on a KeyValues.toMap().
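The ordering rule can be illustrated with a toy loader over in-memory resources. Everything here is hypothetical (the mem: URIs, the class, and the method names), there is no cycle detection, and the sketch assumes a resource's own key values are emitted before those of its children, which may not match Ezkv exactly. What it does show is depth-first loading in _load_ key order and "last one wins" when collapsing to a map.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ChainLoadSketch {

    // resources: URI -> ordered key values found in that resource (in-memory stand-in).
    static List<Map.Entry<String, String>> load(String uri,
            Map<String, List<Map.Entry<String, String>>> resources) {
        List<Map.Entry<String, String>> result = new ArrayList<>();
        List<String> children = new ArrayList<>();
        for (Map.Entry<String, String> kv : resources.getOrDefault(uri, List.of())) {
            if (kv.getKey().startsWith("_load_")) {
                children.add(kv.getValue()); // remember child URIs in the order found
            } else {
                result.add(kv);
            }
        }
        for (String child : children) {
            result.addAll(load(child, resources)); // depth first: a child and its descendants
        }
        return result;
    }

    // Later entries override earlier ones with the same key.
    static Map<String, String> toMap(List<Map.Entry<String, String>> kvs) {
        Map<String, String> map = new LinkedHashMap<>();
        for (Map.Entry<String, String> kv : kvs) {
            map.put(kv.getKey(), kv.getValue());
        }
        return map;
    }

    public static void main(String[] args) {
        Map<String, List<Map.Entry<String, String>>> resources = Map.of(
            "mem:root", List.of(Map.entry("_load_A", "mem:a"), Map.entry("_load_B", "mem:b")),
            "mem:a", List.of(Map.entry("port", "1"), Map.entry("name", "a")),
            "mem:b", List.of(Map.entry("port", "2")));
        List<Map.Entry<String, String>> kvs = load("mem:root", resources);
        System.out.println(kvs);        // prints [port=1, name=a, port=2]
        System.out.println(toMap(kvs)); // prints {port=2, name=a}
    }
}
```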

EZKV provides an extension point that, in simple terms, takes a URI and loads KeyValues, usually based on the schema of the URI. For example:

classpath - Uses the JDK classloader mechanism.
file - Uses java.io/java.nio file loading.

This part of the library is extendable, and custom loaders can be manually wired or the service loader can be used.

Out of the box, Ezkv supports the following URI schemas:

"classpath" - classpath resource
"classpaths" - merges multiple classpath resources with the same name
"file" - file resource
"system" - System properties
"env" - Environment variables
"cmd" - Command line argument pairs separated by =
"stdin" - Allows Unix piping of key values, often useful for passwords
"profile." - Will load multiple resources based on a CSV of profiles where the profile name replaces part of the URI
"provider" - Will load reference config that is usually loaded from the ServiceLoader

Other URI schemas will be added in the future, usually as separate modules, such as vaults, the Kubernetes ConfigMap API, cloud metadata services, distributed key-value systems, and more.

Note: To add new loaders, implement KeyValuesServiceProvider.KeyValuesLoaderFinder to optionally load key values based on a URI and resource parameters.

Note: If no schema is provided, Ezkv assumes it is a file path, which is treated as a URI.

We won’t cover file and classpath here, as they are straightforward and self-explanatory.

URI schema: `env`, `system`, `cmd`

env:///, system:///, and cmd:/// resources have features that differ from file or classpath.

Each of these URI schemas can take a path that fetches a key value, where the value is used as the source of the key values. If no path is provided, all key values of the resource will be loaded.

For example, if we have an environment variable whose value is JSON, we can load it like this:

_load_springJson=env:///SPRING_APPLICATION_JSON?_mediaType=json

URI schema: `stdin`

stdin:/// without a path assumes that the entire contents of stdin are in java.util.Properties key-value format. If this is not the case, it is advisable to specify _mediaType= explicitly with stdin.

stdin:/// can also bind the input content, parsed as a UTF-8 string, to the key provided in the URI path.

_load_stdin=stdin:///db.password?_flag=sensitive,optional

The above example is particularly useful for passwords, similar to how Docker handles passwords from stdin.
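Binding the whole of an input stream to a single key can be sketched like this. The class and method names are hypothetical, and stripping trailing whitespace (such as the final newline from a pipe) is a convenience choice of this example, not documented Ezkv behavior.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Map;

final class StdinBindSketch {

    // Reads the entire stream as UTF-8 and binds the content to the given key.
    static Map.Entry<String, String> bind(String key, InputStream in) {
        try {
            String content = new String(in.readAllBytes(), StandardCharsets.UTF_8);
            return Map.entry(key, content.stripTrailing()); // assumption: drop trailing newline
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // A byte array stands in for System.in so the example does not block.
        InputStream fakeStdin = new ByteArrayInputStream("s3cret\n".getBytes(StandardCharsets.UTF_8));
        Map.Entry<String, String> kv = bind("db.password", fakeStdin);
        System.out.println(kv.getKey() + " has " + kv.getValue().length() + " characters");
        // prints db.password has 6 characters
    }
}
```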

Stdin is retrieved from KeyValuesEnvironment.getStandardInput(). Because reading stdin blocks the application, reading must be explicitly enabled in one of the following ways:

Setting the resource parameter "stdin" to true.
Setting the resource parameter "stdin_arg" to a command line argument whose presence enables reading.
Relying on the default, which checks whether the command line argument "--" + KeyValuesResource.name() is present.

# will be enabled if --dbpassword is passed on the command line
# since dbpassword is the resource name.
_load_dbpassword=stdin:///db.password?_flag=sensitive,optional
# specify that the "--password" command line arg is required
_load_stdin=stdin:///db.password?_param_stdin_arg=--password&_flag=sensitive
# alternatively one can do:
_load_stdin=stdin:///?_p_stdin=${some_other_variable}
# which will be enabled if some_other_variable evaluates to 'true'

URI schema: `profile.`

Note: This schema may be renamed to profiles in the future.

The profile schema is one of the more complex KeyValuesLoader implementations. It allows loading multiple resources based on a CSV list of profile names. Here's an example:

# this could come from an environment variable
PROFILES=profile1,profile2

_load_profiles=profile.classpath:/app-__PROFILE__.properties
_param_profiles_profile=${PROFILES}
_flags_profiles=no_require

The above configuration will attempt to load the following resources:

classpath:/app-profile1.properties
classpath:/app-profile2.properties

It will not fail if those resources are not found, as the flag no_require is set.

The logic behind this schema is not special or internal. The profile loader simply generates the following key values:

_load_profiles1=classpath:/app-profile1.properties
_flags_profiles1=no_require
_load_profiles2=classpath:/app-profile2.properties
_flags_profiles2=no_require

This means users can create a similar implementation themselves if desired.
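A user-written equivalent of that expansion might look like the sketch below. It only mirrors the generated key values shown above; the class and method names are hypothetical and it is not the code of the built-in profile loader.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class ProfileExpansionSketch {

    // Generates _load_/_flags_ pairs, one per profile, from a "profile." URI template.
    static List<Map.Entry<String, String>> expand(String name, String uri,
            String profilesCsv, String flags) {
        // Drop the leading "profile." so the remaining URI can be loaded normally.
        String template = uri.startsWith("profile.") ? uri.substring("profile.".length()) : uri;
        List<Map.Entry<String, String>> generated = new ArrayList<>();
        int index = 1;
        for (String profile : profilesCsv.split(",")) {
            String trimmed = profile.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            String childName = name + index++; // profiles1, profiles2, ...
            generated.add(Map.entry("_load_" + childName, template.replace("__PROFILE__", trimmed)));
            generated.add(Map.entry("_flags_" + childName, flags));
        }
        return generated;
    }

    public static void main(String[] args) {
        expand("profiles", "profile.classpath:/app-__PROFILE__.properties",
                "profile1,profile2", "no_require")
            .forEach(System.out::println);
        // prints:
        // _load_profiles1=classpath:/app-profile1.properties
        // _flags_profiles1=no_require
        // _load_profiles2=classpath:/app-profile2.properties
        // _flags_profiles2=no_require
    }
}
```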

URI schema: `provider`

The "provider" schema is a way to load default configuration provided by modules your application depends on, for example a database layer or service layer. The providers should be registered as ServiceLoader services so that they can be discovered; however, they will not be loaded unless this schema is used.

This is akin to Lightbend/Typesafe Config reference.conf but without a resource call, for maximum portability with things like GraalVM native, the Maven Shade plugin, and modular applications where the config could be an encapsulated resource (properties file), which would require the module to do the loading and not EZKV.

Example:

# It is desirable to load the providers very early if possible
# as the idea is to have the config replaced with downstream key values
_load_providers=provider:///

In our database layer we would create a class like:

public class DatabaseConfigProvider implements KeyValuesServiceProvider.KeyValuesProvider {

	@Override
	public void provide(KeyValues.Builder builder, LoaderContext context) {
		builder.add("database.port", "5245");
	}

}

And then register it using the ServiceLoader registration with the service class of KeyValuesServiceProvider. See KeyValuesServiceProvider.KeyValuesProvider for more details on proper usage.

URI schema: `classpaths`

The "classpaths" (notice the "s" on the end) schema allows loading multiple resources found on the class/modulepath with the same name.

It is analogous to Spring's classpath* support (classpath* is not a valid URI schema). However, unlike Spring, Ant-style wildcard support is not currently provided.

It can also be used to mimic Lightbend/Typesafe Config reference.conf support with classpaths:/reference.conf.

It is highly recommended that you avoid this schema; it is provided only to support mimicking other configuration systems.

It is unlikely to work in "uber jars" created by Maven Shade or similar without proper config.
It has issues in GraalVM native.
The ordering is not reliable, as it is based on the order of the classpath.
In a modular environment, encapsulation issues may be present.
For security reasons, the found resources are not allowed to load children.
But most of all, it is incredibly slow!

Instead a more reliable option is to use the provider support or just plain classpath (no "s" on end).

KeyValuesMedia

Some KeyValuesLoader implementations can directly parse the URI into KeyValues. However, many will rely on a parser to handle the data.

Ezkv provides a framework to parse and format key values from or to byte streams or strings, based on media type (also known as "Content Type" or MIME) or file extension.

This part of the library is extendable. Custom media types can be manually wired or loaded using the Java ServiceLoader.

Out of the box, Ezkv supports:

java.util.Properties format
URL Query percent encoding format

Additional formats will be added as separate modules, such as dotenv, HOCON, and Terraform/Tofu tfvars.json format.
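To give a feel for media-type-driven parsing, the sketch below dispatches on a type name and parses the two out-of-the-box formats using only the JDK. It is not the KeyValuesMedia API: the class name and the "urlencoded" label are hypothetical, and plain maps are used instead of KeyValues (so source information and duplicate keys are lost).

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.net.URLDecoder;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Properties;
import java.util.TreeMap;

final class MediaParseSketch {

    static Map<String, String> parse(String mediaType, String content) {
        return switch (mediaType) {
            case "properties" -> parseProperties(content);
            case "urlencoded" -> parseUrlEncoded(content); // hypothetical label for URL query format
            default -> throw new IllegalArgumentException("Unsupported media type: " + mediaType);
        };
    }

    private static Map<String, String> parseProperties(String content) {
        Properties properties = new Properties();
        try {
            properties.load(new StringReader(content));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // java.util.Properties does not keep file order, so sort for stable output.
        Map<String, String> result = new TreeMap<>();
        for (String name : properties.stringPropertyNames()) {
            result.put(name, properties.getProperty(name));
        }
        return result;
    }

    private static Map<String, String> parseUrlEncoded(String content) {
        Map<String, String> result = new LinkedHashMap<>();
        for (String pair : content.split("&")) {
            if (pair.isEmpty()) {
                continue;
            }
            int eq = pair.indexOf('=');
            String key = eq < 0 ? pair : pair.substring(0, eq);
            String value = eq < 0 ? "" : pair.substring(eq + 1);
            result.put(URLDecoder.decode(key, StandardCharsets.UTF_8),
                    URLDecoder.decode(value, StandardCharsets.UTF_8));
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(parse("properties", "db.port=5432\ndb.host=localhost"));
        // prints {db.host=localhost, db.port=5432}
        System.out.println(parse("urlencoded", "db.user=app&greeting=hello%20world"));
        // prints {db.user=app, greeting=hello world}
    }
}
```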

Note: To add new media types, implement KeyValuesServiceProvider.KeyValuesMediaFinder. This interface allows you to provide custom parsing logic for new formats.

KeyValuesEnvironment

Ezkv provides a facade to access system properties, environment variables, class resource loaders, stdin, command-line arguments, and logging. This is particularly useful for unit testing, but the most valuable feature is the logger.

The logger in Ezkv does nothing by default, but users can plug in their own implementations to track resource loading.

KeyValuesSystem

This is the entry point into Ezkv and is used to load the initial part of the chain of resources. The bootstrapping part of your application will call it first and will often convert or pass the loaded key values to another system.

Extensions and Integration

Most of the modules are explained below in the modules table.

Maven Plugin

EZKV has a Maven plugin that loads config as properties in a Maven build. This plugin is similar to the Codehaus properties-maven-plugin and is almost a drop-in replacement, except that it uses EZKV to load the properties. Unlike the Codehaus plugin, it allows chain loading of config.

However, like the Codehaus properties plugin, this plugin has the following limitations:

This plugin is executed when the project model is already built in memory.
Properties read from resources by this plugin cannot be used in project definitions in elements like <goal>, <version> and so on.
Properties read by this plugin in one module are not propagated to other modules or child projects.
Properties are only available to other plugins at runtime, for example to maven-resources-plugin for filtering resources.

Example Maven Plugin:

<plugin>
  <groupId>io.jstach.ezkv</groupId>
  <artifactId>ezkv-maven-plugin</artifactId>
  <version>0.5.0-SNAPSHOT</version>
  <executions>
    <execution>
      <!-- The earlier the phase the better so that more plugins can read the newer properties -->
      <phase>initialize</phase>
      <goals>
        <goal>read-project-properties</goal>
      </goals>
      <configuration>
        <urls>
          <!-- Note that maven will interpolate first if you have variables in the url -->
          <url>file:///${project.basedir}/src/main/resources/db/database.properties</url>
        </urls>
        <!-- Will replace already-set values; the default is true -->
        <override>true</override>
        <!--
        By default EZKV will use the project.basedir as the CWD.
        Maven usually sets project.basedir as the CWD for unit tests (surefire) and other plugins.
        Set this to true to use the actual system CWD (usually the terminal's current working
        directory and not always the project basedir). However, realize your build may not be
        repeatable if it is set to true.
        -->
        <realCWD>false</realCWD>
      </configuration>
    </execution>
  </executions>
</plugin>

The plugin can be disabled with the ezkv.skipLoadProperties=true property.

The plugin is a great choice for builds that use plugins connecting to databases, such as jOOQ or Flyway/Liquibase. You can load the database credentials from a variety of sources.

Modules

Module - Description

io.jstach.ezkv.boot - Ezkv Boot is a tiny opinionated configuration framework modeled after Spring Boot that uses the Ezkv KVS system to load key values into a Map ( io.jstach.ezkv:ezkv-boot:0.5.0-SNAPSHOT ).
io.jstach.ezkv.dotenv - Provides a Dotenv capable key values media parser based on https://github.com/motdotla/dotenv ( io.jstach.ezkv:ezkv-dotenv:0.5.0-SNAPSHOT ).
io.jstach.ezkv.json5 - Provides a JSON5 capable key values media parser that has zero dependencies ( io.jstach.ezkv:ezkv-json5:0.5.0-SNAPSHOT ).
io.jstach.ezkv.kvs - The EZKV core module, a non-opinionated Java configuration system that supports recursive chain loading of configuration from key-value pairs ( io.jstach.ezkv:ezkv-kvs:0.5.0-SNAPSHOT ).
io.jstach.ezkv.xml - Provides an XML capable key values media parser that depends on the JDK java.xml module ( io.jstach.ezkv:ezkv-xml:0.5.0-SNAPSHOT ).

(pronounced "ee-zee kee-vee")
Ezkv API 0.5.0-SNAPSHOT

User Guide

Contents

Description

Why and Use Case

Adding Ezkv to Your Project

Maven

Gradle

Example Usage

Project Information

Requirements

Architecture

KeyValue

Interpolation

KeyValuesResource

Resource Key Value Configuration

Resource Configuration in Resource

Resource Configuration in URI

Resource Flags

Resource Media Type

Resource Filters

Resource Loading

URI schema: `env`, `system`, `cmd`

URI schema: `stdin`

URI schema: `profile.`

URI schema: `provider`

URI schema: `classpaths`

KeyValuesMedia

KeyValuesEnvironment

KeyValuesSystem

Extensions and Integration

Maven Plugin
