cache2k User Guide

1. About

you are reading the cache2k user guide which walks through all major feature in a separate section to give an overview.

The following documentation resources are available:

API documentation:

1.1. Versioning

The JBoss versioning scheme is followed (https://developer.jboss.org/wiki/JBossProjectVersioning). Furthermore, a Tick-Tock scheme is used to mark development releases. Examples:

1.0.0.Final: Major version.
1.0.1.Final: Service release. Should be binary compatible to previous release.
1.1.1.Beta: Odd minor version, development version. A beta version may be used in production, but additional features may still change the API and may not completely tested.
1.2.0.Final: Even minor version, stable release, new features and compatible changes to the previous version. Minor releases may not be strictly binary compatible. Recompilation is needed.
2.0.0.Final: New Major version, adds and removes features, may have incompatible changes to the previous version.

1.2. How to read the documentation

The documentation is intended as a overview guide through the functionality of cache2k and will help you discover every important feature. At some points rationale or background information is given. It is not complete. You will find additional information in the API JavaDoc, in examples, and in the test cases.

A cache2k User Guide PDF Version is available as well.

1.3. Copyright

The documentation is licensed under the terms of CC BY 4.0.

2. Getting Started

2.1. Obtaining cache2k

The latest cache2k version is available on maven central. The recommended way to include it in your project is to add the following dependencies:

    <properties>
      <cache2k-version>2.6.1.Final</cache2k-version>
    </properties>

    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-api</artifactId>
      <version>${cache2k-version}</version>
    </dependency>
    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-core</artifactId>
      <version>${cache2k-version}</version>
      <scope>runtime</scope>
    </dependency>

For gradle users, the following scheme is recommended:

def cache2kVersion = '2.6.1.Final'

dependencies {
    implementation "org.cache2k:cache2k-api:${cache2kVersion}"
    runtimeOnly "org.cache2k:cache2k-core:${cache2kVersion}"
}

2.2. Using a Cache

For starting with cache2k, let’s construct a cache that looks up the preferred airline for one of our frequent flight routes.

    Cache<String, String> cache = new Cache2kBuilder<String, String>() {}
      .name("routeToAirline")
      .eternal(true)
      .entryCapacity(100)
      .build();
    // populate with our favorites
    cache.put("MUC-SFO", "Yeti Jet");
    cache.put("SFO-LHR", "Quality Air");
    cache.put("LHR-SYD", "Grashopper Lifting");
    // query the cache
    String route = "LHR-MUC";
    if (cache.containsKey(route)) {
      System.out.println("We have a favorite airline for the route " + route);
    } else {
      System.out.println("We don't have a favorite airline for the route " + route);
    }
    String airline = cache.peek(route);
    if (airline != null) {
      System.out.println("Let's go with " + airline);
    } else {
      System.out.println("You need to find one yourself");
    }

To obtain the cache a new Cache2kBuilder is created. Mind the trailing {}. There are a dozens of options that can alter the behavior of a cache. In the example above the cache gets a name and is instructed to keep entries forever via eternal(true). A name needs to be unique and may be used to find and apply additional configuration parameters and to make statistics available via JMX. Find the details about naming a cache at in JavaDoc of Cache2kBuilder.name().

The cache has a maximum capacity of 100 entries. When the limit is reached an entry is automatically removed that is not used very often. This is called eviction.

In the example Cache.put() is used to store values. The cache content is queried with Cache.peek() and Cache.containsKey().

2.3. Cache Aside

Let’s consider we have an operation called findFavoriteAirline(), that checks all our previous flights and finds the airline we liked the most. If we never did fly on that route it will ask all our friends. Of course this is very time consuming, so we first ask the cache, whether something for that flight route is already known, if not, we call the expensive operation.

    Cache<String, String> routeToAirline = new Cache2kBuilder<String, String>() {}
      .name("routeToAirline")
      .build();

    private String findFavoriteAirline(String origin, String destination) {
      // expensive operation to find the best airline for this route
      // for example, ask all friends...
    }

    public String lookupFavoriteAirline(String origin, String destination) {
      String route = origin + "-" + destination;
      String airline = routeToAirline.peek(route);
      if (airline == null) {
        airline = findFavoriteAirline(origin, destination);
        routeToAirline.put(route, airline);
      }
      return airline;
    }

The above pattern is called cache aside.

2.4. Read Through

An alternative way to use the cache is the so called read through operation. The cache configuration gets customized with a loader, so the cache knows how to retrieve missing values.

  Cache<String, String> routeToAirline = new Cache2kBuilder<String, String>() {}
    .name("routeToAirline")
    .loader(new CacheLoader<String, String>() {
      @Override
      public String load(final String key) throws Exception {
        String[] port = key.split("-");
        return findFavoriteAirline(port[0], port[1]);
      }
    })
    .build();

  private String findFavoriteAirline(String origin, String destination) {
    // expensive operation to find the best airline for this route
  }

  public String lookupFavoirteAirline(String origin, String destination) {
    String route = origin + "-" + destination;
    return routeToAirline.get(route);
  }

Now we use Cache.get to request a value from the cache, which transparently invokes the loader and populates the cache, if the requested value is missing. Using a cache in read through mode has various advantages:

No boilerplate code as in cache aside
Protection against the cache stampede (See Wikipedia: Cache Stampede)
Automatic refreshing of expired values is possible (Refresh Ahead)
Built-in exception handling like suppression and retries (see Resilience and Exceptions chapter)

2.5. Using Null Values

The simple example has a major design problem. What happens if no airline is found? Typically caches don’t allow null values. When you try to store or load a null value into cache2k you will get a NullPointerException. Sometimes it is better to avoid null values, in our example we could return a list of favorite airlines which may be empty.

In case a null value is the best choice, it is possible to store it in cache2k by enabling it with permitNullValues(true). See the Null Values chapter for more details.

2.6. Composite Keys

In the example the key is constructed by concatenating the origin and destination airport. This is ineffective for several reasons. The string concatenation allocates two temporary objects (the StringBuilder and its character array); if we need the two parts again we have to split the string again. A better way is to define a dedicated class for the cache key that is a tuple of origin and destination.

  public final class Route {
    private final String origin;
    private final String destination;

    public Route( String origin, String destination) {
      this.destination = destination;
      this.origin = origin;
    }

    public String getOrigin() {
      return origin;
    }

    public String getDestination() {
      return destination;
    }

    @Override
    public boolean equals( Object other) {
      if (this == other) return true;
      if (other == null || getClass() != other.getClass()) return false;
      Route route = (Route) other;
      if (!origin.equals(route.origin)) return false;
      return destination.equals(route.destination);
    }

    @Override
    public int hashCode() {
      int hashCode = origin.hashCode();
      hashCode = 31 * hashCode + destination.hashCode();
      return hashCode;
    }
  }

Cache keys needs to define a proper hashCode() and equals() method.

2.7. Keys Need to be Immutable

Don’t mutate keys

For a key instance it is illegal to change its value after it is used for a cache operation. The cache uses the key instance in its own data structure. When defining your own keys, it is therefore a good idea to design them as immutable object.

The above isn’t special to caching or cache2k, it applies identically when using a Java HashMap.

2.8. Mutating Values

It is illegal to mutate a cached value after it was stored in the cache, unless storeByReference is enabled. This parameter instructs the cache to keep all cached values inside the heap.

Background: cache2k stores its values in the Java heap by the object reference. This means mutating a value, will affect the cache contents directly. Future versions of cache2k will have additional storage options and allow cache entries to be migrated to off heap storage or persisted. In this case mutating cached values directly will lead to inconsistent results.

2.9. Exceptions and Caching

When using read through and a global expiry time (expireAfterWrite) is set, exceptions will be cached and/or suppressed.

A cached exception will be rethrown every time the key is accessed. After some time passes, the loader will be called again. A cached exception can be spotted by the expiry time in the exception text, for example:

org.cache2k.integration.CacheLoaderException: expiry=2016-06-04 06:08:14.967, cause: java.lang.NullPointerException

When a previous value is available a subsequent loader exception is suppressed for a short time. For more details on this behavior see the Resilience chapter.

2.10. Don’t Panic!

Also those familiar with caching might get confused by the many parameters and operations of cache2k controlling nuances of caching semantics. Except for the exceptions caching described above everything will work as you will expect from a cache. There is no need to know every feature in detail, yet. Think of them as a parachute. Usually you don’t need them, but when in trouble, there is one parameter that will save you.

Whenever in doubt: For asking questions please use the Stackoverflow tag cache2k. Please describe your scenario and the problem you try to solve first before asking for specific features of cache2k and how they might help you.

3. Types

This section covers:

How to construct a cache with concrete types
Why you should use types
How generic types are captured

3.1. Constructing a Cache with Generic Types

When using generic types, the cache is constructed the same way as already shown in the Getting started chapter.

    Cache<Long, List<String>> cache =
      new Cache2kBuilder<Long, List<String>>() {}
        .eternal(true)
        .build();

The {} is a trick which constructs an anonymous class, which contains the complete type information. If just an object would be created the complete type information would not be available at runtime and could not be used for configuring the cache.

Caches can be constructed dynamically with arbitrary generic types. The type information can be specified via the interface CacheType.

3.2. Constructing a Cache without Generic Types

If the cache types do not contain generic types, then the following simpler builder pattern can be used:

    Cache<Long, String> cache =
      Cache2kBuilder.of(Long.class, String.class)
        .eternal(true)
        .build();

The additional generated class as in the previous version is not needed, which saves a few bytes program code.

3.3. Key Type

The key type needs to implement equals() and hashCode(). Arrays are not valid for keys.

3.4. Value Type

Using arrays as values is discouraged, because some cache operations testing for value equality, like Cache.replaceIfEquals(), will not work as desired on arrays. To prevent problems, cache2k refuses to build a cache with an array value type specified at the configuration. However, this protection can be circumvented by not providing the proper type in the cache configuration.

If the value type is implementing ValueWithExpiryTime, an expiry policy is added automatically.

3.5. Untyped Caches

It is possible to construct an untyped cache via Cache2kBuilder.forUnknownTypes(). But, the use of untyped caches is discouraged. If different types need to be stored in a cache, construct a separate cache for each type with the proper type information.

3.6. Future Enhancements

Future versions of cache2k will leverage the type information for:

Optimizations depending on the type
Optional strict type checking
Optional copying
Derive an optimal marshaller for off heap overflow and persistence

4. Atomic Operations

Atomic operations allow a combination of read or compare and modify a cached entry without interference of another thread.

4.1. Example

The method Cache.replaceIfEquals() has the following semantics:

    if (cache.containsKey(key) && Objects.equals(cache.get(key), oldValue)) {
      cache.put(key, newValue);
      return true;
    } else
      return false;
    }

When executing the above statements sequentially, the outcome can vary because other threads can interfere. The atomic operation semantic guarantees that this is not possible.

These kind of operations are also called CAS (compare and swap) operations.

4.2. Built-in Atomic Operations

In general all operations on a single entry have atomic guarantee. Bulk operations, meaning operations on multiple keys, like getAll(), don’t have atomic guarantees.

Operation Description

Operation	Description
`boolean containsAndRemove(key)`	Remove and return true if something was present
`V peekAndRemove(key)`	Remove and return existing value
`boolean putIfAbsent(key, value)`	Insert value only if no value is present
`V peekAndReplace(key, value)`	Replace value and return old value
`boolean Cache.replace(key, value)`	Replace value and return true when successful
`V peekAndPut(key, value)`	Insert or update value and return the previous value
`boolean replaceIfEquals(key, expectedValue, newValue)`	Replace value only when old value is present in the cache
`boolean removeIfEquals(key, expectedValue)`	Remove only if present value matches
`V computeIfAbsent(K key, Function<? super K, ? extends V> function);`	Compute value if entry is not present

boolean containsAndRemove(key)

Remove and return true if something was present

V peekAndRemove(key)

Remove and return existing value

boolean putIfAbsent(key, value)

Insert value only if no value is present

V peekAndReplace(key, value)

Replace value and return old value

boolean Cache.replace(key, value)

Replace value and return true when successful

V peekAndPut(key, value)

Insert or update value and return the previous value

boolean replaceIfEquals(key, expectedValue, newValue)

Replace value only when old value is present in the cache

boolean removeIfEquals(key, expectedValue)

Remove only if present value matches

V computeIfAbsent(K key, Function<? super K, ? extends V> function);

Compute value if entry is not present

The ConcrurrentMap methods compute and computeIfPresent are available in the map view which can be accessed via Cache.asMap.

4.3. Entry Processor

With the entry processor it is possible to implement arbitrary complex operations that are processed atomically on a cache entry. The interface EntryProcessor must be implemented with the desired semantics and invoked on a cached value via Cache.invoke(). The bulk operation Cache.invokeAll() is available to process multiple cache entries with one entry processor.

Here is an example which implements the same semantics as replaceIfEquals():

      final K key = ...
      final V oldValue = ...
      final V newValue = ...
      EntryProcessor<K, V, Boolean> p = new EntryProcessor<K, V, Boolean>() {
        public Boolean process(MutableCacheEntry<K, V> entry) {
          if (!entry.exists()) {
            return false;
          }
          if (oldValue == null) {
            if (null != entry.getValue()) {
              return false;
            }
          } else {
            if (!oldValue.equals(entry.getValue())) {
              return false;
            }
          }
          entry.setValue(newValue);
          return true;
        }
      };
      return cache.invoke(key, p);
    }

Since it is an atomic operation multiple calls on MutableCacheEntry may have no effect if neutralising each other. For example:

     entry.setValue("xy");
     entry.setValue("abc");
     entry.remove();

The effect will be:

If an entry for the key is existing, the entry will be removed
If no entry for the key is existing, the cache state will not change
If a cache writer is attached, CacheWriter.delete() will be called in any case

Via the entry processor, it is also possible to specify an expiry time directly. Here is an example formulated as Java 8 lambda expression, which inserts a value and sets the expiry after 120 minutes:

     cache.invoke("key",
       e -> e.setValue("value")
             .setExpiry(System.currentTimeMillis() + TimeUnit.MINUTES.toMillis(120)));

5. Loading / Read Through

In read through operation the cache will fetch the value by itself when the value is retrieved, for example by Cache.get(). This is also called a loading cache or a self populating cache.

5.1. Benefits of Read Through Operation

When caching reads, using the cache in read through operation has several advantages:

No boilerplate code as in cache aside
Data source becomes configurable
Blocking load operations for identical keys, protection against the cache stampede (See Wikipedia: Cache Stampede)
Automatic refreshing of expired values (refresh ahead)
Build-in exception handling like suppression and retries (see Resilience chapter)

5.2. Defining a Loader

A loader is defined by implementing the interface CacheLoader. See the Getting Started example about read through.

interface CacheLoader<K, V> {
  V load(K key) throws Exception;
}

The loader actions may only depend on the input key parameter. In case the load operation will yield an exception it may be passed on to the cache. How exceptions are handled by the cache is defined by the resilience policy and explained in the Resilience chapter.

The JavaDoc about the CacheLoader contains additional details.

5.3. Advanced Loader

For more sophisticated load operations the AdvancedCacheLoader is available.

interface AdvancedLoader<K, V> {
  V load(K key, long currentTime, CacheEntry<K,V> currentEntry) throws Exception;
}

The information of the current entry can be used to optimize the data request. A typical example is the optimization of HTTP requests and use the request header If-Modified-Since.

The JavaDoc about the AdvancedCacheLoader contains additional details.

5.4. Asynchronous Loader and Bulk Loader

The additional interface variants AsyncCacheLoader and AsyncBulkCacheLoader are available to implement more efficient loaders. When using the bulk loader and refresh ahead the CoalescingBulkLoader can be used to combine single refresh ahead requests into one bulk request.

5.5. Concurrent Load Requests

A Cache.get or Cache.getAll will start a load and wait until the load is completed and return the result. With Cache.loadAll and Cache.reloadAll it is possible to start a concurrent load operation and get notified upon completion. If the configured loader does not support async operation, a thread pool (defined by loaderExecutor) will be used for the concurrent operation.

5.6. Invalidating

In case the data was updated in the external source, the current cache content becomes invalid. To notify the cache and eventually update the cached value several options exist.

Cache.remove(key): Invalidating an entry with Cache.remove() will cause the entry to be removed from the cache ultimately. The next time the entry is requested with Cache.get(key) the cache will invoke the loader (if defined). In case the loader yields an exception, this exception will be propagated to the application since there is no previous value for fallback. Cache.remove(key) is useful if the data is outdated and old data is not allowed to be delivered. Cache.remove() will also invoke CacheWriter.delete(), if specified. Priority is on data consistency.
Cache.expireAt(key, Expiry.NOW): The entry is expired immediately.
Cache.expireAt(key, Expiry.REFRESH): When invalidating an entry via Cache.expireAt(key, Expiry.REFRESH) the loader gets invoked instantly if refresh ahead is enabled. If the loader is invoked, the current value will stay visible until the updated value is available. If the loader cannot be invoked, the entry is expired. The operation will have no effect, if there is no cached entry associated with the key. The value is still available in the cache as fallback if a loader exception occurs. This variant is the better choice if outdated values are allowed to be visible and the cache should continuously serve data. Priority is on availability.

5.7. Transparent Access

When using the cache in read through and/or in write through operation, some methods on the cache will often be misinterpreted and present pitfalls. For example, the method Cache.containsKey will not return true when the value exists in the system of authority, but only reflects the cache state.

To prevent pitfalls a reduced interface is available: KeyValueSource.

6. Expiry and Refresh

A cached value may expire after some specified time. When expired, the value is not returned by the cache any more. The cache may be instructed to do an automatic reload of an expired value, this is called refresh ahead.

Expiry does not mean that a cache entry is removed from the cache. The actual removal and notification of event listeners from the cache may lag behind the time of expiry.

6.1. Parameter and Feature Overview

This section contains a brief overview of available features and configuration parameters.

6.1.1. Eternal

If the cached data does not need to be updated it is logically never expiring. A simple example would be a cache that contains formatted time stamps Cache<Date,String>. Generating or loading the value again, would always lead to the same result. If that is the case, it is a good idea to set eternal(true). Since no expiry is the default this will not alter the cache semantics, however, setting eternal documents that expiry is never needed. The cache will not initialize any timer data structures.

6.1.2. Expiry after write / time to live

Expiry after an entry is created or modified can be specified via the expireAfterWrite parameter. This is also known as time to live. It is possible to specify different expiry values for creation and modification with a custom ExpiryPolicy. More about ExpiryPolicy is explained below.

Lag time: Expiry is lagging by default, meaning that the mapping disappears shortly after the configured duration. If an entry needs to expire at an exact point in time, this can be achieved with the ExpiryPolicy and the sharpExpiry parameter, see Sharp Expiry. The parameter timerLag is available to configure the lag of timer event processing.

Exceptions: When an expiry duration is specified via expireAfterWrite, resilience features are automatically active. See the Resilience and Exceptions chapter.

6.1.3. Expiry after access / time to idle

There is no expireAfterAccess parameter in cache2k as it can be found in other caches. As an alternative, the Idle Scan feature can be used, which is typically faster and more effective.

If needed, it is possible to realize exact TTI semantics by updating the expiry time with every cache operation. This also allows variable expiry after access. However, this is consuming more resources then idle scanning. See Time To Idle Example.

6.1.4. Idle Scan - an Expiry After Access Alternative

Expiry after access or time to idle is a feature often found in caches and means that the entry will be removed after a period of no activity. Typically this is used to shrink or grow the cache size depending on the activity.

cache2k does not provide the Expiry after access or time to idle feature, because its implementation would slow down cache access times dramatically. The parameter idleScanTime can be used to achieve similar effects, without any extra overhead for a cache access. This is achieved by coupling the feature with the eviction logic and reusing tracking data for the eviction that we have anyways.

More information can be found in the JavaDoc of and in the Github issue #39

If the application needs exact "Expiry After Access" semantics this can be achieved by updating the expiry time on each access via Cache.expireAt or MutableCacheEntry.setExpiryTime.

6.1.5. Variable Expiry

Each entry may have a different expiry time. This can be achieved by specifying an ExpiryPolicy. The ExpiryPolicy calculates a point in time, when a value expires. The configuration parameter expireAfterWrite is used as a maximum value.

6.1.6. Variable Expiry for Exceptions

If the loader yields an exception ExpiryPolicy is not called. Its possible to set a variable expiry for exceptions via the ResiliencePolicy. See the Resilience and Exceptions chapter.

6.1.7. Sharp Expiry

In default operation mode, the cache does not check the time when an entry is accessed. An entry expires when the timer event is processed, which usually lags behind up to a second.

In case there is a business requirement that data becomes invalid or needs refreshed at a defined point in time the parameter sharpExpiry can be enabled. This will cause that the expiry happens exactly at the point in time determined by the expiry policy. For more details see the JavaDoc or Cache2kBuilder.sharpExpiry and ExpiryPolicy.

The actual removal of an expired entry from the cache and the notification of expiry listeners will always lag behind the expiry time.

If sharp expiry and refresh ahead is both enabled, the contract of refresh ahead is stricter. The resulting semantics will be:

Entries will expire exactly at the specified time
A refresh starts at expiry time
contains() is false, if the entry is expired and not yet refreshed
A get() on the expired entry will stall until refreshed

Sharp expiry and normal, lagging expiry can be combined. For example, if the parameter expiryAfterWrite and an ExpiryPolicy is specified and sharpExpiry is enabled. The sharp expiry will be used for the time calculated by the ExpiryPolicy, but the duration in expireAfterWrite is used if this will be sooner. If the expiry is result of a duration calculation via expireAfterWrite sharp expiry of the entry will not be enabled.

6.1.8. Resetting the Expiry of a Cache Value

The expiry value can be reset with the method Cache.expireAt(key, time). Some special values exist:

Table 1. `Cache.expireAt()` constants
constant	meaning
`Expiry.NOW`	The value expires immediately. An immediate load is triggered if refreshAhead is enabled.
`Expiry.REFRESH`	An immediate load is triggered if refreshAhead is enabled. If loading is not possible the value expires.
`Expiry.ETERNAL`	keep indefinitly or to a maximum of whats set with via `expireAfterWrite`

It is possible to atomically examine a cached entry and update its expiry with the EntryProcessor and MutableCacheEntry.setExpiry().

6.1.9. Wall Clock and Clock Skew

For timing reference the Java System.currentTimeMillis() is used. As with any application that relies on time, it is good practice that the system clock is synchronized with a time reference. When the system time needs to be corrected, it should adapt slowly to the correct time and keep continuously ascending.

In case a clock skew happens regularly a premature or late cache expiry may cause troubles. It is possible to do some countermeasures. If the time decreases, entries may expire more early. This can be detected and with the AdvancedCacheLoader the previously loaded value can be reused. If there is a time skew forward, expiry can be triggered programmatically with expireAt().

6.2. Examples

6.2.1. Expiry after a constant time span

All values will expire after 5 minutes with a lag time of around a second.

    Cache<Key, Data> cache = new Cache2kBuilder<Key, Data>() {}
      .loader(k -> new Data())
      .expireAfterWrite(5, TimeUnit.MINUTES)
      .build();

6.2.2. Expiry after an exact constant time span

If needed, it is also possible to expire without lag. Values will expire exactly after 5 minutes:

    Cache<Key, Data> cache = new Cache2kBuilder<Key, Data>() {}
      .loader(k -> new Data())
      .sharpExpiry(true)
      .expiryPolicy((key, value, startTime, currentEntry)
        -> startTime + TimeUnit.MINUTES.toMillis(5))
      .build();

Data older than 5 minutes will not be returned by the cache, however, the actual removal or execution of expiry listeners will still lag.

6.2.3. Variable Expiry Based on Value

The maximum a value is allowed to be cached can be extracted from the cached value, for example:

  class DataWithMaxAge {
    long getMaxAgeMillis() { return ... }
  }

To instruct the cache to use it, an expiry policy can be specified:

    Cache<Key, DataWithMaxAge> cache = new Cache2kBuilder<Key, DataWithMaxAge>() {}
      .loader(k -> ... )
      .expiryPolicy((key, value, startTime, currentEntry) -> startTime + value.getMaxAgeMillis())
      .build();

6.2.4. Time To Idle Example

Semantics of time to idle can be replicated by setting the a new expiry value upon every cache access.

    private Cache<K, V> cache;

    public void put(K key, V value) {
      cache.mutate(key, entry -> {
        entry.setValue(value);
        entry.setExpiryTime(entry.getStartTime() + expireAfterAccess);
      });
    }

    public V get(K key) {
      return cache.invoke(key, entry -> {
        entry.setExpiryTime(entry.getStartTime() + expireAfterAccess);
        return entry.getValue();
      });
    }

6.2.5. More Examples

Please look for the file ExpiryPolicyExampleTest in the code repository.

7. Refresh Ahead

With refresh ahead (or background refresh), in a read through configuration, values about to expire will be refreshed automatically by the cache.

7.1. Setup

Refresh ahead can be enabled via refreshAhead switch. The number of threads used for refreshing is configured by loaderThreadCount(). A (possibly shared) thread pool for refreshing can be configured via prefetchExecutor().

7.2. Semantics

The main purpose of refresh ahead is to ensure that the cache contains fresh data and that an application never is delayed when data expires and needs a reload. This leads to several compromises: Expired values will be visible until the new data is available from the load operation, more then the needed requests will be send to the loader.

After the expiry time of a value is reached, the loader is invoked to fetch a fresh value. The old value will be returned by the cache, although it is expired, and will be replaced by the new value, once the loader is finished. When a refresh is needed and not enough loader threads are available, the value will expire immediately and the next get() request will trigger the load.

Once refreshed, the entry is in a trail period. If it is not accessed until the next expiry, no refresh will be done, the entry expires and will be removed from the cache. This means that the time an entry stays within the trail period is determined by the configured expiry time or the the ExpiryPolicy.

Refresh ahead only works together with the methods invoking the loader, for example get() and getAll(). After a value is refreshed, an entry will not be visible with containsKey or peek. The first call to get() or load() on a previously refreshed item will make the loaded value available in the cache.

7.3. Sharp Expiry vs. Refresh Ahead

The setting sharpExpiry conflicts with the idea of refresh ahead. When using refresh ahead and sharp expiry in combination, the value will expire at the specified time and the background refresh is initiated. When the application requests the value between the expiry and before the new value is loaded, it blocks until the new value is available.

Combining sharp expiry, lagging expiry and refresh ahead, leads to an operation mode that cannot guarantee the sharp expiry contract under all circumstances. If an ongoing load operation is not finished before a sharp expiry point in time, non-fresh data will become visible.

Sharp timeout can also applied on a dynamic per entry basis only when needed.

7.4. Rationale: No separate refresh timing parameter?

Caches supporting refresh ahead typically have separate configuration parameters for its timing. In cache2k, refreshing is done when the value would expire, which is controlled by the expiry policy and expireAfterWrite parameter. Why? It should be possible to enable refresh ahead with a single switch. Refreshing and expiring are two sides of the same coin: When expired, we need to refresh.

7.5. Future Outlook

We plan several improvements for refresh ahead support, see the issue tracker

The effects on the event listeners and statistics when refreshing may change in the future.

8. Null Values

While a HashMap supports null keys and null values most cache implementations and the JCache standard do not. By default, cache2k does not permit null values, to avoid surprises, but storing null values can be allowed with a configuration parameter.

8.1. The Pros and Cons of Nulls

A good writeup of the topic can be found at Using and Avoiding Null Explained. The bottom line is, that for a map it is always better to store no mapping instead of a mapping to a null value. For a cache it is a different story, since the absence of a mapping can mean two things: The data was not requested from the source yet, or, there is no data.

8.2. Negative Result Caching

Caching that there is no result or a failure, is also called "negative result caching" or "negative caching". An example use case is the request of a database entry by primary key, for example via JPA’s EntityManager.find() which returns an object if it is available in the database or null if it is not. Caching a negative result can make sense when requests that generate a negative result are common.

In a Java API negative results are quite often modeled with a null value. By enabling null support in cache2k no further wrapping is required.

8.3. Alternatives

In a JCache application a null from the CacheLoader means the entry should be removed from the cache. This semantic is a consistent definition, but if Cache.get() is used to check whether data is existing, no caching happens if no data is present. A null value is passed through consistently, however, the cache performs badly if a null response is common.

Being able to store a null value is no essential cache feature, since it is always possible to store a wrapper object in the cache. However, with the null support in cache2k, it is possible to store a null value with no additional overhead.

8.4. Default Behavior

By default, every attempt to store a null in the cache will yield a NullPointerException.

In case peek() returns null, this means there is no associated entry to this key in the cache. The same holds for get() if no loader is defined. For one point in time and one key there is the following invariant: cache.contains(key) == (cache.peek(key) != null).

8.5. How to Enable Null Values

Storing of null values can be enabled by permitNullValues(true). Example:

    Cache<Integer, Person> cache =
      new Cache2kBuilder<Integer, Person>(){}
        .name("persons")
        .entryCapacity(10000)
        .expireAfterWrite(5, TimeUnit.MINUTES)
        .permitNullValues(true)
        .build();

8.6. How to Operate with Null Values

When null values are legal, additional care must be taken. The typical cache aside pattern becomes invalid:

    Cache<Integer, Person> c = ...
    Person lookupPerson(int id) {
      Person p = cache.peek(id);
      if (p == null) {
        p = retrievePerson(id);
        cache.put(id, p);
      }
      return p;
    }

In case retrievePerson returns null for a non-existing person, it will get called again the next time the same person is requested. To check whether there is a cached entry containsKey could be used. However, using containsKey() and get() sequentially is faulty. To check for the existence of a cache entry and return its value the method peekEntry (or getEntry with loader) can be used. The fixed version is:

    Cache<Integer, Person> c = ...
    Person lookupPerson(int id) {
      CacheEntry<Person> e = cache.peekEntry(id);
      if (e != null) {
        return e.getValue();
      }
      p = retrievePerson(id);
      cache.put(id, p);
      return p;
    }

The cache aside pattern serves as a good example here, but is generally not recommended. Better use read through and define a loader.

8.7. Loader and Null Values

If a loader is defined a call of Cache.get() will either return a non-null value or yield an exception. If the loader returns null, a NullPointerException is generated and wrapped and propagated via a CacheLoaderException. This behavior is different from JSR107/JCache which defines that an entry is removed, when the loader returns null.

In case the loader returns null and this should not lead to an exception the following options exist:

Always return a non-null object and include a predicate for the null case, use a list or Java 8 Optional
Enable null value support
Remove entries from the cache when the loader returns null (as in JCache)

The expiry policy can be used to remove entries from the cache when the loader returns null:

    ...
    builder.expiryPolicy(new ExpiryPolicy<Integer, Person>() {
      @Override
      public long calculateExpiryTime(K key, V value, long loadTime, CacheEntry<Integer, Person> oldEntry) {
        if (value == null) {
          return NO_CACHE;
        }
        return ETERNAL;
      }
    })
    ...

This works, since the cache checks the null value only after the expiry policy has run and had decided to store the value.

8.8. Performance

Storing null values has no additional memory or CPU overhead.

8.9. Rationale

8.9.1. Why support `null`?

Supporting null needs a more careful design inside the cache and its API. When this is done, it basically comes for free and makes the cache very effective for use cases where null values are common.

8.9.2. Why is rejecting `null` values the default?

We were using cache2k for 16 years, with the capability to store null values by default. For the 1.0 version we changed this behavior and don’t allow nulls. Here is why:

Most caches do not support null values. Allowing null by default may lead to unexpected and incompatible behavior.
Use cases with null are rare.
Returning or storing a null may be a mistake most of the time.
In case a null is allowed it is better to specify this explicitly to make the different behavior more obvious.

8.9.3. Why rejecting `null` from the loader?

If the loader returns null, a NullPointerException is generated and propagated via the CacheLoaderException. This behavior is different from JSR107/JCache which defines that an entry is removed, if the loader returns null.

The JCache behavior is consistent, since a get() in JCache returns null only in the case that no entry is present. The JCache behavior is also useful, since nulls from the loader pass through transparently. But as soon as nulls are passed through regularly, the cache is rendered useless, since a null from the loader means "no caching". This will be unnoticed during development but will lead to performance trouble in production.

In cache2k there are different options when null comes into play. A failure by default will, hopefully, lead to an explicit choice for the best option.

9. Exceptions and Resilience

This section documents the behavior since cache2k version 2.0.

In a read through mode (with a CacheLoader) the cache can be configured to tolerate temporary loader failures with the ResiliencePolicy.

Without a specified ResiliencePolicy the cache will only keep successfully loaded values. If a load yields an exception the load is retried immediately when the value is requested again. With the ResiliencePolicy it is possible to keep an exception in the cache for some time and stop sending new requests for that particular key, to not overflow a faulty data source and minimize local resource and network usage. It is also possible to suppress exception for some time, in case a loaded value is present from a previous load.

The core of cache2k contains all the infrastructure to deal with exceptions. Whenever a cached value is requested, e.g. via Cache.get(), CacheEntry.get() and also Cache.peek() the cache would rethrow a previous loader exception. Each cache call generates a would generate a new CacheLoaderException which contains the original exceptions as cause. This can be controlled via the ExceptionPropagator. In case an exception was thrown by the loader the entry contains a LoadExceptionInfo which contains useful information for decision making in the ResiliencePolicy or to propagate via the ExceptionPropagator.

9.1. Using UniversalResiliencePolicy

cache2k ships with a ResiliencePolicy implementation called UniversalResiliencePolicy that is useful in typical scenarios. The rest of this chapter describes the usage of UniversalResiliencePolicy.

To learn more on how to implement your own policy, study the Java Doc of ResiliencePolicy and the source code of the UniversalResiliencePolicy. Rather then reading the documentation here, it might be easier to look at the source code and examples. Alternatively, start and experiment with the configuration examples at the end of the chapter.

Add the cache2k-addon module to your application:

        <dependency>
            <groupId>org.cache2k</groupId>
            <artifactId>cache2k-addon</artifactId>
            <version>${cache2k-version}</version>
        </dependency>

To enable the policy with the buider use:

      builder.setup(UniversalResiliencePolicy::enable)

To set additional parameters to the policy:

      builder.setupWith(UniversalResiliencePolicy::enable, b -> b
        .resilienceDuration(5, TimeUnit.SECONDS)
      )

Depending on preference the policy can also enabled by default for all caches and then disabled on individual cache basis.

9.1.1. UniversalResiliencePolicy Default Behavior

When enabled and without additional parameters, the UniversalResiliencePolicy starts caching and suppressing exceptions in read through operation, when a expiry time (expireAfterWrite) is set.

org.cache2k.integration.CacheLoaderException: expiry=2016-06-04 06:08:14.967, cause: java.lang.NullPointerException

Cached exceptions can be misleading, because you may see 100 exceptions in your log, but only one was generated from the loader. That’s why the expiry of an exception is typically shorter then the configured expiry time. When a previous value is available a subsequent loader exception is suppressed for a short time.

If no expiry is specified or eternal(true) is specified, all exceptions will be propagated to the client. The loader will be called immediately again, if the key is requested again. This is identical to no policy.

9.1.2. Expiry and resilience duration

When an expiry time is specified, the UniversalExpiryPolicy enables the resilience features.

If a load yields an exception and there is data in the cache: The exception will not be propagated to the client, and the cache answers requests with the current cache content. Subsequent reload attempts that yield an exception, will also be suppressed, if the time span to the first exception is below the resilience duration setting.

If the loader produces subsequent exceptions that is longer then the resilience duration, the exception will be propagated. The resilience duration can be set with the parameter resilienceDuration, if not set explicitly it is identical to the expiryAfterWrite time span.

9.2. Retry

After an exception happens the UniversalResiliencePolicy will do a retry to load the value again. The retry is started after the configured retry interval (retryInterval), or, if not explicitly configured after 5% of the resilience duration. The load is started when the client accesses the value again, or the cache is doing this by itself if refreshAhead is enabled.

To keep the system load in limits in the event of failure, the duration between each retry increases according to an exponential backoff pattern by the factor of 1.5. Each duration is further randomized between one half and the full value. For example, an expireAfterWrite set to 200 seconds will lead to an initial retry time of 10 seconds. If exceptions persist the retry time will develop as follows:

Table 2. Retry intervals with exponential backoff starting at 10 seconds
retry after
10 seconds
15 seconds
22.5 seconds
33.75 seconds

When reaching the configured expiry the cache will retry at this time interval and not increase further. The start retry interval and the maximum retry interval can be specified by retryInterval and maxRetryInterval.

9.3. Exception Propagation

If an exception cannot be suppressed, it is propagated to the client immediately. The retry attempts follow the same pattern as above.

When propagated, a loader exception is wrapped and rethrown as CacheLoaderException. A loader exception is potentially rethrown multiple times, if the retry time is not yet reached. In this situation a rethrown exception contains the text expiry=<timestamp>. This behavior can be customized by the ExceptionPropagator.

9.4. Invalidating and reloading

An application may need to invalidate a cache entry, so the cache will invoke the loader again the next time the entry is requested. How the value should be invalidated depends on the usage scenario and whether availability or consistency has to be priority.

To be able to use the resilience features and increase availability in the event of failure the method expireAt should be preferred for invalidation. See the detailed discussion in the loading chapter.

9.5. Entry Status and `containsKey`

In case an exception is present, the method containsKey will return true, the methods putIfAbsent and computeIfAbsent act consequently. This means pufIfAbsent can not be used to overwrite an entry in exception state with a value.

To examine the state of an entry, e.g. whether it contains a value or exception, the method Cache.peekEntry can be used. To examine the state of an entry and modify it, the entry processor can be used.

9.6. Configuration options

To customize the behavior the following options exist. In the base configuration:

expireAfterWrite: Time duration after insert or updated an cache entry expires
resiliencePolicy: Sets a custom resilience policy to control the cache behavior in the presence of exceptions
exceptionPropagator: Sets a custom behavior for exception propagation
refreshAhead: Either the option refreshAhead or keepDataAfterExpired must be enabled to do exception suppression if an expiry is specified
keepDataAfterExpired: Either the option refreshAhead or keepDataAfterExpired must be enabled to do exception suppression if an expiry is specified

The UniversalResiliencePolicy has additional parameters in UniversalResilienceConfig:

resilienceDuration: Time span the cache will suppress loader exceptions if a value is available from a previous load. Defaults to expiredAfterWrite
minRetryInterval: The minimum interval after a retry attempt is made. Defaults to 0
maxRetryInterval: The maximum interval after a retry attempt is made. Defaults to resilienceDuration
retryInterval: Initial interval after a retry attempt is made. Defaults to 10% (or retryPercentOfResilienceDuration) of mayRetryInterval, or a minimum of 2 seconds.
backoffMultiplier: Factor to increase retry time interval after a consequtive failure

The timing parameters may be derived from expireAfterWrite. This is controlled by:

retryPercentOfResilienceDuration: Percentage from expireAfterWrite to use for the retryInterval, default is 10%

9.7. Examples

9.7.1. No expiry

Values do not expire, exceptions are not suppressed. After an exception, the next Cache.get() will trigger a load.

    Cache<Integer, Integer> c = new Cache2kBuilder<>() {}
      .eternal(true)
      /* ... set loader ... */
      .build();

9.7.2. Expire after 10 minutes with resilience

Values expire after 10 minutes. Exceptions are suppressed for 10 minutes as well, if possible. A retry attempt is made after 1 minute. If the cache continuously receives exceptions for a key, the retry intervals are exponentially increased up to a maximum interval time of 10 minutes.

The UniversalResiliencePolicy derives its parameters from the expireAfterWrite setting.

    Cache<Integer, Integer> c = new Cache2kBuilder<>() {}
      .expireAfterWrite(10, TimeUnit.MINUTES)
      .keepDataAfterExpired(true)
      .setup(UniversalResiliencePolicy::enable)
      /* ... set loader ... */
      .build();

9.7.3. Reduced suppression time

Expire entries after 10 minutes. If an exception happens we do not want the cache to continue to service the previous (and expired) value for too long. In this scenario it is preferred to propagate an exception rather than serving a potentially outdated value. On the other side, there may be temporary outages of the network for a maximum of 30 seconds we like to cover for.

    Cache<Integer, Integer> c = new Cache2kBuilder<Integer, Integer>() {}
      .expireAfterWrite(10, TimeUnit.MINUTES)
      .setupWith(UniversalResiliencePolicy::enable, b -> b
        .resilienceDuration(30, TimeUnit.SECONDS)
      )
      .keepDataAfterExpired(true)
      /* ... set loader ... */
      .build();

9.7.4. Cached exceptions

No suppression, because values never expire. The only way that a reload can be triggered is with a reload operation. In this case we do not want suppression, unless specified explicitly. The loader is not totally reliable, or a smart developer uses an exception to signal additional information. If exceptions occur, the cache should not be ineffective and keep exceptions and defer the next retry for 10 seconds. For requests between the retry interval, the cache will rethrow the previous exception. The retry interval does not increase, since a maximum timer interval is not specified.

   Cache<Integer, Integer> c = new Cache2kBuilder<Integer, Integer>() {}
      .eternal(true)
      .setupWith(UniversalResiliencePolicy::enable, b -> b
        .retryInterval(10, TimeUnit.SECONDS)
      )
      /* ... set loader ... */
      .build();

9.8. Debugging loader exceptions

The cache has no support for logging exceptions. If this is needed, it can be achieved by an adaptor of the CacheLoader.

9.9. Available statistics for resilience

The Statistics expose counters for the total number of received load exceptions and the number of suppressed exceptions.

10. Event Listeners

The cache generates events when an entry is inserted or updated.

10.1. Listening to Events

Event listeners can be added via the cache builder, for example:

    Cache2kBuilder.of(Integer.class, Integer.class)
      .addListener(new CacheEntryCreatedListener<Integer, Integer>() {
        @Override
        public void onEntryCreated(final Cache<Integer, Integer> cache,
                                   final CacheEntry<Integer, Integer> entry) {
          System.err.println("inserted: " + entry.getKey());
        }
      });

Different listener types are available for insert, update, removal and expiry. The is currently no possibility to listen to an eviction.

Listeners are executed synchronous, meaning the cache operation will not complete until all listeners are run. The expiry event is always asynchronous.

It is illegal to mutate cache values inside the listeners.

10.2. Async Listeners

Listeners will be executed asynchronously when added with addAsyncListener(). By default a shared unbounded executor is used. A custom executor can be set via asyncListenerExecutor.

The cached value is not copied during the cache operation. If a value instance is mutated after it was handed over to the cache, asynchronous listeners may not see the value as it was present during the cache operation.

11. Configuration via XML

cache2k supports an XML configuration file. The configuration file can contain additional settings that should not be "buried" inside the applications code.

11.1. Using the XML Configuration

The configuration file for the default cache manager is located at /cache2k.xml in the class path. Here is an example:

<!--
  #%L
  cache2k config file support
  %%
  Copyright (C) 2000 - 2022 headissue GmbH, Munich
  %%
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

       http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License.
  #L%
  -->
<cache2k xmlns:xsi='http://www.w3.org/2001/XMLSchema-instance'
         xmlns='https://cache2k.org/schema/v2.x'
         xsi:schemaLocation="https://cache2k.org/schema/v2.x https://cache2k.org/schema/cache2k-v2.x.xsd">
<!-- The configuration example for the documentation -->
  <version>1.0</version>
  <skipCheckOnStartup>true</skipCheckOnStartup>
  <properties>
    <smallCacheCapacity>12_000</smallCacheCapacity>
    <userHome>${env.HOME}</userHome>
  </properties>
  <defaults>
    <cache>
      <entryCapacity>100_000</entryCapacity>
    </cache>
  </defaults>
  <templates>
    <cache>
      <name>regularExpiry</name>
      <expireAfterWrite>5m</expireAfterWrite>
      <resiliencePolicy>
        <bean>
          <type>org.cache2k.addon.UniversalResilienceSupplier</type>
        </bean>
      </resiliencePolicy>
    </cache>
    <cache>
      <name>lessResilient</name>
      <sections>
        <resilience>
          <resilienceDuration>1m</resilienceDuration>
        </resilience>
      </sections>
    </cache>
  </templates>
  <caches>
    <cache>
      <name>users</name>
      <entryCapacity>${top.properties.user.smallCacheCapacity}</entryCapacity>
      <loader>
        <byClassName>
          <className>org.example.MyLoader</className>
        </byClassName>
      </loader>
    </cache>
    <cache>
      <name>products</name>
      <include>regularExpiry,lessResilient</include>
    </cache>
  </caches>
</cache2k>

11.2. Combine Programmatic and XML Configuration

The XML configuration can provide a default setup and a specific setup for a named cache. The specific setup from the XML configuration is applied after setup from the builder, thus overwriting any defaults or settings via the builder from the program code.

When a cache is created via the builder it needs to have mandatory properties on the programmatic level:

Type for key and value
Cache name
Manager with classloader, if another manager or classloader should be used

It is recommended to do settings in the builder, that belong to the application code, for example:

eternal(true): if no expiry is needed, in case the values are immutable or never change
eternal(false): if entries need to expire, but no specific time is set on programmatic level
permitNullValues(true): if the application needs to store nulls in the cache
storeByReference(true): if the application relies on the fact that the objects are only stored in the heap and not copied.

For example, the cache is created in the code with:

    Cache<String, String> b =
      new Cache2kBuilder<String, String>(){}
        .name("various")
        .eternal(true)
        .permitNullValues(true)
        .build();

In the configuration file only the capacity is altered:

  <cache>
    <name>various</name>
    <entryCapacity>10K</entryCapacity>
  </cache>

11.3. Reference Documentation

11.3.1. File Location

The configuration for the default cache manager (CacheManager.getInstance()) is expected in the class path at /cache2k.xml. If a different class loader is used to create the cache manager, that is used to look up the configuration. If multiple cache managers are used, each cache manager can have its own configuration, which is looked after at /cache2k-${managerName}.xml.

11.3.2. Options

Some options control how the configuration is interpreted.

version: Version which controls how the configuration is interpreted. Needed for possible future changes. Always 1.0 at the moment.
ignoreMissingCacheConfiguration: If true, allows that there is no cache configuration present for a created cache. All the configuration can be provided at programmatic level, also the defaults from the configuration apply.
defaultManagerName: Set another name for the default cache manager, default is "default".
ignoreAnonymousCache: If true, allows cache without name. If a cache has no name a special configuration cannot be applied. The default is false, enforcing that all caches are named on the programmatic level.
skipCheckOnStartup: Do not check whether all cache configurations can be applied properly at startup. Default is false.

11.3.3. Default Configuration

A default configuration may be provided in defaults.cache (see example above). The defaults will be used for every cache created in the cache manager.

11.3.4. Templates

Multiple template configurations can be provided under templates. Templates have a name. In the cache configuration, a template can be included via include. Multiple templates can be included when separated with comma.

Templates can be used for other configuration sections as well.

11.3.5. Parameters

The values may contain the parameters in the the style ${scope.name}. A parameter name starts with a scope. Predefined scopes are:

env: environment variable, e.g. ${env.HOME} is the user home directory.
top: references the configuration root, e.g. ${top.caches.flights.entryCapacity} references the value of the entryCapacity of the cache named flights.
sys: a Java system property, e.g. ${sys.java.home} for the JDK installation directory.

The scope prefix can also reference a parent element name. If the scope prefix is empty an value at the same level is referenced.

A configuration can contain user defined properties in the properties.user section.

11.3.6. Primitive Types

The configuration supports basic types.

Table 3. Supported Types in XML Configuration
type	example	description
boolean	`true`	Boolean value either true of false
int	`4711`	Integer value
long	`20_000MiB`	Long value with optional suffixes (see below)
String	`alice`	A string value

For additional convenience the long type supports a unit suffix:

suffix	value
KiB	1024
MiB	1024^2
GiB	1024^3
TiB	1024^4
k	1000
M	1000^2
G	1000^3
T	1000^4
s	1000
m	1000*60
h	10006060
d	10006060*24

suffix

value

KiB

1024

MiB

1024^2

GiB

1024^3

TiB

1024^4

1000

1000^2

1000^3

1000^4

1000

1000*60

1000*60*60

1000*60*60*24

A long value may also contain the character '_' for structuring. This character is ignored. Example: 12_000_000. The unit suffix is intended to make the configuration more readable. There is no enforcement that a unit actually matches with the intended unit of the configuration value.

11.3.7. Sections

The cache configuration may contain additional sections. At the moment only the section jcache is available which tweaks JCache semantics.

11.3.8. Customizations

Customizations, for example, loaders, expiry policy and listeners, may be configured. The simplest method is to specify a class name of the customization that gets created when the cache is build (see also example above).

  <loader>
    <byClassName>
      <className>org.example.MyLoader</className>
    </byClassName>
  </loader>

It is also possible to implement an own CustomizationSupplier which can take additional parameters for additional configuration of the customization. In this case the type element is used to specify the supplier class.

  <loader>
    <bean>
      <type>org.exmample.LoaderSupplier</type>
      <!-- Additional bean properties to set on the supplier follow. -->
      <parameters>
        <database>jdbc://....</database>
      </parameters>
    </bean>
  </loader>

11.4. Rationale and Internals

Since version 1.2 the XML configuration is included in the cache2k-core jar file. In a Java SE environment the default SAX parser is used. In an Android environment the XML pull parser is used. There are no additional dependencies to other libraries.

Most of the configuration processing is not limited to XML. The general structure of the configuration can be represented in YAML or JSON as well. This is why we do not use any XML attributes. Readers for other formats can implement the ConfigurationTokenizer.

The configuration code is mostly generic and uses reflection. In case new properties or configuration classes are added, there is no need to update configuration code. This way the extra code for the XML configuration keeps small. Eventually we will separate the configuration into another project so is can be used by other applications or libraries with their configuration beans.

The structure adheres to a strict scheme of container.type.property.type.properties [ .type.property …]. This simplifies the processing and also leaves room for extensions.

12. Logging

The log output of cache2k is very sparse, however, some critical information could be send to the log, so proper logging configuration is essential.

12.1. Supported Log Infrastructure

cache2k supports different logging facades and the JDK standard logging. The supported mechanisms include:

SLF4J
JDK standard logging

The availability is evaluated in the above order and the first match is picked and used exclusively for log output. E.g. if the slf4j-api is present, the log output will be directed to SLF4J. This scheme should have the desired results, without the need of additional configuration of the used logging facade.

12.2. Wiring a Custom Log Target

In case none of the above logging infrastructure can be used the service provider interface org.cache2k.core.log.LogFactory can be implemented and provided via the ServiceLoader mechanism.

13. Statistics

cache2k collects statistics by default and is able to expose them via JMX, micro meter or via the API.

13.1. Programmatic access

Statistics are available via the API, see the API documentation: CacheStatistics

13.2. JMX Metrics

JMX support is present in the module cache2k-jmx. Registration of JMX beans is disabled by default and needs to be enabled.

The management beans are registered with the platform MBean server. The object name of a cache follows the pattern org.cache2k:type=Cache,manager=<managerName>,name=<cacheName>, the object name of a cache manager follows the pattern org.cache2k:type=CacheManager,name=<managerName>.

More detailed information can be found in the API documentation:

13.2.1. Conflicting Manager Names in JMX

Multiple cache managers with the identical name may coexist under different class loaders. With JMX enabled, this will lead to identical JMX objects and refusal of operation. A workaround is to use unique cache manager names. The name of the default manager, which is usually "default" can be changed via the XML configuration or a call to CacheManager.setDefaultName early in the application startup.

13.3. Micrometer Metrics

Micrometer support is available in the module cache2k-micrometer.

13.4. `toString()` Output

The output of the toString() method is extensive and also includes internal statistics. Example:

Cache{database}(size=50003, capacity=50000, get=102876307, miss=1513517, put=0, load=4388352, reload=0, heapHit=101362790, refresh=2874835, refreshFailed=42166, refreshedHit=2102885, loadException=0, suppressedException=0, new=1513517, expire=587294, remove=8156, clear=0, removeByClear=0, evict=868064, timer=3462129, goneSpin=0, hitRate=98.52%, msecs/load=0.425, asyncLoadsStarted=2874835, asyncLoadsInFlight=0, loaderThreadsLimit=8, loaderThreadsMaxActive=8, created=2016-12-02 03:41:34.367, cleared=-, infoCreated=2016-12-02 14:34:34.503, infoCreationDeltaMs=21, collisions=8288, collisionSlots=7355, longestSlot=5, hashQuality=83, noCollisionPercent=83, impl=HeapCache, eviction0(impl=ClockProPlusEviction, chunkSize=11, coldSize=749, hotSize=24252, hotMaxSize=24250, ghostSize=12501, coldHits=11357227, hotHits=38721511, ghostHits=294065, coldRunCnt=444807, coldScanCnt=698524, hotRunCnt=370773, hotScanCnt=2820434), eviction1(impl=ClockProPlusEviction, chunkSize=11, coldSize=778, hotSize=24224, hotMaxSize=24250, ghostSize=12501, coldHits=11775594, hotHits=39508458, ghostHits=283324, coldRunCnt=423258, coldScanCnt=674762, hotRunCnt=357457, hotScanCnt=2689129), evictionRunning=0, keyMutation=0)

14. JCache

cache2k supports the JCache API standard. The implementation is compatible to the JSR107 TCK.

14.1. Maven Dependencies

To use cache2k as JCache caching provider for local caching add the following dependency:

    <dependencies>
      <dependency>
        <groupId>org.cache2k</groupId>
        <artifactId>cache2k-jcache</artifactId>
        <version>2.6.1.Final</version>
        <type>pom</type>
        <scope>runtime</scope>
      </dependency>
    </dependencies>

The cache2k-jcache artifact transitive dependencies provide everything for the local caching functionality at runtime.

For applications that need the JCache API or cache2k API in compile scope, use this scheme:

    <properties>
      <cache2k-version>2.6.1.Final</cache2k-version>
    </properties>

    <!--
          Provides the cache2k JCache implementation and depends on
          everything needed at runtime.
      -->
    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-jcache</artifactId>
      <version>${cache2k-version}</version>
      <scope>runtime</scope>
    </dependency>
    <!--
         Optional. When application code depends on JCache API.
      -->
    <dependency>
      <groupId>javax.cache</groupId>
      <artifactId>cache-api</artifactId>
      <version>1.1.0</version>
    </dependency>
    <!--
         Optional. When application code depends on the cache2k API.
         Needed for the programmatic configuration inside the application.
      -->
    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-api</artifactId>
      <version>${cache2k-version}</version>
    </dependency>

14.2. Getting the Caching Provider

If more then one JCache provider is present in the application, the cache2k JCache provider needs to be requested explicitly. The classname of the provider is org.cache2k.jcache.provider.JCacheProvider. To request the cache2k provider use:

    CachingProvider cachingProvider =
      Caching.getCachingProvider("org.cache2k.jcache.provider.JCacheProvider");

14.3. Getting Started with the JCache API

Since cache2k is JCache compatible, any available JCache introduction can be used for the first steps. The following online sources are recommended:

14.4. Configuration

JCache does not define a complete cache configuration, for example, configuring the cache capacity is not possible. To specify a meaningful cache configuration, the cache2k configuration mechanism needs to be utilized.

14.4.1. Programmatic Configuration

To create a JCache with an additional cache2k configuration the interface ExtendedConfiguration and the class MutableExtendedConfiguration is provided. Example usage:

    CachingProvider p = Caching.getCachingProvider();
    CacheManager cm = p.getCacheManager();
    Cache<Long, Double> cache = cm.createCache("aCache", ExtendedMutableConfiguration.of(
      new Cache2kBuilder<Long, Double>(){}
        .entryCapacity(10000)
        .expireAfterWrite(5, TimeUnit.MINUTES)
    ));

14.4.2. Cache Manager and XML Configuration

To get the default cache manager, do:

    provider.getCacheManager(null, classloader, properties);

    provider.getCacheManager(getDefaultURI(), classloader, properties);

or simply, if you don’t need to specify properties or a classloader:

    provider.getCacheManager();

In case there is a cache2k.xml present in the class path, this will be picked up automatically.

Different cache manager URIs will create different cache manager have a separate XML configuration. The manager URI is not the path to the configuration file but simply the cache manager name as used within the cache2k API. For example:

    provider.getCacheManager(new URI("special"), classloader, properties);

In the above example special is used as the name of the manager and the configuration is expected to be at cache2k-special.xml. There is no need for an URI to have a scheme. For a rationale, see: (GH#91)[https://github.com/cache2k/cache2k/issues/91]

Within the XML configuration the different sections for a cache do apply for caches requested via CacheManager.createCache or CacheManager.getCache, see XML configuration.

14.4.3. Optimizations and Semantic Differences

In case a JCache cache is created with a cache2k configuration present, the generated JCache cache has slightly different semantics then those defined in the JCache specification. cache2k has different default settings, which are optimized towards performance. This section explains the differences and how to switch back to the standard behavior, if this might be really needed.

Expiry: With pure JCache configuration the expiry is set to sharp expiry by default for maximum compatiblity. When the cache2k configuration is used, sharp expiry is off and needs to be enabled explicitly.
Configuration.isStoreByValue: When the cache2k configuration is active the parameter will be ignored. Store by value semantics can enabled again in the JCache configuration section of cache2k JCacheConfiguration.Builder.copyAlwaysIfRequested. See example below how to specify the additional parameters.
Cache.registerCacheEntryListener: Online attachment of listeners is not supported unless listeners are already present in the initial configuration, or if explicitly enabled via JCacheConfiguration.Builder.supportOnlineListenerAttachment. If no listeners are needed, which is most often the case, cache2k switches to a faster internal processing mode.

To enable full JCache compliance by default via the XML configuration, use these defaults in the setup:

<cache2k>
  <version>1.0</version>
  <defaults>
    <cache>
      <sections>
        <jcache>
          <copyAlwaysIfRequested>true</copyAlwaysIfRequested>
          <supportOnlineListenerAttachment>true</supportOnlineListenerAttachment>
        </jcache>
      </sections>
      <sharpExpiry>true</sharpExpiry>
    </cache>
  </defaults>
</cache2k>

There are more optimizations in cache2k happening transparently, e.g. when a expiry policy with a static setting is detected. For this reason avoid using a custom expiry policy class if not needed.

14.4.4. Merging of JCache and cache2k Configuration

The JCache configuration and the cache2k configuration may have settings that control the same feature, for example expiry. In this case the two configurations need to be merged and conflicting settings have to be resolved. The policy is as follows:

Expiry settings: Settings in cache2k configuration take precedence. A configured expiry policy in the standard JCache CacheConfiguration will be ignored if either expiryAfterWrite or expiryPolicy is specified in the cache2k configuration.
Loader and Writer: Settings in JCache configuration take precedence. If a loader or a writer is specified in the JCache CacheConfiguration the setting in the cache2k configuration is ignored.
Event listeners: Registered listeners of both configurations will be used.

14.5. Control Custom JCache Semantics

The cache2k JCache implementation has additional options that control its semantics. These options are available in the JCacheConfig configuration section, which is provided by the cache2k-jcache-api module.

Example usage:

    CachingProvider p = Caching.getCachingProvider();
    CacheManager cm = p.getCacheManager();
    Cache<Long, Double> cache = cm.createCache("aCache", ExtendedMutableConfiguration.of(
      new Cache2kBuilder<Long, Double>(){}
        .entryCapacity(10000)
        .expireAfterWrite(5, TimeUnit.MINUTES)
        .with(new JCacheConfiguration.Builder()
          .copyAlwaysIfRequested(true)
        )
    ));

The example enables store by value semantics again and requests that keys and values are copied when passed to the cache or retrieved from the cache.

14.6. Don’t Mix APIs

The cache2k JCache implementation wraps a native cache2k. For a JCache cache instance it is possible to retrieve the underlying cache2k implementation, for example by using Cache.unwrap. Using the native API in combination with the JCache API may have unexpected results. The reason is, that a native cache is configured differently by the JCache implementation to support the JCache behavior (e.g. the ExceptionPropagator is used).

An application that mixes APIs may break between cache2k version changes, in case there is an incompatible change in the adapter layer. There is no guarantee this will never happen.

14.7. Implementation Details

14.7.1. Semantic Changes Between JCache 1.0 and JCache 1.1

The JCache specification team has made some changes to its TCK since the original 1.0 release. The cache2k implementation adheres to the latest corrected TCK 1.1.

Table 4. Corrected or Enforced JSR107 Semantics in TCK 1.1
Affected Component	JSR107 GitHub issue
`EntryProcessorException`	https://github.com/jsr107/jsr107tck/issues/85
Customizations may implement `Closeable`	https://github.com/jsr107/jsr107tck/issues/100
`CacheEntry.getOldValue()` for removed event	https://github.com/jsr107/jsr107spec/issues/391
Statistics of `Cache.putIfAbsent()`	https://github.com/jsr107/jsr107tck/issues/63
`CacheManager.getCacheNames()`	https://github.com/jsr107/jsr107tck/issues/87
`CacheManager.getCache()`	https://github.com/jsr107/jsr107spec/issues/340
JMX statistics	https://github.com/jsr107/jsr107tck/issues/83

14.7.2. Violations of TCK 1.1

cache2k passes all TCK tests with currently one exception, which we believe is more correct.

Table 5. Slightly different semantics in TCK 1.1
Affected	JSR107 GitHub issue
No statistics updates for no operation EntryProcessor	https://github.com/cache2k/cache2k/issues/132

14.7.3. Expiry Policy

If configured via cache2k mechanisms, the cache2k expiry settings take precedence.

If a JCache configuration is present for the expiry policy the policies EternalExpiryPolicy, ModifiedExpiredPolicy and CreatedExpiredPolicy will be handled more efficiently than a custom implementation of the ExpiryPolicy.

The use of TouchedExpiryPolicy or ExpiryPolicy.getExpiryAccess() is discouraged. Test performance carefully before use in production.

14.7.4. Store by Value

If configured via cache2k mechanisms, store by value semantics are not provided by cache2k by default. Instead the usual in process semantics are provided. Applications should not rely on the fact that values or keys are copied by the cache in general.

For heap protection cache2k is able to copy keys and values. This can be enabled via the parameter JCacheConfiguration.setCopyAlwaysIfRequested, see the configuration example above.

14.7.5. Loader exceptions

cache2k is able to cache or suppress exceptions, depending on the situation and the configuration.

If an exception is cached, the following behavior can be expected:

Accessing the value of the entry, will trigger an exception
Cache.containsKey() will be true for the respective key
Cache.iterator() will skip entries that contain exceptions

14.7.6. Listeners

Asynchronous events are delivered in a way to achieve highest possible parallelism while retaining the event order on a single key. Synchronous events are delivered sequentially.

14.7.7. Entry processor

Calling other methods on the cache from inside an entry processor execution (reentrant operation), is not supported. The entry processor should have no external side effects. To enable asynchronous operations, the execution may be interrupted by a RestartException and restarted.

14.7.8. Cache.getConfiguration()

It is not possible to retrieve the additional effective cache2k configuration with this method.

14.8. Performance

Using the JCache API does not deliver the same performance as when the native cache2k API is used. Some design choices in JCache lead to additional overhead, for example:

Event listeners are attachable and detachable at runtime
Expiry policy needs to be called for every access
Store-by-value semantics require keys and values to be copied

14.9. Compliance Testing

To pass the TCK tests on statistics, which partially enforce that statistic values need to be updated immediately. For compliance testing the following system properties need to be set:

org.cache2k.core.HeapCache.Tunable.minimumStatisticsCreationTimeDeltaFactor=0
org.cache2k.core.HeapCache.Tunable.minimumStatisticsCreationDeltaMillis=-1

Since immediate statistics update is not a requirement by the JSR107 spec this is needed for testing purposes only.

15. Android

cache2k version 1 is compatible with Java 6 and Android. Regular testing is done against API level 16. cache2k version 2 is compatible with API level 26. Regular CI testing with Android is implemented, see [GH#143](https://github.com/cache2k/cache2k/issues/143).

15.1. Usage

For cache2k version 1.1 and above, include the following dependency:

    implementation 'org.cache2k:cache2k-api:2.6.1.Final'
    runtimeOnly 'org.cache2k:cache2k-core:2.6.1.Final'

15.2. Proguard Rules

To minimize the footprint of the application the unused code can be removed via Proguard rules. This example might be outdated. Please give feedback if a different setting works better.

# mandatory proguard rules for cache2k to keep the core implementation
-dontwarn org.slf4j.**
# TODO: the following -dontwarn are probably not needed any more
-dontwarn javax.cache.**
-dontwarn javax.management.**
-dontwarn javax.naming.**
-keep interface org.cache2k.spi.Cache2kCoreProvider
-keep public class * extends org.cache2k.spi.Cache2kCoreProvider

# optional proguard rules for cache2k, to keep XML configuration code
# if only programmatic configuration is used, these rules may be ommitted
-keep interface org.cache2k.core.spi.CacheConfigurationProvider
-keep public class * extends org.cache2k.core.spi.CacheConfigurationProvider
-keepclassmembers public class * extends org.cache2k.configuration.ConfigurationBean {
    public void set*(...);
    public ** get*();
}

16. Spring Framework

16.1. Enable Cache2k Spring Support

To use cache2k in you Spring project, add the following libraries:

    <properties>
      <cache2k-version>2.6.1.Final</cache2k-version>
    </properties>

    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-api</artifactId>
      <version>${cache2k-version}</version>
    </dependency>
    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-core</artifactId>
      <version>${cache2k-version}</version>
      <scope>runtime</scope>
    </dependency>
    <dependency>
      <groupId>org.cache2k</groupId>
      <artifactId>cache2k-spring</artifactId>
      <version>${cache2k-version}</version>
    </dependency>

The dependency to cache2k-spring also adds cache2k-micrometer.

To enable your application to use caching via cache2k, typically define a CachingConfig class which makes the SpringCache2kCacheManager available:

@Configuration
@EnableCaching
public class CachingConfig {

  @Bean
  public CacheManager cacheManager() {
    return new SpringCache2kCacheManager();
  }

}

By default, there needs to be a cache configuration (see below) for every used cache. If the cache manager should return a cache, even the name is unknown, then use setAllowUnknownCache(true). See SpringCache2kCacheManager API documentation for a complete method reference.

16.2. XML Based Setup

When enabled as shown above, the cache2k cache manager will create caches as requested by the application. For the default cache manager, the cache configuration is looked up in the file cache2k.xml. For further details see the Configuration via XML chapter.

16.3. Programmatic Setup

It is possible to configure various caches with different configuration parameters via the builder pattern of cache2k:

  @Bean
  public CacheManager cacheManager() {
    return new SpringCache2kCacheManager()
      .defaultSetup(b->b.entryCapacity(2000))
      .addCaches(
        b->b.name("countries"),
        b->b.name("test1").expireAfterWrite(30, TimeUnit.SECONDS).entryCapacity(10000),
        b->b.name("anotherCache").entryCapacity(1000)).permitNullValues(true);
  }

The makes three caches available. The method addCaches adds one or more caches by modifying a builder with the specified parameters. The method defaultSetup can be used to apply default settings which should be used for all caches. It is possible to mix the programmatic setup with the XML configuration of cache2k, in this case the order of application is:

XML setup default section
SpringCache2kCacheManager.defaultSetup
SpringCache2kCacheManager.addCaches
XML setup cache section

Cache2k supports multiple cache managers. Caches used within the Spring DI context can be separated into another cache manager, e.g. with new SpringCache2kCacheManager("spring"). Separating caches into different cache managers helps avoid naming conflicts and makes setup and monitoring easier.

For further details, see SpringCache2kCacheManager API documentation.

16.4. Multiple Application Contexts

In testing scenarios multiple applications contexts are created. To avoid conflicts its a good idea to have separate cache managers in each application context. Within testing a feasible trick is to use the hash code of the configuration object to create a unique name for the caching manager.

  @Bean
  public CacheManager cacheManager() {
    return new SpringCache2kCacheManager("spring-" + hashCode());
  }

However, this makes it impossible to use the cache2k external XML configuration since the configuration file name is derived from the manager name.

16.5. Spring Bean XML Configuration

For completeness, together with the Cache2kConfig bean, it is also possible to use the Spring bean XML configuration.

16.6. Null Value Support

The cache abstraction of Spring is supporting null values. To support this a cache used by Spring is build with Cache2kBuilder.permitNullValues(true) by default. See Null Values.

In case the application does not store null values its good practise to include permitNullValues(false) in the cache configuration, to document and enforce this fact.

16.7. Resources

More information on the Spring Cache abstraction can be found here: