upgrade_guide

Upgrade guide

4.0.0

Java driver 4 is not binary compatible with previous versions. However, most of the concepts remain unchanged, and the new API will look very familiar to 2.x and 3.x users.

Runtime requirements

The driver now requires Java 8. It does not depend on Guava anymore (we still use it internally but it's shaded).

Packages

The root package names have changed. There are also more sub-packages than previously. See API conventions for important information about the naming conventions.

Generally, the public types have kept the same name, so you can use the "find class" feature in your IDE to find out the new locations.

New configuration API

The configuration has been completely revamped. Instead of ad-hoc configuration classes, the default configuration mechanism is now file-based, using the Typesafe Config library. This is a better choice for most deployments, since it allows configuration changes without recompiling the client application. This is fully customizable, including loading from different sources, or completely overriding the default implementation.

For more details, refer to the manual.

Expose interfaces, not classes

Most types in the public API are now interfaces (as opposed to 3.x: Session, statement classes, etc). The actual implementations are part of the internal API. This provides more flexibility in client code (e.g. to wrap them and write delegates).

Thanks to Java 8, factory methods can now be part of these interfaces directly, e.g. CqlSession.builder(), SimpleStatement.newInstance.

No more `Cluster`

In previous driver versions, initialization was done in two steps: create a Cluster, and then call its connect method to create a Session.

Those two types have now been merged: there is only one Session object, that you initialize directly.

Generic session API

Session is now a high-level abstraction capable of executing arbitrary requests. Out of the box, the driver exposes a more familiar subtype CqlSession, that provides familiar signatures for CQL queries (execute(Statement), prepare(String), etc).

However, the request execution logic is completely pluggable, and supports arbitrary request types (as long as you write the boilerplate to convert them to protocol messages). In the future, we will take advantage of that to provide:

a reactive API;
a high-performance implementation that exposes bare Netty buffers;
specialized requests in our DataStax Enterprise driver.

If you're interested, take a look at RequestProcessor.

Immutable statement types

Simple, bound and batch statements implementations are now all immutable. This makes them automatically thread-safe: you don't need to worry anymore about sharing them or reusing them between asynchronous executions.

One word of warning -- all mutating methods return a new instance, so make sure you don't accidentally ignore their result:

BoundStatement boundSelect = preparedSelect.bind();

// This doesn't work: setInt doesn't modify boundSelect in place:
boundSelect.setInt("k", key);
session.execute(boundSelect);

// Instead, do this:
boundSelect = boundSelect.setInt("k", key);

Note that, as indicated in the previous section, the public API exposes these types as interfaces: if for some reason you prefer a mutable implementation, it's possible to write your own.

Dual result set APIs

In 3.x, both synchronous and asynchronous execution models shared a common result set implementation. This made asynchronous usage notably error-prone, because of the risk of accidentally triggering background synchronous fetches.

There are now two separate APIs: synchronous queries return a ResultSet; asynchronous queries return a future of AsyncResultSet.

ResultSet behaves much like its 3.x counterpart, except that background pre-fetching was deliberately removed, in order to keep this interface simple and intuitive. This is why methods such as fetchMoreResults, getAvailableWithoutFetching and isFullyFetched have disappeared. If you were using synchronous iterations with background pre-fetching, you should now switch to fully asynchronous iterations (see below).

AsyncResultSet is a simplified type that only contains the rows of the current page. When iterating asynchronously, you no longer need to stop the iteration manually: just consume all the rows in the iterator, and then call fetchNextPage to retrieve the next page asynchronously. You will find more information about asynchronous iterations in the manual pages about asynchronous programming and paging.

Simplified request timeout

The driver-side request timeout -- defined by the request.timeout configuration option -- now spans the entire request, including all retries, speculative executions, etc. In other words, it's the maximum amount of time that the driver will spend processing the request. If it fires, all pending tasks are cancelled, and a DriverTimeoutException is returned to the client. (Note that the "cancellation" is only driver-side, currently the protocol does not provide a way to tell the server to stop processing a request; if a message was "on the wire" when the timeout fired, then the driver will simply ignore the response when it eventually comes back.)

This is in contrast to 3.x, where the timeout defined in the configuration was per retry, and a global timeout required specific user code.

Dedicated type for CQL identifiers

Instead of raw strings, the names of schema objects (keyspaces, tables, columns, etc.) are now wrapped in a dedicated CqlIdentifier type. This avoids ambiguities with regard to case sensitivity.

For example, this type is used in schema metadata or when creating a session connected to a specific keyspace. When manipulating "data containers" such as rows, UDTs and tuples, columns can also be referenced by a CqlIdentifier; however, we've also kept a raw string variant for convenience, with the same rules as in 3.x (see GettableById and GettableByName for details).

Atomic metadata updates

Session.getMetadata() is now immutable and updated atomically. The node list, schema metadata and token map exposed by a given Metadata instance are guaranteed to be in sync.

On the other hand, this means you have to call getMetadata() again each time you need a fresh copy; do not cache the result.

See the manual for all the details.

Improved protocol version negotiation

You no longer need to force the protocol version in a mixed cluster: upon connecting to the first node, the driver will read the release version of all the nodes in the cluster and infer the best protocol version that works with all of them.

Improved metrics

Metrics can now be enabled selectively. In addition, they are exposed per node when that is relevant.

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Upgrade guide

4.0.0

Runtime requirements

Packages

New configuration API

Expose interfaces, not classes

No more `Cluster`

Generic session API

Immutable statement types

Dual result set APIs

Simplified request timeout

Dedicated type for CQL identifiers

Atomic metadata updates

Improved protocol version negotiation

Improved metrics

FilesExpand file tree

upgrade_guide

Directory actions

More options

Directory actions

More options

Latest commit

History

upgrade_guide

Folders and files

parent directory

README.md

Upgrade guide

4.0.0

Runtime requirements

Packages

New configuration API

Expose interfaces, not classes

No more Cluster

Generic session API

Immutable statement types

Dual result set APIs

Simplified request timeout

Dedicated type for CQL identifiers

Atomic metadata updates

Improved protocol version negotiation

Improved metrics

No more `Cluster`