Is it possible to take advantage of batching when draw order depends on depth?

Context: An orthographically projected game in which entities are drawn back to front: the farther an entity is from the camera along the Z axis, the earlier it is drawn.

I do use texture atlases where I can (animations are stored in atlases), but to keep the game's engine easily extensible, two entities can't share the same atlas.
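
To make the problem concrete, here is a rough sketch of the draw order described above. It is not engine code: `Entity`, `atlas_id`, and `build_batches` are hypothetical names, and it assumes that a larger `z` means farther from the camera and that only instances of the same entity type share an atlas. A batch here is simply a run of consecutive entities, in back-to-front order, that use the same atlas.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass
class Entity:
    name: str
    z: float       # assumption: larger z = farther from the camera
    atlas_id: int  # hypothetical handle for the entity's texture atlas


def build_batches(entities):
    """Sort back to front, then merge consecutive entities that share an atlas."""
    ordered = sorted(entities, key=lambda e: e.z, reverse=True)
    # groupby only merges *adjacent* items, which is exactly the constraint
    # that depth ordering imposes on batching.
    return [
        (atlas_id, [e.name for e in run])
        for atlas_id, run in groupby(ordered, key=lambda e: e.atlas_id)
    ]


# Two trees share atlas 1, but the player (atlas 2) sits between them in depth.
entities = [
    Entity("tree_far", 5.0, 1),
    Entity("tree_near", 3.0, 1),
    Entity("player", 4.0, 2),
]
batches = build_batches(entities)
for atlas_id, names in batches:
    print(atlas_id, names)
```

In this sketch the two trees cannot be merged into one batch because the player is drawn between them, so three entities still cost three draw calls. That is the situation I am asking about.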