Configuring Manila services for High Availability - OpenInfra Summit 2022

Configuring Manila services for high availability
Victoria Martinez de la Cruz, Archana Kumari, Carlos Silva, Goutham Pacha Ravi

In this presentation
• Gather data from Manila users to brainstorm at the end of the talk. Please scan the code at the right to get to the etherpad, or access it at https://etherpad.opendev.org/p/manila-ha-berlin-2022
• Introduce OpenStack Manila
• Discuss Manila micro-services and known issues

What is OpenStack Manila

• Manila is the Shared File Systems as a Service project for OpenStack
• Support for more than 35 storage back ends (both proprietary and open source)
• Support for multiple protocols (NFS, CIFS, HDFS, MapRFS, CephFS, GlusterFS)

What is OpenStack Manila?
• One popular oversimplification: Manila is Cinder for file shares
• Manila is a fork of OpenStack Cinder, built by a shared pool of developers, and shares much of its architecture
• The classes of problems the two projects solve have little overlap

Shares usage workflow

Manila’s micro-service architecture

Manila micro-services and their known issues

API service
• Exposes a REST front end for the service
• The API is micro-versioned
• All requests return immediately, but most are processed asynchronously through the service stack, so the caller must verify whether an operation has completed successfully
• Built for active/active high availability
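
Because requests return before the work is done, a caller typically polls until the resource settles. The following is a minimal sketch of that pattern, not Manila client code: `get_status` is a hypothetical callable standing in for whatever client call fetches the current status, and the status strings and timings are assumptions chosen for illustration.

```python
import time


def wait_for_status(get_status, wanted="available", failed=("error",),
                    timeout=60.0, interval=1.0, sleep=time.sleep):
    """Poll get_status() until it returns `wanted`, fails, or times out.

    get_status is a hypothetical zero-argument callable supplied by the
    caller; it is not part of any Manila library.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = get_status()
        if status == wanted:
            return status
        if status in failed:
            raise RuntimeError(f"operation failed with status {status!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"still {status!r} after {timeout} seconds")
        sleep(interval)
```

The `sleep` parameter is injectable only so the loop can be exercised without real delays.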

API service HA pitfalls
Running the API service in an active/active, highly available manner provides availability and load balancing, but it can lead to pitfalls when performing asynchronous calls. For example:
• Allowing access in Manila is an asynchronous task that depends on the share back end to report whether it succeeded.
• Quota management in Manila can lead to races in the reservation logic.
Tweaking the worker count (“osapi_share_workers”) allows eventlet green threads to enhance throughput by parallelizing requests.
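
The reservation race is a check-then-act problem: two workers can both see free quota and both reserve it. Here is a minimal, in-process sketch of the idea, assuming a hypothetical `QuotaTracker` class; it is not Manila's quota code. In a real multi-process or multi-node deployment a thread lock is not enough, and the check and update would have to be atomic in the shared database.

```python
import threading


class QuotaTracker:
    """Hypothetical stand-in for a quota reservation table."""

    def __init__(self, limit):
        self.limit = limit
        self.in_use = 0
        self._lock = threading.Lock()

    def reserve_unsafe(self, amount):
        # Check-then-act without protection: two callers can both pass
        # the check before either one records its reservation.
        if self.in_use + amount <= self.limit:
            self.in_use += amount
            return True
        return False

    def reserve(self, amount):
        # Holding the lock makes the check and the update one atomic step.
        with self._lock:
            return self.reserve_unsafe(amount)


def run_workers(tracker, workers, amount=1):
    """Start `workers` threads that each try one reservation.

    Returns how many reservations were granted.
    """
    results = []

    def work():
        results.append(tracker.reserve(amount))

    threads = [threading.Thread(target=work) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sum(results)
```

With the locked path, `run_workers(QuotaTracker(10), 50)` grants exactly ten reservations regardless of thread scheduling.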

Scheduler service
• Responsible for defining the placement of shared file systems on share back ends based on capability, capacity and a few other filters
• Since Yoga, you can influence the scheduler's decision for shares or replicas via scheduler hints
• The API, scheduler and share services communicate over RPC calls
• Any RPC mechanism can be used as long as oslo.messaging supports it. The community prefers working with RabbitMQ
• Designed to run in active/active HA
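
Placement by filters can be pictured as "drop the back ends that cannot host the share, then rank the rest". The sketch below illustrates only that idea; the `Backend` class, the `pick_backend` function and the "most free space wins" ranking are assumptions made for this example and do not reproduce Manila's actual filters or weighers.

```python
from dataclasses import dataclass, field


@dataclass
class Backend:
    """Hypothetical view of a share back end as seen by a scheduler."""
    name: str
    free_gb: float
    capabilities: dict = field(default_factory=dict)


def pick_backend(backends, size_gb, required):
    """Return the candidate with the most free space, or None.

    A back end is a candidate when it has room for the share and
    reports every required capability with the requested value.
    """
    candidates = [
        b for b in backends
        if b.free_gb >= size_gb
        and all(b.capabilities.get(k) == v for k, v in required.items())
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda b: b.free_gb)
```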

Scheduler service HA pitfalls
When a storage back end allows shares to be thinly provisioned, it is open to oversubscription. Manila is programmed to foresee and calculate oversubscription scenarios:
◦ Some oversubscription calculations currently occur in the scheduler; these are being moved to the share manager service (with coordination) soon.
◦ Consumed (“allocated”) capacity calculations are done pessimistically, locally in the scheduler service.
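
To see why scheduler-local accounting matters, consider a simplified oversubscription check. This is a sketch under stated assumptions: the function name, its parameters and the single-ratio rule are illustrative, and the real calculation takes more inputs (for example reserved space) than shown here.

```python
def fits_thin_backend(total_gb, provisioned_gb, requested_gb, max_ratio):
    """Simplified check: may a thin-provisioned back end take this share?

    provisioned_gb is the capacity already promised to existing shares,
    which may exceed total_gb. max_ratio is an assumed cap on how far
    promises may exceed physical capacity.
    """
    return provisioned_gb + requested_gb <= total_gb * max_ratio


def admit_with_local_views(total_gb, provisioned_gb, requests_gb, max_ratio):
    """Model schedulers that each decide from the same stale local view.

    Returns (decisions, provisioned_after): every scheduler checks against
    the original provisioned_gb, so accepted requests can add up to more
    than the cap allows.
    """
    decisions = [
        fits_thin_backend(total_gb, provisioned_gb, r, max_ratio)
        for r in requests_gb
    ]
    accepted = sum(r for r, ok in zip(requests_gb, decisions) if ok)
    return decisions, provisioned_gb + accepted
```

With a 100 GB back end, 150 GB already provisioned and a ratio of 2.0, two schedulers that each accept a 40 GB share from the same view end up at 230 GB provisioned, above the 200 GB cap that either check alone would have respected.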

Share Manager service
• Responsible for interacting with the share back ends through their drivers. Some operations tend to be asynchronous. The share manager keeps track of those either by waiting directly for an answer from the share back end, or by repeatedly asking the back end for the status of an operation through periodic tasks
• The API, scheduler and share services communicate over RPC calls
• Any RPC mechanism can be used as long as oslo.messaging supports it. The community prefers working with RabbitMQ
• Designed to run in active/passive HA

Share Manager service HA pitfalls
• Active/active isn’t tested (officially)
• In many cases, drivers expect that only a single copy of the share manager communicates with the back end storage
• There are scheduled “polling” and “recovery” activities that are wasteful when configured active/active
• Most coordination is done with local file locks, which constrains the deployment architecture
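
The constraint from local file locks can be illustrated with the Python standard library. This is a minimal sketch, assuming a Unix host and using `fcntl.flock` directly; it is not Manila's locking code, and the helper names are hypothetical. The point is that such a lock only excludes other holders that open the same file on the same host, so a second share manager on another node would not see it.

```python
import fcntl
import os


def try_lock(path):
    """Try to take an exclusive, non-blocking lock on a local file.

    Returns an open file descriptor on success, or None if another
    holder on this host already has the lock.
    """
    fd = os.open(path, os.O_CREAT | os.O_RDWR)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        os.close(fd)
        return None
    return fd


def unlock(fd):
    """Release the lock taken by try_lock() and close the descriptor."""
    fcntl.flock(fd, fcntl.LOCK_UN)
    os.close(fd)
```

A second `try_lock()` on the same path returns `None` until the first holder calls `unlock()`, but only for callers that share the same local file system.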

Data Manager service
• Works in tandem with the share manager service and is mostly a “stateless” service
• It can be used to migrate shared file systems for share back ends that don't natively support data migrations
• Multiple copies can be deployed; share migration/data copy operations will only ever go to a single data manager

Data Manager HA pitfalls
• Deployment of multiple data manager services hasn’t been tested at scale
• Data operations are long running. If the node hosting the data service goes down, there is automatic recovery, but this recovery isn’t smart: the task is simply restarted

Questions?
• Please scan the code at the right to get to the etherpad, or access it at https://etherpad.opendev.org/p/manila-ha-berlin-2022
• Reach us on IRC
◦ #openstack-manila @ Freenode

Thank you!
