Globally responsible and managing 2 teams here at Dell Technologies:
1) Global System Management Infrastructure team which is responsible for a number of management frameworks and application level infrastructure, that enable customers to communicate with the Data Domain Appliances as well simplify the end to end data backup and recovery worklfows, by enabling secure, consistent and reliable communication between control path and the data path on the appliances. The frameworks built and maintained by this global team and supported by this team include (but not limited to) REST API framework and gateway, CLI, SM service for inter process communication, Licensing service, Registry exposing a key value store to different modules, etc and are used to power several on-prem PowerProtect Data Domain appliances, virtual appliance, Multi-management center as well Integrated appliances that are packaged with Backup software. There are also projects and initiatives where we are looking at revamping all of our framework services to run as independent and scaleable microservices and pods, to run them on the next generation Kubernetes based Data Protection integrated platform. I have been leading this initiative from manageability side.
2) Data Analytics team which is responsible for enabling various ML based storage analytics workflows to the customer, involving projection of available capacity on the backup storage appliance, recommendation of placement and migration of backup data on a target system in a pool of storage systems, based on some storage heuristics, recommending the time when appliance can be upgrade in minimal disruptive way, providing several insights to the customer to take steps to improve performance, etc. this is done by implementing docker based microservices, that ate developed in python, and running them live on the appliance itself. Some analytics services run offline and provide an insight to the customer, using the offline ASUP data as the data source.
Technical Lead and Engineering Manager, System Management
11 member team focused on delivering core storage management and orchestration features at Tintri ( like replication, snapshot, VM-clone, policy based management, QOS, VM Recovery ), using RESTfull API services and plays a key role in injecting VM-level awareness in Tintri's All flash storage appliance, by integrating with multiple hypervisors like VMware, HyperV, RHEV, Xenserver, Openstack, etc.
Pragmatic Project Management Skills : Successfully managed multiple cross-functional projects across the whole SDLC stack by using effective time and project management tools like MS Project, JIRA, Clarizen, etc. Effectively used tools like Pivot Tables in Excel to do an on-demand project related resource management and allocation.
Program Management: Lead and managed the program across several cross-functional areas comprising of Backend, UI, QA and Filesystem teams, to deliver Tintri’s first-ever integrated solution with Public and Private clouds ( AWS, IBM Cleversafe, and other S3-compliant providers ), to replicate VM snapshots to the cloud for long-term retention. Encouraged the members of the team to file several patent applications for the Cloud Connector feature.
People and Career Management: Teamed up SM and HyperV teams and created 5 sub-component teams within SM, aimed at motivating innovation, better customer escalation management, better bug-backlog management and creating more Subject matter experts, to assist and support the new and existing Tintri customer base. Played a key role in initiating POC projects like Native DR, System Health Metrics, File Cataloging Cloud service, Object-aware storage, etc, aimed at adding innovation to the future roadmap.
* Conducted 1-1 with the individuals using different techniques, like monthly-lunches, walks, ping-pong games, etc, to make the atmosphere very informal in the team. This technique was very effective in improving retention rates within the team.
Enterprise Cloud Platform Management - System Management , Tintri ( Python, Java )
Backend systems engineer for the web based ( tomcat ) products at Tintri, like the Systems Management and TGC.
Played a key role in designing distributed Cloud replication workflows on Tintri Global Center, like Policy-based Scheduled Snapshot Cloud Replication, Restore Cloud Snapshots for VM-recovery, Restore-anywhere Cloud snapshots for VM DR, Snapshot Adoption based Recovery solutions for Appliance Failure Domains, Encryption and Key Management, Cloud destinations lifecycle management, etc.
Implemented key management features targeted at a VM-aware storage solution like Policy based APIs, Snapshot lifecycle, Replication, QOS, VM Thinning, etc
Redesigned Replication REST APIs and designed/implemented 1-many VM Replication feature for the product.
Component Lead for the SM Hypervisor and Replication modules.
Worked on sustaining SRA plugin for VMware Site Recovery Manager solution, and helped in redesigning the feature to make it more robust in the field.
Worked on version 1 of Synchronous Replication feature of system management and TGC product, which was designed as a 0 RPO solution for customers needing to mirror their data without any delays.
Designed and developed an Incremental vCenter update solution for refreshing VM metadata properties in real time from the vCenter, so that user can see the real time VM level awareness with respect to the 7-day historical stats at the level of IOPS, latency, throughput, etc.
Played a lead role in designing and developing next generation storage software solution aimed at a generic storage object model, so that VM-level like awareness can be injected to other object types like VVOLs, DB, iSCSI Volume, Containers, etc. Responsible for creating a OMS ( Object Management Service ) and ORS ( Object Repository Service ) layers in the product, to interface with an integration service in providing support for a generic object.
Working on a new generation of policy based storage and backup management system at NetApp called OnCommand Unified Manager. This product closely integrates with NetApp's proprietary Data ONTAP software (cluster-mode), offers system health and performance monitoring capability and various easy solutions to customers like setting up Data Protection relationship easily, setting up RTO Boost or easy Disaster Recovery, etc, all through an easy to use OnCommand UI.
Responsibilities include:
● Major contributor in product design across cross-functional teams at NetApp like Data ONTAP,
Workflow Automation, etc.
● Designed the whole new architecture of Data backup and Mirror from Unified Manager, by
creating a Java data access service layer to execute RESTfull services or APIs of NetApp
Workflow Automation engine and a way to track the execution of the operation using simple to
use Unified Manager GUI, with the ability to receive alerts when the operation fails.
● Co-managed the scrum team, to ensure all monthly deliverables are met and often acted as a
Scrum EPO, whenever required to answer design level questions.
● Designed and implemented brand new Management APIs for FlexProtect, using Java, Spring,
Hibernate, MySql, Groovy, Java for RESTfull services, etc
● Designed and implemented end to end workflows for setting up Disaster Recovery of customer’s
Primary data site, using perl based server side scripting for automation engine
● Designed and implemented end to end workflows for applying configuration data to customer’s
Secondary data site, while recovering from a disaster
● Integrating the server core logic with the UI, so that the end to end workflows and use case would
work seamlessly
● Implement on-demand active management operations in UM like applying RTO Boost,
failing over to secondary data site and failback of data to primary data site, etc.
● Enhancing new Test automation environment using E2E tests
Worked as a Database Administrator to store and manage the data effectively for the projects that other graduate
assistants were working on. The responsibilities included the following:
- Setting up MySQL database
- Converting the data in form of .csv files into MySql tables
- Using database techniques like Indexing and Filtering using
Query JOINS, etc to store and fetch appropriate data
effectively.
Worked on an Intern project to investigate into the Scalability and Performance of the Protection Manager product. Also worked on creating a prototype to build a test environment using AWS cloud service, to test the behaviors of the product to manage large number of systems.
Worked as a Teaching Assistant under 2 Professors at Columbia University. Provided help to the students by solving their queries and grading their assignments and papers, in the courses “Introduction to Computer Programming using Matlab” and “Object Oriented Analysis and Design in Java”.