Infra System Admin
Job Title: Infra System Admin
Location: Chennai, TN
Primary Objective
Own the setup, support, maintenance and administration of the Cameo (CATIA Magic) MBSE stack, including Teamwork Cloud and Cameo Collaborator (via Web Application Platform). Ensure a secure, efficient and reliable modeling environment across non-production and production.
Environment / Tech Context
- Modeling tools: Cameo Systems Modeler (based on MagicDraw), with SysML/UML plug-ins and the reporting framework.
- Repository & collaboration: Teamwork Cloud (TWC) for central model storage, element-level versioning, RBAC, web admin and REST/OSLC endpoints; supports branching/merging and large models.
- Web access & reviews: Cameo Collaborator on Web Application Platform, typically hosted on Tomcat and recommended on separate nodes from TWC.
- Identity & access: SSO integration available via TWC Admin; role-based permissions with global/custom scopes.
- Licensing: DSLS license server; clients check out licenses, and the admin monitors usage and compatibility with tool versions.
- Data tier & HA: Cassandra back end for TWC; supports clustered deployments and hot/cold backups. DR procedures include nodetool snapshot/restore.
- Platform requirements: Current CATIA Magic releases align on modeling tools and recommend specific server profiles for TWC/Collaborator.
Key Responsibilities (supporting the Infrastructure Lead)
Monitoring & Health
- Monitor application and system logs (TWC services, Tomcat, Cassandra) and respond to alerts. Validate repository and web admin availability.
- Track model throughput and REST/OSLC service health; tune JVM, connector and thread-pool settings as needed.
Access, Roles & Security
- Operate SSO (configure directories, bind/timeout policies) and maintain RBAC in TWC Admin.
- Manage TLS certificates/keystores across TWC; apply vendor hardening guidance and secure container defaults.
- Patch OS/Java/Cameo tools/TWC within approved timeframes, validating against vendor compatibility.
Backups, DR & Capacity
- Run Cassandra cold/hot backups (nodetool snapshot, drain) with DBA teams and document RPO/RTO; periodically perform restores in the non-production environment.
- Plan cluster sizing/replication factor and scale out with storage separation for data/commit logs.
- Track storage growth for repositories and Collaborator exports, and forecast capacity.
Build, Release & Environment Management
- Install/upgrade Teamwork Cloud and Web Application Platform/Cameo Collaborator.
- Validate DSLS license server connectivity.
- Manage non-production refreshes from production, including access controls and environment alignment; coordinate with Cameo functional SME teams on best practices.
Troubleshooting & Performance
- Triage SSO authentication issues, model commit conflicts, REST/OSLC errors and Collaborator rendering problems; conduct RCA and implement preventive actions.
- Tune the JVM (heap/GC) and web container in line with vendor guidance.
Reporting, Audits & Licensing
- Generate usage/health reports and keep audit evidence (access, changes, DR tests).
- Operate DSLS for floating licenses, monitor usage and ensure tool compatibility.
Documentation & Knowledge Transfer
- Maintain architecture diagrams, config baselines, SOPs and runbooks. Train L1/engineering teams on common procedures.
Required Experience & Skills
- 6–8 years administering collaborative engineering platforms, specifically Cameo + Teamwork Cloud on on-prem Linux VMs.
- Strong skills in Linux, Java/Tomcat, Cassandra (snapshot/restore process), TLS/certificates and SSO.
- Proven experience with TWC Admin (roles/permissions, project operations) and Collaborator deployment.
- Familiar with REST/OSLC endpoints and model collaboration workflows.
- Comfortable with DSLS licensing operations and client connectivity.
- Experience with TWC clustering, production HA patterns and ZooKeeper.
- Experience with observability stacks (Grafana) and automation (Bash/PowerShell).
- Familiarity with Cameo Systems Modeler plug-ins (Simulation Toolkit, Requirements Modeler) and reporting.
Soft Skills
- Clear written and verbal communication with engineering and non-technical stakeholders; disciplined change management and incident handling.
Skill Set:
Primary:
- Polarion/CAMEO Sys Admin
Secondary:
- Enovia Sys Admin
- Linux, DevOps/Jenkins, etc.
Please apply on our secure job site at https://intellibee.my.salesforce-sites.com/apps/jobs?id=a0AUU000005WqtR2AS or email [email protected]