DriveWealth
Alliance for Recruitment is the largest recruitment consultancy in the Baltics measured by capacity, number of successful placements and annual growth. We are a high performing team of recruitment experts from various different industries.
Our Client - DriveWealth - is a global B2B financial technology organization dedicated to democratizing access to financial independence around the world. Our mission is realized through an API-based platform, empowering our partners to offer seamless investing and trading experiences to clients worldwide, all from their mobile devices.
Our technology provides partners with a modern, extensible toolkit, enabling traditional investment workflows and innovative techniques like fractional share ownership. DriveWealth has evolved into a global platform offering trading of US equities, mutual funds, ETFs, fixed income, and options.
We seek enthusiastic professionals to contribute diverse perspectives and experiences to our Brokerage-as-a-Service platform. Our culture blends the pace and opportunity of a tech start-up with the impact, stability, and significance of Wall Street. We encourage creativity and experimentation while ensuring institutional-grade execution and regulatory compliance in everything we do. We value diversity and inclusion, celebrating the unique differences of our employees as we scale and grow together. We’re guided by operating principles grounded in accountability, teamwork, integrity, and solutions built to scale.
As a Principal Site Reliability Engineer based in Lithuania, you will spearhead initiatives to maximize the platform's reliability and performance, particularly during critical trading session hours, in coordination with our New York office. You will be pivotal in leading our efforts to support existing and new operational processes, ensuring continuous coverage of our global financial services.
What You’ll Do
- Maintain and enhance the reliability and performance of SRE solutions, ensuring 24/7 operational coverage with a focus on scalability and high availability
- Oversee and manage high-priority technical issues and incidents, ensuring effective resolution during all hours, including overnight and weekends
- Implement and manage automation tools and processes to reduce manual intervention and ensure continuous operational efficiency
- Adhere to incident and change management processes to minimize disruptions and optimize response times, maintaining robust 24/7 support
- This role demands high engagement with systems during non-traditional hours to support our international operations. It requires a proactive approach to communication and coordination across time zones, including collaboration with global teams such as the New York office, to align on operational practices and standards, ensuring consistent and effective support across all time zones
What You’ll Need:
- 5+ years of experience in a senior SRE or similar position, demonstrating deep knowledge and expertise in site reliability engineering or operations
- Advanced Python skills and proficiency in scripting for automation and system management, with a track record of developing and implementing automation solutions
- Expertise in SQL and transactional databases, including performance tuning and troubleshooting, combined with strong analytical skills and a demonstrated ability to perform in-depth troubleshooting and root cause analysis of complex technical issues
- AWS knowledge
- Availability for flexible work hours and willingness to cover US markets trading sessions, including L2/L3 on-call coverage
Nice to Have, But Not Required:
- Expertise in REST APIs and proven ability to manage and optimize complex systems with a strong understanding of API integration
- Proficiency in managing Java applications and a basic understanding of browser developer console for front-end application debugging
- Analytical and troubleshooting skills with a demonstrated ability to perform in-depth troubleshooting and root cause analysis of complex technical issues
- Advanced knowledge of Change Management Processes and risk management
- In-depth understanding of FIX protocol and Corvil to manage related connectivity incidents, network topology and functionality expertise
- Monthly Fitness/Wellness Reimbursement: €70 per month expense
- Medical Reimbursement: €100 per month expense
- Professional Development: €2,300 per year expense
- Vacation: 20 days annual leave per year
- Parental Leave: Statutory leave (required by law)
- Employee Referral Program: Eligible to receive a €900 referral bonus per referral policy
- Hybrid or Remote work experience that allows for flexibility.
Nuo 6000 - 7000 EUR Gross