Experience working with varies forms of data infrastructure inclusive of relational database such as SQL, Delta lake and Spark
Experience with orchestration tools e.g. batch and real-time data processing
Experience with CICD pipeline data and machine learning model deployment
Knowledge in ETL and data pipeline framework, data patterns such as structured, semi structured, understand data in various type of database and data store such as Hive, HBase, Impala, MongoDB, Delta Lake, graph database using batch and streaming mechanism leveraging on-premise and on cloud Big data architecture
Knowledge in data science fundamental such as database, operation system, algorithms, data structures, etc
Interest leaning and working with new technology stack