apache-streamparkpr和代码规范指南

本文涉及的产品
云数据库 RDS SQL Server,基础系列 2核4GB
RDS SQL Server Serverless,2-4RCU 50GB 3个月
推荐场景:
实时计算 Flink 版,5000CU*H 3个月
简介: apache-streamparkpr和代码规范指南

世界上本来就有许多格格不入的事物为了共存而不得不相互接受。——博尔赫斯《沙之书》

最近在为streampark起草代码规范指南,对应的pr如下

https://github.com/apache/incubator-streampark-website/pull/226

对应的内容:

<!--
    Licensed to the Apache Software Foundation (ASF) under one or more
    contributor license agreements.  See the NOTICE file distributed with
    this work for additional information regarding copyright ownership.
    The ASF licenses this file to You under the Apache License, Version 2.0
    (the "License"); you may not use this file except in compliance with
    the License.  You may obtain a copy of the License at
       https://www.apache.org/licenses/LICENSE-2.0
    Unless required by applicable law or agreed to in writing, software
    distributed under the License is distributed on an "AS IS" BASIS,
    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
    See the License for the specific language governing permissions and
    limitations under the License.
-->

1 Pull Requests & Changes Rule

  1. ISSUE/PR(pull request) driving and naming
  • Ensure that PR corresponds to ISSUE.

Note: Hotfix issue does not need to follow this rule, such as fixing spelling errors in JavaDoc or document files.

  • Title naming formatWhen namingPR, you can refer to the[ISSUE-XXXX][Feature/Improve/Refactor/Bug/Cleanup] Titleof the pull request,whereISSUE-XXXXshould be replaced with the actualISSUEnumber.
  • The second part describes the type of PR, such as new features, improvement, refactor, etc.
  • If all changes to PR are within a certain module or component, they can be indicated in the commit message.
  1. Description
  • Please fill in the PR template to describe the contribution. So that the reviewer can understand the problem and solution from the description, rather than just from the code.
  • Ensure that the description is sufficient to illustrate the problem addressed by the PR.
  • Small changes do not require too much description.
  • In an ideal scenario, the problem is described in ISSUE, and most of the description is copied from there.
  1. Try to break down changes into pure types of changes
  • It’s recommended that PR should be arranged changes such as Cleanup, Refactor, Improve, and Feature into separated PRs/Commits.
  • In this way, the reviewers can independently view cleaning and refactoring, and ensure that these changes do not change behavior.
  • Then, the reviewer can independently review the core changes and ensure that they are a clean and robust change.
  • In extreme cases, if a rollback commit is required, it can provide the optimal granularity for version rollback selection.
  • In addition, significant contributions should be split into a set of independent changes that can be reviewed independently.
  1. Commit messageThe commit of messages should follow a pattern similar to thePR:[ISSUE-XXXX][Feature/Improve/Refactor/Cleanup] Title of the pull request.
  • [ISSUE-xxxx1][Improve(ment)] Improve ...
  • [ISSUE-xxxx2][Refactor] Refactor ...
  • [ISSUE-xxxx3][Feature] Support ...
  • [ISSUE-xxxx4][Bug] Fix ...
  • [ISSUE-xxxx5][Feature][subtask] Support ...
  • [Hotfix][module_name] Fix xxx comments ...

Note: Try to use git history instead of annotated code (not mandatory)

2 Code Checkstyle

  • Backend code formatting Maven plugin: spotless
    Just run mvn spotless:apply in the project repo root directory after installing the plugin.
  • Backend code specification Maven plugin: checkstyle
    Just run mvn checkstyle:checkstyle after installing the plugin.
  • Frontend code formatting plugineslint
  • The original command is npx eslint --cache --max-warnings 0 "{src,mock}/**/*.{vue,ts,tsx}" --fix
  • Encapsulated as npm run lint:eslint

3 Programming Specification

3.1 Naming Style

  1. Prioritize selecting nouns for variable naming, it’s easier to distinguish between variables or methods.
Cache<String> publicKeyCache;
  1. Pinyin abbreviations are prohibited for variables (excluding nouns such as place names), such as chengdu.
  2. It is recommended to end variable names with a type.
    For variables of type Collection/List, take xxxx (plural representing multiple elements) or end with xxxList (specific type).
    For variables of type map, describe the key and value clearly:
Map<Long, User> idUserMap;
Map<Long, String> userIdNameMap;
  1. That can intuitively know the type and meaning of the variable through its name.
    Method names should start with a verb first as follows:
void computeVcores(Object parameter1);
  1. The methods name of basicCRUDof the database layer (non-service layer) should be uniformly standardized according to namecom.baomidou.mybatisplus.core.mapper.BaseMapper:If perform a database select operation, the name of the method should be started withselect.For example,selectById,selectByXxx,selectPageByXxx
  • If perform a database update statement operation, the name of the method should be started with update
  • If perform a database insert statement operation, the name of the method should be started with insert
  • If perform a database delete statement operation, the name of the method should be started with delete
  1. The methods name of basicCRUDof the service layer should be named ascom.baomidou.mybatisplus.extension.service.IService:
  • If perform a database select operation to query multiple records, the name of the method should be started with a list, such as listByIds, listByXxx
  • If perform a database select operation to query a single record, the name of the method should be started with get, such as getByName and getOne
  • If perform a database update operation, the name of the method should be started with update
  • If perform a database insert operation, the name of the method should be started with save
  • If perform a database delete operation, the name of the method should be started with remove

3.2 Constant Variables Definition

  1. Set theserialVersionUIDof all classes to1L, followingFlink’sserialVersionUID.
  • Negative demo:
private static final long serialVersionUID = -8713837118340960775L;
  • Positive demo:
private static final long serialVersionUID = 1L;
  1. Redundant strings should be extracted as constants
  • Negative demo:
public static RestResponse success(Object data) {
    RestResponse resp = new RestResponse();
    resp.put("status", "success");
    resp.put("code", ResponseCode.CODE_SUCCESS);
    resp.put("data", data);
    return resp;
}
public static RestResponse error() {
    RestResponse resp = new RestResponse();
    resp.put("status", "error");
    resp.put("code", ResponseCode.CODE_FAIL);
    resp.put("data", null);
    return resp;
}
  • Positive demo:

Strings are extracted as constant references.

public static final String STATUS_SUCCESS = "success";
public static final String STATUS_FAIL = "error";
public static final String STATUS = "status";
public static final String CODE = "code";
public static final String DATA = "data";
public static RestResponse success(Object data) {
    RestResponse resp = new RestResponse();
    resp.put(STATUS, STATUS_SUCCESS);
    resp.put(CODE, ResponseCode.CODE_SUCCESS);
    resp.put(DATA, data);
    return resp;
}
public static RestResponse error() {
    RestResponse resp = new RestResponse();
    resp.put(STATUS, STATUS_FAIL);
    resp.put(CODE, ResponseCode.CODE_FAIL);
    resp.put(DATA, null);
    return resp;
}
  1. Variables that have not been reassigned must also be declared as final types.

3.3 Methods Rule

  1. Sort the methods in the class in the order of public, protected, and private
  2. When there are restrictions on the method, the parameters and returned values of the method need to be annotated with @Nonnull or @Nullable annotations and constraints.
    For example, if the parameter cannot be null, it is best to add a @Nonnull annotation. If the returned value can be null, the @Nullable annotation should be added first.
    Note: that the package name is javax.validation.requirements
  3. If there are too many lines of code in the method, please have a try on using multiple sub methods at appropriate points to segment the method body.Generally speaking, it needs to adhere to the following principles:
  • Convenient testing
  • Good semantics
  • Easy to read
  1. In addition, it is also necessary to consider whether the splitting is reasonable in terms of components, logic, abstraction, and other aspects in the scenario.

However, there is currently no clear definition of demo. During the evolution process, we will provide additional examples for developers to have a clearer reference and understanding.

3.4 Collection Rule

  1. Forcollectionreturned values, unless there are specialconcurrent(such as thread safety), always return theinterface, such as:
  • returns List if use ArrayList
  • returns Map if use HashMap
  • returns Set if use HashSet
  1. If there are multiple threads, the following declaration or returned types can be used:
private CurrentHashMap map;
public CurrentHashMap funName();

3.5 Concurrent Processing

  1. The thread pool needs to be managed, using a unified entry point to obtain the thread pool.
    Note: During the evolution process, we will provide additional examples for developers to have a clearer reference and understanding.
  2. Thread pool needs to be resource constrained to prevent resource leakage caused by improper handling

3.6 Control/Condition Statements

  1. Avoid unreasonablecondition/controlbranches order leads to:
  • Multiple code line depths of n+1
  • Redundant lines

Generally speaking, if a method’s code line depth exceeds 2+ Tabs due to continuous nested if... else.., it should be considered to try

  • merging branches,
  • inverting branch conditions
  • extracting private methods

to reduce code line depth and improve readability like follows:

  • Union or merge the logic into the next level calling
  • Negative demo:
if (isInsert) {
  save(platform);
} else {
   updateById(platform);
}
  • Positive demo:
saveOrUpdate(platform);
  • Merge the conditions
  • Negative demo:
if (expression1) {
  if(expression2) {
      ......
  }
}
  • Positive demo:
if (expression1 && expression2) {
......
}
  • Reverse the condition
  • Negative demo:
public void doSomething() {
 // Ignored more deeper block lines
 // .....
 if (condition1) {
    ...
 } else {
    ...
 }
}
  • Positive demo:
public void doSomething() {
 // Ignored more deeper block lines
 // .....
 if (!condition1) {
    ...
    return;
 }
 // ...
}
  • Using a single variable or method to reduce the complex conditional expression
  • Negative demo:
if (dbType.indexOf("sqlserver") >= 0 || dbType.indexOf("sql server") >= 0) {
 ...
}
  • Positive demo:
if (containsSqlServer(dbType)) {
  ....
}
//.....
// definition of the containsSqlServer


Using sonarlint and better highlights to check code depth looks like good in the future.

3.7 Code Comments Rule

  1. Method lacks comments:
  • When: When can the method be called
  • How: How to use this method and how to pass parameters, etc.
  • What: What functions does this method achieve
  • Note: What should developers pay attention to when calling this method
  1. Missing necessary class header description comments.
    Add What, Note, etc. like mentioned in the 1.
  2. The method declaration in the interface must be annotated.
  • If the semantics of the implementation and the annotation content at the interface declaration are inconsistent, the specific implementation method also needs to be rewritten with annotations.
  • If the semantics of the method implementation are consistent with the annotation content at the interface declaration, it is not recommended to write annotations to avoid duplicate annotations.

3.8 Java Lambdas

  1. Prefernon-capturinglambdas (lambdas that do not contain references to the outer scope).Capturing lambdas need to create a new object instance for every call.Non-capturinglambdas can use the same instance for each invocation.
  • Negative demo:
map.computeIfAbsent(key, x -> key.toLowerCase())
  • Positive demo:
map.computeIfAbsent(key, k -> k.toLowerCase());
  1. Consider method references instead of inline lambdas
  • Negative demo:
map.computeIfAbsent(key, k-> Loader.load(k));

Positive demo:

map.computeIfAbsent(key, Loader::load);

3.9 Java Streams

  • Avoid Java Streams in any performance critical code.
  • The main motivation to use Java Streams would be to improve code readability. As such, they can be a good match in parts of the code that are not data-intensive, but deal with coordination.
  • Even in the latter case, try to limit the scope to a method, or a few private methods within an internal class.

3.10 Pre-Conditions Checking

  1. Use a unified Utils.requireXXX to complete the validation of the prerequisite, and if possible, replace the AlertXXException.throwIfXXX by new pre-conditions checking.

4 Exception Processing

This streampark-console-service module is the core module for processing user requests.

It’s very necessary to strive to provide the best user experience.

So, we introduced the AbstractApiException

and its subclasses to get more friendly interaction effect. Non-AbstractApiException is treated as internal server errors correspondingly, which needn’t notify the interaction details to users.

Based on the above premise, we need to pay attention to the handling of AbstractApiException.

For example, we should throw an exception by one of followed subclasses of AbstractApiException when processing logic with the user operation errors or missing data errors:

An exception message that needs to be notified to front-end, is a detailed exception message, such as the stackTrace info, often accompanied by a large number of exception logs,

e.g: Failed to start job, need to display the exception(stackTrace info) to front-end.

An exception message that needs to be notified to front-end, usually a simple, clear message, e.g:

  1. Username already exists
  2. No permission, please contact the administrator

An exception message that needs to be notified to front-end when processing alert logic.

  • Or others exceptions used to get fine users interaction.

In addition to handling the classification of exceptions, we’d better make the precise and concise exception message and try to ensure the follows in the exception:

  • Display the current status of the abnormal case.
  • Display the solutions to the abnormal case.
  • Or others informations fit the pretty interaction.

Please click Issue-2325 for more details about the items if needed.

5 Log

  1. Useplaceholdersfor log output:
  • Negative demo
log.info("Deploy cluster request " + deployRequest);


Positive demo

log.info("load plugin:{} to {}", file.getName(), appPlugins);


  1. Pay attention to the selection oflog levelwhen printing logsWhen printing the log content, if the actual parameters of the log placeholder are passed, it is necessary to avoid premature evaluation to avoid unnecessary evaluation caused by the log level.
  • Negative demo:
    Assuming the current log level is INFO:
// ignored declaration lines.
List<User> userList = getUsersByBatch(1000);
LOG.debug("All users: {}", getAllUserIds(userList));


  • Positive demo:
    In this case, we should determine the log level in advance before making actual log calls as follows:
// ignored declaration lines.
List<User> userList = getUsersByBatch(1000);
if (LOG.isDebugEnabled()) {
  LOG.debug("All ids of users: {}", getAllIDsOfUsers(userList));    
}

6 Testing

  1. For some of the code/variables used for testing, you can use @VisableForTesting annotation to indicate that
  2. It’s recommended to use JUnit5 to develop test case preparation
  3. Using AssertJ to develop assertions statements.
  4. About the implementation of tests.
  • If the test case only tests an independent class or method that does not require external components such as hadoop,
    remote flink session cluster, etc., it can be written directly using JUnit5 & Mockito.
  • If the test case needs a real database, environment or backend environment,
    but doesn’t need to interact with external components, it’s recommended to inherit directly from SpringUnitTestBase.
  • If the test case requires a real database, environment or backend environment,
    but needs to interact with external components (Remote Flink session cluster, Hadoop cluster),
    it’s recommended to write the test case by directly inheriting SpringIntegrationTestBase.
  1. It’s only recommended to use integration tests on critical test links to avoid making the CI overhead time too long and the resource load too heavy.

References

相关实践学习
基于Hologres轻松玩转一站式实时仓库
本场景介绍如何利用阿里云MaxCompute、实时计算Flink和交互式分析服务Hologres开发离线、实时数据融合分析的数据大屏应用。
Linux入门到精通
本套课程是从入门开始的Linux学习课程,适合初学者阅读。由浅入深案例丰富,通俗易懂。主要涉及基础的系统操作以及工作中常用的各种服务软件的应用、部署和优化。即使是零基础的学员,只要能够坚持把所有章节都学完,也一定会受益匪浅。
相关文章
|
7月前
|
Shell
Flume【问题记录 01】【at org.apache.flume.node.Application.main(Application.java:xxx) 类问题整理+其他类型问题总结】【避坑指南】
【2月更文挑战第17天】Flume【问题记录 01】【at org.apache.flume.node.Application.main(Application.java:xxx) 类问题整理+其他类型问题总结】【避坑指南】
346 2
|
资源调度 前端开发 开发工具
apache-incubator-streampark源码编译本地运行
apache-incubator-streampark源码编译本地运行
151 0
|
6月前
|
测试技术 Apache
使用 Apache JMeter Flexible File Writer 插件的详细指南
Apache JMeter 是开源性能测试工具,用于负载测试。Flexible File Writer 是一个插件,用于自定义格式记录测试结果。安装该插件需通过 JMeter 的 Plugins Manager。配置时,添加监听器到测试计划,设置输出文件、文件格式及字段。执行测试后,结果将按指定格式写入 CSV 文件。此插件增强了数据记录的灵活性,便于分析和报告。
|
7月前
|
Java
apache-incubator-streampark源码编译本地运行(七)
apache-incubator-streampark源码编译本地运行(七)
116 1
|
7月前
|
消息中间件 Kafka API
【极数系列】ClassNotFoundException: org.apache.flink.connector.base.source.reader.RecordEmitter & 详细分析解决
【极数系列】ClassNotFoundException: org.apache.flink.connector.base.source.reader.RecordEmitter & 详细分析解决
330 0
|
Java Maven Windows
apache-incubator-streampark源码编译本地运行(三)
apache-incubator-streampark源码编译本地运行(三)
143 0
apache-incubator-streampark源码编译本地运行(三)
|
7月前
|
SQL 关系型数据库 MySQL
Apache StreamPark系列教程第二篇——项目打包和开发
Apache StreamPark系列教程第二篇——项目打包和开发
247 0
|
7月前
apache-incubator-streampark源码编译本地运行(六)
apache-incubator-streampark源码编译本地运行(六)
94 0
|
Java Scala Maven
apache-incubator-streampark源码编译本地运行(二)
apache-incubator-streampark源码编译本地运行(二)
168 0
apache-incubator-streampark源码编译本地运行(五)
apache-incubator-streampark源码编译本地运行(五)
79 0