belav / csharpier Goto Github PK

CSharpier is an opinionated code formatter for c#.

License: MIT License

C# 91.12% TypeScript 3.64% HTML 0.21% PowerShell 0.81% Dockerfile 0.11% JavaScript 0.51% Java 3.02% CSS 0.37% SCSS 0.10% Kotlin 0.11%

formatter csharp prettier dotnet

csharpier's People

Contributors

Stargazers

Watchers

Forkers

askazakov shocklateboy92 brikabrak respel dishuk13 ryanavogel faktored zachhats saborrie anibalvelarde ccapitalk nasa03 maximtmartin rmvermeulen alirezanet martinothamar jochenheckl michaelstgt dlech jojoman2 eai04191 zenkodr jjohnson338 jmaillet glmnet bwiklund onecyrus sethreno kurt-von-laven callemall devdevdany eirkje cyb0org nitingargprft ikvm kns98 strepto st3no anhgeeky bobwilmes winterweird exiled1 fredatgithub davidmarek ktos coderyixin shawnthebeachy boatwrong meenzen mr-badger gabynevada pigment-romainloisel parched vipentti aditnryn sebastieng84 cjewett lmaslanka davidkudera nixxen loraderon sapiensanatis stefannikolei sharemee jpfifer sea1jxr alanssitis dzins thomaslaich phault hashitaku p07r0457 mgratejot

csharpier's Issues

ConditionalExpression Formatting

Find some other complex examples + look into why this one isn't breaking. Possibly the ForceFlat for string interpolation is preventing it from breaking.

writeLine(hasArgs ? $"            return this.{apiType}<{effectiveType}>(\"{query.name}\", parameters, selectsBuilder);" : $"            return this.{apiType}<{effectiveType}>(\"{query.name}\", null, selectsBuilder);");

Foreach Statement Formatting

This should break somewhere

foreach(var someReallyLongNameYeah in this.SomeOtherReallyLongNameToGetStuff()) {
    break;
}

Review More Code

Start with some smaller repos
mission-control
stonemason
beacon

Some bigger ones
insite-commerce
newtonsoft.json

Etc

BaseList formatting

BaseList for a class currently formats like this. It should indent AndYetAnotherLongClassName

public class LongClassNameBreak
    : AnotherLongClassName<T>,
    AndYetAnotherLongClassName
{
    public void MethodName() { }
}

--validate - to validate formatting did not appear to lose source. This could also format twice to ensure formatting won't be flip blopping
~~--write - to actually write files to disk~~ this seems unnecessary
[directory] - to determine where to format files. Better syntax and ignoring files/directories in Beta
--check - validate no files need formatting - probably not til Beta
--parallel - format files in parallel - probably not til Beta
--quiet - don't write out info about each file unless there are issues - probably not til Beta, let's stick to just outputting information about failures for now.

Force trailing commas

See https://github.com/DotNetAnalyzers/StyleCopAnalyzers/blob/master/documentation/SA1413.md for why they should be forced.

Comma delimited attributes should break

This is how it currently breaks

[Obsolete, System.NonSerialized, NonSerialized, CLSCompliant(true
|| false
& true)]
private volatile int f2;

It should break something like this

[
    Obsolete, 
    System.NonSerialized, 
    NonSerialized, 
    CLSCompliant(true || false true),
]
private volatile int f2;

Trailing comma inserts extra line

The extra line before the brace is inserted if there is a trailing comma

            return new LeadingComment
            {
                Type = commentType,
                Comment = comment,
                AddExtraIndent = addExtraIndent,

            };

Extra Lines in comments

A side effect of #22 is that now extra lines are kept in situations where we would probably want to get rid of them.

    public 
    // random comment
    
    // other random comment
    void Method() { }

        if (
            someCondition
            // random comment

            // other random comment
            || someOtherCondition
        )
        {
            return;
        }

This feels like an edge case that probably won't come up very often.

#40 may be somewhat related

Total Files count varies

I noticed that I don't get consistent total file counts while I was testing something today.
Is it really not formatting the same number of files, or is the count of files not being updated correctly?

BinaryExpression Grouping

Take this example

var longNameForcesLineBreakYeahLongName =
    someValue == someOtherValue && x == 7;

The && is the root BinaryExpression, which is a LogicalAnd
The left and right of it are EqualsExpression.
CSharpier should group the LogicalAnd, but if it groups every binary Expression than the following doesn't format correctly.

if (
    longStatementName
    && longerStatementName
    && evenLongerStatementName
    && superLongStatementName
) { }

Which is basically a tree of BinaryExpressions where the Left is LogicalAnd BinaryExpression and the Right is IdentifierName.
This may require flattening out BinaryExpressions and conditionally grouping them based on what operation they are performing.

See prettier/src/language-js/print/binaryish.js for all the logic that goes into making this work for js

Block with just a comment doesn't indent the comment

if (someBoolean)
{
// code using value
}

The comment is leading trivia on the closing brace. Currently leading trivia ends up indented at the same level as the token it is on.
Because we are changing line breaks/indenting we can't really depend on the existing whitespace in front of the comment.
This probably needs to be handled as a special case somehow. There may be other special cases like this.

Record with PrimaryConstructorBaseType

public record R2(string x) : R1(x)
{ }

Figure out IDE support

What options are there currently for getting csharpier to run in an IDE?
Could plugins for the IDE's be developed?

If Statement with long expression

Currently when an if statement has a long expression, it formats like

if (
    !this.model.TranslatableFields.TryGetValue(
        fieldName,
        out var dictionary
    )
)
{
    return;
}

I think it should do this

if (
    !this.model.TranslatableFields.TryGetValue(
        fieldName,
        out var dictionary
    )
) {
    return;
}

if (shortStatement)
{
    return;
}

if (
    !this.model.TranslatableFields.TryGetValue(
        fieldName,
        out var dictionary
    ))
{
    return;
}

if (shortStatement)
{
    return;
}

This applies to other syntax nodes, like for loops and while loops.

PropertyDeclaration Lamda Body formatting

This may have less to do with the property, and more to do with the syntax inside of the lamda body

public Language CurrentLanguage =>
    this.currentLanguage ?? (this.currentLanguage = this.UnitOfWork.GetRepository<Language>()
        .Get(SiteContext.Current.LanguageDto.Id));

Should be something more like

public Language CurrentLanguage =>
    this.currentLanguage ?? 
        (this.currentLanguage = this.UnitOfWork.GetRepository<Language>().Get(
            SiteContext.Current.LanguageDto.Id
        ));

Code in IfDirective can't currently be formatted

Given the following IfDirective, roslyn does not parse the contents because DEBUG is not defined

#if DEBUG
        Console.WriteLine("Rosyln does not parse this because DEBUG is not defined");
#endif

The contents are just a block of text.
Can I determine some way to parse those contents and format them? Currently they are just written out as is.

Support --check

Add support for a --check option.
This can be used to validate that files are already formatted but will not make any changes to files.

Publish to dotnet tools

Publish from new branch?
Or start using branches for new work?
Or kick off publish against master and continue to work in master?

Extra Lines added on every format.

The following inserts a new line after the trailing trivia on every format.

class ClassName
{
    void Method()
    {
        var x = 1; /* no newline after this */
        var y = 2;
    }
}

Property Formatting

Some examples of property formatting that needs work

        public virtual ICollection<AdminActionPermission> ActionPermissions
        { get; set;
        } = new HashSet<AdminActionPermission>();

        public Dictionary<string, string> Properties { get; set; } = new Dictionary<string,
            string>();

        public virtual PropertyConfiguration PropertyConfiguration
        { get; set;
        }

Also review repositories for other property issues.

Canceling csharpier while it is formatting files can end up with lost code.

If you start running csharpier and ctrl-c the process in the middle it will sometimes leave a file half written, resulting in lost code and no indication that the code was lost.

Automate running tests + publishing

Run unit tests, and if they pass
Then run what is in publish.txt

Clean up some formatting.

I believe there are a couple very common syntax nodes that lack any kind of line breaks/indenting. Review a couple repos to look for some of these.

Improve SyntaxNodeComparer

It currently uses recursion + reflection and could probably be a lot more efficient.
I wrote a quick generated version that got rid of the reflection, but it ran into stack overflows. I should redo that without recursion.

Implement Formatting Options with Configuration File

Initial ideas

width
tabs vs spaces
configurable number of spaces
force braces
maybe brace style
maybe trailing commas
maybe end of line, \n vs \r\n

Read https://prettier.io/docs/en/option-philosophy.html and prettier/prettier#40 again before adding any options.

Create Initial Documentation

Things to document

How to install
CLI
How to use in IDES - maybe
What does alpha mean
What comes next

Declarations break weird at times.

Long declarations don't break, and declarations with initializers don't really break in a clean way.

    private string SomeFieldWithSadsfasdfasdfasdfuperLongNameYeahMan = "kljasdfkljasdklfkjlasdfkjaskdlfasasdfasdfasdfasdfdfksdf";

    private static readonly Regex FrontEndResourceRelativePathRegex = new Regex(
        @".*?/Themes/.*?/.*?/(.*)",
        RegexOptions.Compiled |
        RegexOptions.CultureInvariant |
        RegexOptions.IgnoreCase);

    private SomeObject SomeLongerName = new SomeObject(
        "lkjasdflkjasdfkljaskldjf",
        "klasldkfaksdfasdfkjasdklf") { Property1 = 1, Property2 = 2 };

Support for options in a config file.

When there are options available for formatting - #10 - it should be possible to specify those options in a config file. That will ensure that if csharpier is run via command line, build task, or some other way, it will use the same set of options without having to duplicate them.

UsingStatement Formatting

This may be cleaned up by InvocationExpression. Do that formatting first.

            using (var http = new HttpClient())
            using (var stream = Assembly.GetExecutingAssembly().GetManifestResourceStream(
                "GraphQLApiGenerator.MetadataQuery.gql"))
            using (var queryReader = new StreamReader(stream))
            using (var content = new ByteArrayContent(
                Encoding.UTF8.GetBytes(
                    JsonConvert.SerializeObject(
                        new { query = await queryReader.ReadToEndAsync() }))))
            {

Files with USC-2 LE BOM encoding lose it after formatting, then fail to compile.

There are several files in the commerce repo that fail to compile after formatting. After digging in, they are all originally USC-2 LE BOM encoding. After they are saved, they no longer have an encoding.

Lambda Body Method Formatting

// currently formats like
private Doc MethodWithParameters(
    SomeClass one,
    SomeClass two,
    SomeClass three) =>
    CallSomethingElseWithALongNameThatForcesABreak(one, two, three);

// maybe like this
private Doc MethodWithParameters(
    SomeClass one,
    SomeClass two,
    SomeClass three) =>
        CallSomethingElseWithALongNameThatForcesABreak(one, two, three);
// or this is how prettier does it, which I think I prefer
const doSomething = (
  jasdjklfasjkldflajskdflkasdlfjkasdf,
  jalsdfkasdlkfalksflksdf,
  kjladslkfasjfkasdfl
) => doSomethingElse();

// but also don't indent more when
private Doc ShortMethod(SomeClass one) =>
    CallSomeOtherLongMethodSoThatItBreaks(one);

// but probably always break when
private Doc MethodWithParameters(
    SomeClass one,
    SomeClass two,
    SomeClass three
) => CallSomething(one, two, three);

AllInOne formatting leads to extra lines each time it is formatted

I added a line to specifically exclude the file by name to make sure the csharpier --check validation will work on PRs
This appears to have something to do with leading or trailing trivia

Lamda Body implicity operator should break

Currently formats like

public static implicit operator string(Person p) => p.First + " " + p.Last;

But should be like

public static implicit operator string(Person p)
    => p.First + " " + p.Last;

Lamda body methods break this way already and I thought operators and methods uses the same formatting code.

Long Attributes double indent

See AttributeLists.cst

[BigLongAttributeWhatHappensHmmmmmmmmmmmmmmmmmmmmmmmmmmmmm(
        "11111111111111111111111111",
        "22222222222222222222222222")]
void LongAttribute() { }
// should be
[BigLongAttributeWhatHappensHmmmmmmmmmmmmmmmmmmmmmmmmmmmmm(
    "11111111111111111111111111",
    "22222222222222222222222222")]
void LongAttribute() { }

Look into SyntaxNode types that do not have their own print method.

The following are CSharpSyntaxNodes that do not currently have their own print method. I know some of these are handled by a parent method and cannot exist on there own, but what about the others? Do any of these need to be added to CSharpier?

~~AccessorDeclarationSyntax~~
~~AccessorListSyntax~~
~~AttributeArgumentListSyntax~~
~~AttributeArgumentSyntax~~
~~AttributeSyntax~~
~~AttributeTargetSpecifierSyntax~~
~~CatchDeclarationSyntax~~
~~CatchFilterClauseSyntax~~
~~ExplicitInterfaceSpecifierSyntax~~
~~FunctionPointerCallingConventionSyntax~~
~~FunctionPointerParameterListSyntax~~
~~FunctionPointerParameterSyntax~~
~~FunctionPointerUnmanagedCallingConventionListSyntax~~
~~FunctionPointerUnmanagedCallingConventionSyntax~~
~~InterpolationAlignmentClauseSyntax~~
~~InterpolationFormatClauseSyntax~~
~~JoinIntoClauseSyntax~~
~~OrderingSyntax~~
~~PositionalPatternClauseSyntax~~
~~PrimaryConstructorBaseTypeSyntax #39~~
~~PropertyPatternClauseSyntax~~
~~SubpatternSyntax~~
~~SwitchExpressionArmSyntax~~

I believe these are used internally in roslyn, but don't end up in the syntax tree.
BadDirectiveTriviaSyntax - happens when an unrecognized directive is encountered. Seems to be a compilation error so doesn't need formatting.
DefineDirectiveTriviaSyntax
DocumentationCommentTriviaSyntax
ElifDirectiveTriviaSyntax
ElseDirectiveTriviaSyntax
EndIfDirectiveTriviaSyntax
EndRegionDirectiveTriviaSyntax
ErrorDirectiveTriviaSyntax
IfDirectiveTriviaSyntax
LineDirectiveTriviaSyntax
LoadDirectiveTriviaSyntax
NullableDirectiveTriviaSyntax
PragmaChecksumDirectiveTriviaSyntax
PragmaWarningDirectiveTriviaSyntax
ReferenceDirectiveTriviaSyntax
RegionDirectiveTriviaSyntax
ShebangDirectiveTriviaSyntax
SkippedTokensTriviaSyntax
UndefDirectiveTriviaSyntax
WarningDirectiveTriviaSyntax

I believe all of these are used in xml doc
ConversionOperatorMemberCrefSyntax
CrefBracketedParameterListSyntax
CrefParameterListSyntax
CrefParameterSyntax
IndexerMemberCrefSyntax
NameMemberCrefSyntax
OperatorMemberCrefSyntax
QualifiedCrefSyntax
TypeCrefSyntax
XmlCDataSectionSyntax
XmlCommentSyntax
XmlCrefAttributeSyntax
XmlElementEndTagSyntax
XmlElementStartTagSyntax
XmlElementSyntax
XmlEmptyElementSyntax
XmlNameAttributeSyntax
XmlNameSyntax
XmlPrefixSyntax
XmlProcessingInstructionSyntax
XmlTextAttributeSyntax
XmlTextSyntax

Leading/trailing trivia Formatting

We lose new lines in places we may want them, and insert them in places we probably shouldn't.
We also gain extra new lines at times.

We lose the new line after foo

#define foo

namespace Namespace

We add an extra new line before #pragma each time we format - #21

namespace Namespace
{
#pragma
    public class ClassName { }
}

There are some forms of comments we want inline, something like this breaks I think.

this.DoSomething(1, /* no break */ 2);

What's up with the `CSharpier.Parser` directory?

While trying to get my development environment going and all the tests running/passing, I SyntaxNodeJsonWriterGenerator tries to write to a file in a CSharpier.Parser directory:

csharpier/Src/Worker/SyntaxNodeJsonWriterGenerator.cs

Line 30 in 8011144

    
           var fileName = directory.FullName + @"\CSharpier.Parser\SyntaxNodeJsonWriter.generated.cs";

which of course fails, because that directory doesn't exist.
I noticed that there's a version of SyntaxNodeJsonWriter.generated.cs in CSharpier directory, so I tried updating the path. However, the generated file seems to be drastically different than the one committed, and now generates a tonne of build errors.

Could you take a look at what's going on with that? I'm hoping it's a simple fix like a missing using in the generated file.

Remove some empty lines

I believe CSharpier should follow the same rules for empty lines as prettier, which is

Collapse multiple blank lines into a single line
Empty lines at the start and end of blocks (and whole files) are removed. Files always end with a single new line
All other empty lines are preserved.

Query Expression Formatting

Review formatting of longer query expressions and determine how they should break.

ForStatement Formatting

for (int i = 0, j = 0; i < length; i++,
j++)
{
    break;
}

for (var laksdflasdjfkaskdlfklasdfkljasdf = 0; laksdflasdjfkaskdlfklasdfkljasdf < laksdflasdjfkaskdlfklasdfkljasdf + 1; laksdflasdjfkaskdlfklasdfkljasdf++)
{
    break;
}

(finally) Improve formatting of InvocationExpressions

Some examples

                var extension = GetExtensionForLanguageType(
                    (FrontEndResourceLanguage)Enum.Parse(
                        typeof(FrontEndResourceLanguage),
                        this.Language));

            this.Address1 = addressFields.FirstOrDefault(
                field => field.FieldName == "Address1");

// Invocation inside IfStatement
            if (customerOrder == null || !customerOrder.FulfillmentMethod.EqualsIgnoreCase(
                FulfillmentMethod.PickUp.ToString()))

Deep recursion causes stack overflows.

This shows up when formatting https://github.com/dotnet/runtime
That repository has specific files that appear to be designed to fail on deep recursion.
For now there is an ugly hack in place to prevent getting too deep in the recursive print functions.
https://github.com/dotnet/roslyn/blob/master/src/Compilers/CSharp/Portable/Syntax/CSharpSyntaxWalker.cs has a potentially better way to guard against that.
Avoiding recursion would require a major rewrite, but may be more efficient.
This could provide some ideas for how to avoid recursion.
http://metacoding.azurewebsites.net/2016/08/16/how-to-avoid-recursion/

Format razor files

Ideally csharpier would be able to format razor files. It appears that razor files are parsed and converted to c#. See
https://github.com/davidebbo/RoslynRazorViewEngine/blob/master/RoslynRazorViewEngine/RoslynRazorViewEngine.cs#L127

How does prettier deal with html files, react files, etc?
Should the razor formatter be a prettier plugin instead? How would it parse/format the c# code in a razor file?

VariableDeclaration with multiple declarators should break

It should break like so

int x = 4,
    y = 5,
    z = 7;

WhileStatement formatting

While statements with BinaryExpressions(I think) don't format well.

while (directoryInfo.Name != "Automation" &&
directoryInfo.Parent != null)
{
    directoryInfo = directoryInfo.Parent;
}

Formatting may conflict with StyleCopAnalayzers

We need to investigate which style cop rules need to be disabled to avoid conflicts with csharpier

Improve formatting of field with generics

public static Dictionary<Type,
    string[]> PropertiesByType = new Dictionary<Type, string[]>();
// should be
public static Dictionary<Type, string[]> PropertiesByType =
    new Dictionary<Type, string[]>();

There are probably also some edge cases around generics that aren't handled yet, because this feels like a weird way to break if it were too long by itself

public static Dictionary<Type,
    string[]>

Some directives add extra new line on each format.

When printing an if directive, a new line is being added each time.
Original Code

namespace Namespace
{
#pragma
    public class ClassName { }
}

Next Format

namespace Namespace
{

#pragma
    public class ClassName { }
}

Potentially Rework Printing to simplify doc tree

The current method of printing results in deeply nested doc trees. See example below.

Could I rework (everything) so that this is simplified?
If I simplify things it will probably speed up performance, and it will make working on formatting easier.

Should print methods be passed a parent doc, which they append to?
Concat(Concat(Concat("get", "set"))) could be simplified to Concat("get", "set")
Concat(Concat("get"), Concat("set")) could be simplified to Concat("get", "set")
A potentially easier short term solution would be to have Concat examine what is being passed to it, and simplify it at that time.

Group(
    Concat(
        Concat(
            Line,
            Concat(
                "{"),
            Group(
                Concat(
                    Indent(
                        Concat(
                            Concat(
                                Line,
                                Concat(
                                    "get"),
                                Concat(
                                    ";")),
                            Concat(
                                Line,
                                Concat(
                                    "set"),
                                Concat(
                                    ";")))))),
            Line,
            Concat(
                "}")))),
null,
null))))),